Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 1067 servers and find the right MCPs for your needs.

FastDomainCheck

Checks the registration status of multiple domain names in bulk, reporting each domain's availability and the verification method used.

Memory Store

Provides web search capabilities using Puppeteer, delivering structured JSON results.

Go Mdurl

Converts web content to Markdown using a simple MCP server.

Reputation Checker

Validates URLs and checks their reputation to help identify AI hallucinations and verify web page authenticity.

Browser Recorder

Records browsing sessions using MCPML and Playwright.

News Scraper

Programmatically fetches news headlines and article content from Khaleej Times.

Apify

Enables AI applications and agents to access and utilize Apify Actors as external tools for data extraction, web searching, and various automation tasks.

Access

Extends Model Context Protocol servers with the ability to extract text from web pages and PDFs, and execute predefined commands.

Websets

Manages AI-powered web search collections and data using Exa's Websets API.

Youtube Transcripts

Retrieves transcripts from YouTube videos.

Google Search

Integrates Google Search capabilities into a Model Context Protocol server for AI clients and applications.

Webtools

Provides web analysis tools, including HTML extraction, Markdown conversion, screenshot capture, performance analysis, and Lighthouse audits.

Hubble

Facilitates data retrieval and analysis from Google Search and other online sources through API integration with Claude Desktop.

Food

Provides comprehensive search for food products, including pricing and nutritional data, for AI agents.

Web Search

Enables AI systems to access real-time web search capabilities.

Job Search Node

Scrapes LinkedIn job listings, performs AI-driven analysis against a candidate profile, persistently indexes relevant jobs, and offers an API for management and retrieval.

PostCrawl

Provides access to the PostCrawl API for searching and extracting content from social media platforms, particularly Reddit, optimized for AI assistants.

Model Context Protocol Servers

Provides a collection of specialized Model Context Protocol (MCP) servers for diverse use cases.

ArXiv Search

Provides search functionality for arXiv.org papers using the official arXiv API.

Youtube Transcript

Retrieves transcripts from YouTube videos, providing direct access to captions and subtitles.

Showing 20 of 1067 results
