Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 425 servers and find the perfect MCPs for your needs.
WebSearch
Provides web search capabilities to AI assistants via the Model Context Protocol (MCP).
Markdownify UTF-8
Converts various file types and web content into Markdown format with enhanced UTF-8 encoding support.
Google Jobs
Provides Google Jobs search capabilities via SerpAPI integration as a Model Context Protocol (MCP) server.
Web Search
Enables Google searches and web content viewing within MCP environments, with measures to avoid bot detection.
Puppeteer
Enables LLMs to interact with web pages by providing browser automation capabilities, including screenshot capture and JavaScript execution.
Screenshot
Captures screenshots of web pages and local HTML files via an MCP tool interface.
Playwright
Retrieves and interacts with web page content through Playwright, exposed via the Model Context Protocol (MCP).
Scraper.is
Integrates Scraper.is with the Model Context Protocol (MCP) for web scraping capabilities in AI assistants.
YouTube
Downloads YouTube subtitles using yt-dlp and makes them available to claude.ai via the Model Context Protocol (MCP).
PubMed Search
Enables searching and retrieving academic papers from the PubMed database.
Research
Aggregates multiple search APIs via the Model Context Protocol (MCP) for enhanced research capabilities.
Swapi
Integrates Smithery with SWAPI (the Star Wars API) to enable searching and listing Star Wars universe data.
Research Orchestration Service
Orchestrates research tasks by gathering, analyzing, and synthesizing information from multiple sources using AI to answer complex queries.
Pdf Extraction
Extracts content from PDF files, given a local file path.
Fred
Accesses and retrieves economic data series from the Federal Reserve Economic Data (FRED) API.
Puremd
Enables MCP clients to access web content in Markdown format, bypassing bot detection.
Data Gouv
Enables interaction with the Data.gouv.fr API, specifically the API Recherche Entreprises, via HTTP+SSE transport.
Youtube Research
Aggregates and returns YouTube video IDs and metadata based on user-defined search queries.
Webscan
Scans and analyzes web content, extracting information from web pages.
Cloudflare Browser Rendering
Extracts web content using Cloudflare Browser Rendering for use in LLM context.