Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 740servers and find the perfect MCPs for your needs.
JGrants
Facilitates access to and retrieval of subsidy information from the JGrants system, operated by the Digital Agency of Japan.
Server Fetch Typescript
Retrieves and converts web content into various formats using a Model Context Protocol server.
Firecrawl
Enables web scraping, content searching, site crawling, and data extraction using the Firecrawl API.
Pulse CN
Provides AI models with real-time access to trending content and data from major Chinese internet platforms.
IMDb
Provides access to movie and TV show information from the IMDb database via the IMDb API.
QAnon
Provides a Model Context Protocol (MCP) server enabling AI analysis of QAnon posts for sociological research.
DeepRe
Generates in-depth research reports on specified topics using the Google Gemini AI API.
RSS3
Query data across decentralized chains, social media platforms, and the RSS3 network.
Firecrawl
Automates browser interactions for web scraping and data collection.
PatSnap
Collects patent-related information from PatSnap's API for trend analysis and reporting.
Browserbase
Enables LLMs to interact with web pages through cloud browser automation, data extraction, and JavaScript execution.
Perplexity
Enables web searches using the Perplexity AI API through a simple server.
Git Repo Browser
Provides a tree-like representation of a Git repository's directory structure and reads specified file contents.
BuiltWith
Queries the BuiltWith API to retrieve information about website technology stacks.
Eget Connector
Connects Claude for Desktop to a locally running eGet web scraper, enabling web content scraping directly within Claude conversations.
Puppeteer Extra
Automates browser interactions with enhanced features like stealth mode to avoid bot detection.
Chrome Integration
Enables AI models to control the Chrome browser and perform web automation tasks.
Hot News
Aggregates real-time hot topics data from various online platforms.
WebforAI Text Extractor
Extracts plain text from web pages using the WebforAI library via a Cloudflare Workers-based Model Context Protocol (MCP) server.
Jina AI Grounding
Enhances LLM responses with factual, real-time web content via Jina.ai's Grounding API.
Scroll for more results...