Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1226servers and find the perfect MCPs for your needs.
A-Stock Data
Provides A-share (China stock market) data to large language models via the Model Content Protocol (MCP).
Web Research
Enables Claude to access real-time information from the web for enhanced research capabilities.
Airbnb
Searches Airbnb listings and retrieves detailed listing information.
Open Web Search
Enables web search across multiple engines without requiring API keys, supporting Bing, Baidu, DuckDuckGo, Brave, Exa, and CSDN.
Deep Research
Conducts in-depth, iterative research on any topic using AI-powered search, web scraping, and source evaluation to generate comprehensive reports.
Douyin
Extracts watermark-free video links, video captions, and audio transcriptions from Douyin (TikTok) share links.
Playwright
Enables browser automation capabilities using Playwright for LLMs to interact with web pages.
Nodemw
Provides a Node.js client for interacting with the MediaWiki API and WikiData.
Selenium
Automates browser interactions through the Model Context Protocol using Selenium WebDriver.
GPT Researcher
Enables LLM applications to perform in-depth research through the MCP protocol.
12306
Provides a high-performance backend system for querying China Railway 12306 train ticket information using the Model Context Protocol (MCP).
PDF Reader
Securely reads and extracts text, metadata, and page counts from PDF files (local or URL) for use by AI agents.
CoexistAI
Automates and simplifies diverse research workflows by integrating large language models with web search, social media, mapping, and code exploration.
Puppeteer
Automates browser interactions through Puppeteer for both new and existing Chrome instances.
Reddit Content
Fetches and analyzes content from Reddit, providing access to hot threads and post details.
G-Search
Enables parallel Google searches with multiple keywords using a Playwright-powered MCP server.
Paper Search
Searches and downloads academic papers from multiple sources like arXiv, PubMed, and bioRxiv.
MCPBench
Evaluates the performance of MCP servers for web search and database query tasks.
SearXNG
Integrates the SearXNG API to provide web search capabilities within an MCP environment.
Rag Web Browser
Enables AI agents and LLMs to interact with the web and extract information from web pages via the RAG Web Browser Actor.
Scroll for more results...