Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 425 servers and find the perfect MCPs for your needs.
Puppeteer
Automates browser interactions through Puppeteer for both new and existing Chrome instances.
Github Mapper
Maps and analyzes GitHub repositories, providing structural information and summary statistics directly within your code editor.
Jina AI
Provides access to Jina AI's web services through Claude, enabling web page reading, web search, and fact-checking.
Aoai Web Browsing
Automates web browser interactions via Playwright, driven by Azure OpenAI and the Model Context Protocol.
DataForSEO
Enables Large Language Models to interact with DataForSEO API functions and other SEO tools via a comprehensive stdio MCP server.
Hacker News
Facilitates fetching information from Hacker News via a Model Context Protocol (MCP) server.
Surf
Retrieves tide information for a specified location and date, aiding surfers and ocean enthusiasts.
Moz Readability Parser
Extracts and transforms webpage content into clean, LLM-optimized Markdown using Mozilla's Readability algorithm.
Npx Fetch
Fetches and transforms web content into various formats such as HTML, JSON, Markdown, and plain text.
Polymarket
Provides access to prediction market data through the PolyMarket API.
Markdown Downloader
Downloads webpages as Markdown files for easy access and integration with AI tools.
Deep Research
Generates comprehensive, well-cited research reports from research questions by elaborating on the question, finding relevant sources, and analyzing content.
Omnisearch
Provides unified access to multiple search engines, AI tools, and content processing services.
AKShare
Provides financial data analysis capabilities, accessing Chinese and global market information via the AKShare library.
OSINT Tools
Performs open-source intelligence (OSINT) tasks leveraging common network reconnaissance tools.
Rod
Automates browser interactions and provides web interaction capabilities for applications using the Rod browser automation framework.
OneSearch
Integrates web search and scraping capabilities using Searxng, Firecrawl, and Tavily.
Web Research
Enables Claude to access and utilize real-time web information for enhanced research capabilities.
Deno2 Playwright
Automates browser interactions via Playwright, enabling LLMs to interact with web pages.
LLMS.txt Explorer
Discovers and analyzes websites implementing the llms.txt standard.
Scroll for more results...