Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 987 servers and find the perfect MCPs for your needs.
ArXiv Search
Searches and fetches scientific papers from arXiv.org based on category and number of results.
Codeforces
Accesses the Codeforces API to provide comprehensive contest, user, and submission data through a standardized MCP interface.
TheGraph
Empowers AI agents by providing access to indexed blockchain data from The Graph.
Manus
Automates browser interactions through the Model Context Protocol (MCP), enabling integration between large language models and web browsing.
Dataset
Provides datasets and statistical libraries for JSer.info, a JavaScript information website.
DDG
Provides DuckDuckGo search capabilities through the Model Context Protocol.
CoinStats
Provides access to cryptocurrency market data, portfolio tracking, and news via the CoinStats API.
Research Orchestration Service
Orchestrates research tasks by gathering, analyzing, and synthesizing information from multiple sources using AI to answer complex queries.
Pubmed Smithery
Enhances PubMed searches with features such as MeSH term lookup, publication count statistics, and PICO-based evidence search.
Pagespeed
Analyzes webpage performance using Google PageSpeed Insights, providing detailed metrics and improvement suggestions.
Scrapezy
Enables AI models to extract structured data from websites using the Model Context Protocol.
Vishu
Automates comprehensive reconnaissance, security analysis, and task orchestration, using large language models for vulnerability detection and information gathering.
Website Downloader
Downloads websites and their assets for use with Retrieval-Augmented Generation (RAG) systems.
AWorld
Provides a collection of API endpoints and examples for web scraping, deep research workflows, and service health monitoring.
Patchright Lite
Enables AI models to perform stealth browser automation using the Model Context Protocol.
HotNews
Provides real-time hot trending topics from major Chinese social platforms and news sites via the Model Context Protocol (MCP).
Undetected Chromedriver
Automates Chrome browser control while bypassing anti-bot detection mechanisms.
NHL
Accesses live NHL game data, scores, stats, and team information, and generates reports using the Model Context Protocol.
Screenshot
Captures and tiles web page screenshots into AI-friendly dimensions, optimized for the Claude Vision API.
Goodreads
Integrates with Goodreads to retrieve a user's book library.