Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 987servers and find the perfect MCPs for your needs.

ArXiv Search icon

ArXiv Search

Searches and fetches scientific papers from arXiv.org based on category and number of results.

Codeforces icon

Codeforces

Accesses the Codeforces API to provide comprehensive contest, user, and submission data through a standardized MCP interface.

TheGraph icon

TheGraph

Empowers AI agents by providing access to indexed blockchain data from The Graph.

Manus icon

Manus

Automates browser interactions through the Model Context Protocol (MCP), enabling integration between large language models and web browsing.

Dataset icon

Dataset

Provides datasets and statistical libraries for JSer.info, a JavaScript information website.

DDG icon

DDG

Provides DuckDuckGo search capabilities through the Model Context Protocol.

CoinStats icon

CoinStats

Provides access to cryptocurrency market data, portfolio tracking, and news via the CoinStats API.

Research Orchestration Service icon

Research Orchestration Service

Orchestrates research tasks by gathering, analyzing, and synthesizing information from multiple sources using AI to answer complex queries.

Pubmed Smithery icon

Pubmed Smithery

Enhance PubMed searches with features such as MeSH term lookup, publication count statistics, and PICO-based evidence search.

Pagespeed icon

Pagespeed

Analyzes webpage performance using Google PageSpeed Insights, providing detailed metrics and improvement suggestions.

Scrapezy icon

Scrapezy

Enables AI models to extract structured data from websites using the Model Context Protocol.

Vishu icon

Vishu

Automates comprehensive reconnaissance, security analysis, and task orchestration by leveraging AI-driven Large Language Models for efficient vulnerability detection and information gathering.

Website Downloader icon

Website Downloader

Downloads websites and their assets for use with Retrieval-Augmented Generation (RAG) systems.

AWorld icon

AWorld

Provides a collection of API endpoints and examples for web scraping, deep research workflows, and service health monitoring.

Patchright Lite icon

Patchright Lite

Enables AI models to perform stealth browser automation using the Model Context Protocol.

HotNews icon

HotNews

Provides real-time hot trending topics from major Chinese social platforms and news sites via the Model Context Protocol (MCP).

Undetected Chromedriver icon

Undetected Chromedriver

Automates Chrome browser control while bypassing anti-bot detection mechanisms.

NHL icon

NHL

Access live NHL game data, scores, stats, teams, and generate reports using the Model Context Protocol.

Screenshot icon

Screenshot

Captures and tiles web page screenshots into AI-friendly dimensions, specifically optimized for Claude Vision API.

Goodreads icon

Goodreads

Integrates with Goodreads to retrieve a user's book library.

Showing 20 of 987 results

Scroll for more results...