Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 987 servers and find the perfect MCPs for your needs.
Webscraper
Transcribes webpages, YouTube videos, and PDFs for use with Large Language Models.
Authless Website Scraper and Analyzer
Analyzes websites and answers questions about their content using Cloudflare's browser rendering and AI.
Slunk
Monitors and locally stores Slack messages in real time, enabling intelligent semantic search and analytics through an MCP server.
FetchSERP
Integrates a comprehensive API for SEO, SERP analysis, web scraping, and keyword research.
Safari
Enables programmatic control of the Safari browser on macOS, facilitating web automation, testing, and debugging with AI assistants.
Docs Fetch
Recursively fetches and extracts content from web pages for LLM consumption.
Manus
Automates browser interactions through the Model Context Protocol (MCP), enabling integration between large language models and web browsing.
Web Scout
Integrates web search and content extraction capabilities into an MCP environment.
Deep Research
Enables Claude and other MCP-compatible AI assistants to perform comprehensive research by integrating web and academic search functionalities.
Scientific Papers
Empowers Large Language Models with real-time access to scientific papers from arXiv and OpenAlex.
Scraper.is
Integrates Scraper.is with the Model Context Protocol (MCP) for web scraping capabilities in AI assistants.
Headline Vibes
Analyzes the sentiment of news headlines from major US publications for a given date.
Light Researcher
Orchestrates LLMs with efficient web content search and extraction capabilities, including DuckDuckGo and GitHub Code search.
AgentSource
Provides live, comprehensive company and contact data from Explorium's AgentSource platform, exclusively for use within Claude Desktop.
Wikipedia
Enables language models to search and retrieve Wikipedia articles programmatically.
PubMed Search
Enables searching and retrieving academic papers from the PubMed database.
MercadoLibre
Provides access to the MercadoLibre API for searching products, retrieving reviews and descriptions, and accessing seller reputation data.
Server Fetch
Fetches content from the internet using browser automation and multiple extraction methods.
YouTube
Establishes a Model Context Protocol server to integrate YouTube's data and functionalities directly into Claude Desktop.
DuckDuckGo Search
Enables web search and content extraction using DuckDuckGo within Model Context Protocol (MCP) environments.