Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 985 servers and find the perfect MCPs for your needs.
GPT Researcher
Conducts in-depth web and local research on any topic, generating comprehensive reports with citations.
Skyvern
Automates browser-based workflows using LLMs and computer vision for robust and adaptable web interactions.
YouTube Transcript
Retrieves transcripts and subtitles from YouTube videos, including automatically generated ones, without requiring an API key or headless browser.
Steel Browser
Automates web interactions for AI agents and applications without managing infrastructure.
Trafilatura
Extracts text and metadata from web pages and online resources, offering various output formats.
Firecrawl
Empowers LLMs with advanced web scraping capabilities for content extraction, crawling, and search functionalities.
ENScan Go
Collects and aggregates information on enterprises in China from various sources to aid in reconnaissance tasks.
Chrome Browser
Exposes Chrome browser functionality to AI assistants for complex browser automation, content analysis, and semantic search.
Browser
Enables AI applications to control a user's existing browser instance.
Browserbase
Enables LLMs to control cloud browsers for web interaction, data extraction, and task automation using Browserbase and Stagehand.
Superglue
Automates data integration via a stable, self-healing SDK, providing automated schema-drift detection, retries, and remappings to maintain continuous data flow without connector maintenance or rewrites.
DevDocs
Crawls, extracts, and organizes technical documentation into an LLM-ready format, streamlining research and implementation for developers.
Agent Twitter Client
Automates interactions with Twitter, including scraping data, sending tweets, and engaging with Grok AI, all without needing the official Twitter API.
Zenfeed
Empowers RSS with AI to automatically filter, summarize, and deliver important information, reducing information overload.
Mobile Next
Enables scalable mobile automation through a platform-agnostic interface for interacting with native iOS/Android applications and devices.
Crawl4AI RAG
Empowers AI agents and coding assistants with web crawling and retrieval-augmented generation (RAG) capabilities.
OpenDia
Exposes comprehensive browser functions via the Model Context Protocol, enabling external applications and AI models to programmatically interact with a web browser.
Notte
Enables the development, deployment, and scaling of web browsing agents through a single API.
Bright Data
Empowers AI agents to access, discover, and extract real-time web data, bypassing restrictions and bot detection.
Deepwiki
Fetches and converts Deepwiki content to Markdown for use in code editors and other MCP-compatible clients.