Discover our curated collection of MCP servers for web scraping & data collection. Browse 1887servers and find the perfect MCPs for your needs.
Empowers LLMs with advanced web scraping capabilities for content extraction, crawling, and search functionalities.
Enables LLMs to control cloud browsers for web interaction, data extraction, and task automation using Browserbase and Stagehand.
Crawls, extracts, and organizes technical documentation into an LLM-ready format, streamlining research and implementation for developers.
Enables scalable mobile automation through a platform-agnostic interface for interacting with native iOS/Android applications and devices.
Conducts in-depth web and local research on any topic, generating comprehensive reports with citations.
Enables AI applications to control a user's existing browser instance.
Retrieves transcripts and subtitles from YouTube videos, including automatically generated ones, without requiring an API key or headless browser.
Extracts text and metadata from web pages and online resources, offering various output formats.
Collects and aggregates domestic enterprise information from various sources to aid in reconnaissance tasks.
Empowers AI agents and coding assistants with web crawling and retrieval-augmented generation (RAG) capabilities.
Automates web interactions for AI agents and applications without managing infrastructure.
Empowers AI agents to access, discover, and extract real-time web data, bypassing restrictions and bot detection.
Automates data integration via a stable, self-healing SDK, providing automated schema-drift detection, retries, and remappings to maintain continuous data flow without connector maintenance or rewrites.
Exposes Chrome browser functionality to AI assistants for complex browser automation, content analysis, and semantic search.
Extracts and downloads content from XiaoHongShu (RedNote), including user posts, collections, likes, albums, and search results, while removing watermarks.
Integrates powerful web scraping and content extraction capabilities into LLM clients like Cursor and Claude.
Transforms any documentation website into a production-ready Claude AI skill in minutes.
Aggregates trending topics from over 35 platforms, offering intelligent filtering, automated multi-channel notifications, and AI-powered conversational analysis for deep news insights.
Enables AI agents to control web browsers via a Chrome extension, offering full Playwright API access with minimal context overhead.
Aggregates search results from various web search services through a unified metasearch library.
Scroll for more results...