Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 936 servers and find the right MCPs for your needs.

GPT Researcher

22,186

Conducts in-depth web and local research on any topic, generating comprehensive reports with citations.

Skyvern

13,746

Automates browser-based workflows using LLMs and computer vision for robust and adaptable web interactions.

YouTube Transcript

5,617

Retrieves transcripts and subtitles from YouTube videos, including automatically generated ones, without requiring an API key or headless browser.

Trafilatura

4,446

Extracts text and metadata from web pages and online resources, offering various output formats.

Steel Browser

4,345

Automates web interactions for AI agents and applications without managing infrastructure.

ENScan Go

3,729

Collects and aggregates domestic enterprise information from various sources to aid in reconnaissance tasks.

Firecrawl

3,663

Gives LLMs advanced web scraping capabilities for content extraction, crawling, and search.

Browser

2,809

Enables AI applications to control a user's existing browser instance.

Browserbase

2,113

Enables LLMs to control cloud browsers for web interaction, data extraction, and task automation using Browserbase and Stagehand.

Superglue

1,785

Automates data integration via a stable, self-healing SDK. Automated schema-drift detection, retries, and remappings keep data flowing without connector maintenance or rewrites.

DevDocs

1,705

Crawls, extracts, and organizes technical documentation into an LLM-ready format, streamlining research and implementation for developers.

Agent Twitter Client

1,689

Automates interactions with Twitter, including scraping data, sending tweets, and engaging with Grok AI, all without needing the official Twitter API.

Zenfeed

1,303

Adds AI to RSS feeds to automatically filter, summarize, and deliver important information, reducing information overload.

Mobile Next

1,225

Enables scalable mobile automation through a platform-agnostic interface for interacting with native iOS/Android applications and devices.

Fetcher

748

Fetches web page content using a Playwright headless browser, enabling JavaScript execution and intelligent content extraction.

Notte

741

Enables the development, deployment, and scaling of web browsing agents through a single API.

RedNote

652

Provides access to content from the RedNote (XiaoHongShu) platform.

Tavily

538

Integrates Tavily's search and data extraction capabilities with AI assistants via the Model Context Protocol.

Bright Data

518

Empowers AI agents to access, discover, and extract real-time web data, bypassing restrictions and bot detection.

Fetch

471

Fetches and transforms web content into various formats.

Showing 20 of 936 results