Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 2768 servers and find the perfect MCPs for your needs.

Firecrawl

Empowers LLMs with advanced web scraping capabilities for content extraction, crawling, and search functionalities.

4,195

Browserbase

Enables LLMs to control cloud browsers for web interaction, data extraction, and task automation using Browserbase and Stagehand.

3,216

DevDocs

Crawls, extracts, and organizes technical documentation into an LLM-ready format, streamlining research and implementation for developers.

2,047

Mobile Next

Enables scalable mobile automation through a platform-agnostic interface for interacting with native iOS/Android applications and devices.

4,218

GPT Researcher

Conducts in-depth web and local research on any topic, generating comprehensive reports with citations.

26,105

Browser

Enables AI applications to control a user's existing browser instance.

6,193

YouTube Transcript

Retrieves transcripts and subtitles from YouTube videos, including automatically generated ones, without requiring an API key or headless browser.

7,183

Trafilatura

Extracts text and metadata from web pages and online resources, offering various output formats.

5,612

ENScan Go

Collects and aggregates domestic enterprise information from various sources to aid in reconnaissance tasks.

4,277

Crawl4AI RAG

Empowers AI agents and coding assistants with web crawling and retrieval-augmented generation (RAG) capabilities.

2,119

Steel Browser

Automates web interactions for AI agents and applications without managing infrastructure.

6,752

Bright Data

Empowers AI agents to access, discover, and extract real-time web data, bypassing restrictions and bot detection.

2,245

Chrome Browser

Exposes Chrome browser functionality to AI assistants for complex browser automation, content analysis, and semantic search.

11,025

XHS Downloader

Extracts and downloads content from XiaoHongShu (RedNote), including user posts, collections, likes, albums, and search results, while removing watermarks.

10,583

Firecrawl

Integrates powerful web scraping and content extraction capabilities into LLM clients like Cursor and Claude.

5,901

Skill Seeker

Transforms any documentation website into a production-ready Claude AI skill in minutes.

11,454

TrendRadar

Aggregates trending topics from over 35 platforms, offering intelligent filtering, automated multi-channel notifications, and AI-powered conversational analysis for deep news insights.

49,939

Playwriter

Enables AI agents to control web browsers via a Chrome extension, offering full Playwright API access with minimal context overhead.

3,276

DDGS

Aggregates search results from various web search services through a unified metasearch library.

2,370

Slackdump

Securely save or export your private and public Slack messages, threads, files, and user data locally without requiring administrator privileges.

2,498

20 results loaded • More available

Scroll for more results...