Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 2985 servers and find the perfect MCPs for your needs.

Firecrawl

Empowers LLMs with advanced web scraping capabilities for content extraction, crawling, and search functionalities.

4,195

Browserbase

Enables LLMs to control cloud browsers for web interaction, data extraction, and task automation using Browserbase and Stagehand.

3,256

DevDocs

Crawls, extracts, and organizes technical documentation into an LLM-ready format, streamlining research and implementation for developers.

2,059

Mobile Next

Enables scalable mobile automation through a platform-agnostic interface for interacting with native iOS/Android applications and devices.

4,521

GPT Researcher

Conducts in-depth web and local research on any topic, generating comprehensive reports with citations.

26,452

Browser

Enables AI applications to control a user's existing browser instance.

6,325

YouTube Transcript

Retrieves transcripts and subtitles from YouTube videos, including automatically generated ones, without requiring an API key or headless browser.

7,325

Trafilatura

Extracts text and metadata from web pages and online resources, offering various output formats.

5,709

ENScan Go

Collects and aggregates domestic enterprise information from various sources to aid in reconnaissance tasks.

4,328

Crawl4AI RAG

Empowers AI agents and coding assistants with web crawling and retrieval-augmented generation (RAG) capabilities.

2,139

Steel Browser

Automates web interactions for AI agents and applications without managing infrastructure.

6,851

Bright Data

Empowers AI agents to access, discover, and extract real-time web data, bypassing restrictions and bot detection.

2,296

Chrome Browser

Exposes Chrome browser functionality to AI assistants for complex browser automation, content analysis, and semantic search.

11,212

XHS Downloader

Extracts and downloads content from XiaoHongShu (RedNote), including user posts, collections, likes, albums, and search results, while removing watermarks.

10,797

Firecrawl

Integrates powerful web scraping and content extraction capabilities into LLM clients like Cursor and Claude.

6,053

Skill Seeker

Transforms any documentation website into a production-ready Claude AI skill in minutes.

12,783

TrendRadar

Aggregates trending topics from over 35 platforms, offering intelligent filtering, automated multi-channel notifications, and AI-powered conversational analysis for deep news insights.

51,655

Playwriter

Enables AI agents to control web browsers via a Chrome extension, offering full Playwright API access with minimal context overhead.

3,368

DDGS

Aggregates search results from various web search services through a unified metasearch library.

2,450

Slackdump

Securely save or export your private and public Slack messages, threads, files, and user data locally without requiring administrator privileges.

2,537

20 results loaded • More available

Scroll for more results...