Web Scraping & Data Collection MCPサーバー

web scraping & data collection向けの厳選されたMCPサーバーコレクションをご覧ください。990個のサーバーを閲覧し、ニーズに最適なMCPを見つけましょう。

GPT Researcher icon

GPT Researcher

22,352

Conducts in-depth web and local research on any topic, generating comprehensive reports with citations.

Skyvern icon

Skyvern

13,831

Automates browser-based workflows using LLMs and computer vision for robust and adaptable web interactions.

XHS Downloader icon

XHS Downloader

8,149

Extracts and downloads content from XiaoHongShu (RedNote), including user posts, collections, likes, albums, and search results, while removing watermarks.

YouTube Transcript icon

YouTube Transcript

5,759

Retrieves transcripts and subtitles from YouTube videos, including automatically generated ones, without requiring an API key or headless browser.

Steel Browser icon

Steel Browser

4,831

Automates web interactions for AI agents and applications without managing infrastructure.

Trafilatura icon

Trafilatura

4,487

Extracts text and metadata from web pages and online resources, offering various output formats.

Firecrawl icon

Firecrawl

3,820

Empowers LLMs with advanced web scraping capabilities for content extraction, crawling, and search functionalities.

ENScan Go icon

ENScan Go

3,768

Collects and aggregates domestic enterprise information from various sources to aid in reconnaissance tasks.

Chrome Browser icon

Chrome Browser

3,578

Exposes Chrome browser functionality to AI assistants for complex browser automation, content analysis, and semantic search.

Browser icon

Browser

3,119

Enables AI applications to control a user's existing browser instance.

Browserbase icon

Browserbase

2,244

Enables LLMs to control cloud browsers for web interaction, data extraction, and task automation using Browserbase and Stagehand.

Superglue icon

Superglue

1,848

Automates data integration via a stable, self-healing SDK, providing automated schema-drift detection, retries, and remappings to maintain continuous data flow without connector maintenance or rewrites.

DevDocs icon

DevDocs

1,734

Crawls, extracts, and organizes technical documentation into an LLM-ready format, streamlining research and implementation for developers.

Agent Twitter Client icon

Agent Twitter Client

1,689

Automates interactions with Twitter, including scraping data, sending tweets, and engaging with Grok AI, all without needing the official Twitter API.

Mobile Next icon

Mobile Next

1,389

Enables scalable mobile automation through a platform-agnostic interface for interacting with native iOS/Android applications and devices.

Zenfeed icon

Zenfeed

1,382

Empowers RSS with AI to automatically filter, summarize, and deliver important information, reducing information overload.

Crawl4AI RAG icon

Crawl4AI RAG

1,337

Empowers AI agents and coding assistants with web crawling and retrieval-augmented generation (RAG) capabilities.

OpenDia icon

OpenDia

1,198

Exposes comprehensive browser functions via the Model Context Protocol, enabling external applications and AI models to programmatically interact with a web browser.

Notte icon

Notte

1,100

Enables the development, deployment, and scaling of web browsing agents through a single API.

Bright Data icon

Bright Data

944

Empowers AI agents to access, discover, and extract real-time web data, bypassing restrictions and bot detection.

Showing 20 of 990 results

Scroll for more results...