Web Scraping & Data Collection MCPサーバー

web scraping & data collection向けの厳選されたMCPサーバーコレクションをご覧ください。744個のサーバーを閲覧し、ニーズに最適なMCPを見つけましょう。

Perplexica icon

Perplexica

22,134

Provides an open-source AI-powered search engine alternative to Perplexity AI.

GPT Researcher icon

GPT Researcher

21,635

Conducts in-depth web and local research on any topic, generating comprehensive reports with citations.

Skyvern icon

Skyvern

13,472

Automates browser-based workflows using LLMs and computer vision for robust and adaptable web interactions.

PyMuPDF icon

PyMuPDF

7,177

Enables data extraction, analysis, conversion, and manipulation of PDF, XPS, and eBook documents in Python.

Steel Browser icon

Steel Browser

4,345

Automates web interactions for AI agents and applications without managing infrastructure.

Trafilatura icon

Trafilatura

4,322

Extracts text and metadata from web pages and online resources, offering various output formats.

YouTube Transcript icon

YouTube Transcript

3,972

Retrieves transcripts and subtitles from YouTube videos, including automatically generated ones, without requiring an API key or headless browser.

ENScan Go icon

ENScan Go

3,649

Collects and aggregates domestic enterprise information from various sources to aid in reconnaissance tasks.

Firecrawl icon

Firecrawl

3,264

Empowers LLMs with advanced web scraping capabilities for content extraction, crawling, and search functionalities.

Browserbase icon

Browserbase

1,790

Enables LLMs to control cloud browsers for web interaction, data extraction, and task automation using Browserbase and Stagehand.

Browser icon

Browser

1,772

Enables AI applications to control a user's existing browser instance.

Agent Twitter Client icon

Agent Twitter Client

1,678

Automates interactions with Twitter, including scraping data, sending tweets, and engaging with Grok AI, all without needing the official Twitter API.

DevDocs icon

DevDocs

1,584

Crawls, extracts, and organizes technical documentation into an LLM-ready format, streamlining research and implementation for developers.

Zenfeed icon

Zenfeed

949

Empowers RSS with AI to automatically filter, summarize, and deliver important information, reducing information overload.

Mobile Next icon

Mobile Next

802

Enables scalable mobile automation through a platform-agnostic interface for interacting with native iOS/Android applications and devices.

Notte icon

Notte

741

Enables the development, deployment, and scaling of web browsing agents through a single API.

Fetcher icon

Fetcher

705

Fetches web page content using a Playwright headless browser, enabling JavaScript execution and intelligent content extraction.

RedNote icon

RedNote

529

Access content from the RedNote (XiaoHongShu) platform via an MCP server.

Crawl4AI RAG icon

Crawl4AI RAG

451

Empowers AI agents and coding assistants with web crawling and retrieval-augmented generation (RAG) capabilities.

Deepwiki icon

Deepwiki

430

Fetches and converts Deepwiki content to Markdown for use in code editors and other MCP-compatible clients.

Showing 20 of 747 results

Scroll for more results...