Web Scraping & Data Collection MCP 服务器
发现我们为 web scraping & data collection 精心策划的 MCP 服务器集合。浏览 1226 个服务器,找到满足您需求的完美 MCP。
Gres API
Provides a minimalist AI command server for agents and developers to snap pages, grab sites, source docs, and ask questions via a single interface.
YTComment
Enables AI systems to download and analyze YouTube video comments without requiring API keys.
AI Use Cases
Collects, analyzes, and manages AI use case data from diverse information sources.
Tavily Web Search
Enables AI models to search the web and retrieve up-to-date information using the Tavily API.
Paperscraper
Scrape publication metadata from PubMed, arXiv, medRxiv, bioRxiv, and ChemRxiv.
Analysis Alpaca
Provides a production-ready Model Context Protocol (MCP) server for comprehensive research and analysis, integrating web and academic search with intelligent content extraction and an optional interactive web interface.
ITIS
Accesses the Integrated Taxonomic Information System (ITIS) database to retrieve taxonomic data via its SOLR API.
Call For Papers
Searches and retrieves detailed information about academic conferences and events from WikiCFP.
Sonata
Orchestrates browser automation for government services, transforming bureaucratic web interfaces into programmable APIs for LLM agents.
Weather
Provides weather information, including alerts and forecasts, using the National Weather Service API.
Live Weather Alerts
Provides real-time weather alerts and detailed forecasts from the US National Weather Service via a Model Context Protocol (MCP) server.
Master Puppeteer
Orchestrates advanced browser automation tasks using Puppeteer, providing token-efficient and comprehensive web data for AI agents.
Google Search
Provides web search capabilities using Google Custom Search API and extracts content from webpages for AI model integration.
Google Search & Webpage Reader
Provides web search capabilities via Google Custom Search API and extracts content from any webpage.
MediaWiki Syntax
Provides comprehensive and up-to-date MediaWiki markup syntax documentation by dynamically fetching and consolidating information from official MediaWiki help pages for consumption by large language models.
SearxNG Search
Enables web searches using a SearxNG instance via the Model Context Protocol.
Tavily
Enables real-time web search, intelligent data extraction, structured website mapping, and systematic web crawling through Tavily's suite of tools, supporting both SSE and STDIO transport protocols.
XPath
Evaluates XPath queries on XML and HTML content, both from strings and URLs.
JNews
Provides access to current trending news headlines and detailed content for use by Large Language Models.
CleanWeb
Extracts and cleans core web content, filtering ads and converting it into a pristine Markdown format.
Scroll for more results...