Web Scraping & Data Collection MCP 服务器
发现我们为 web scraping & data collection 精心策划的 MCP 服务器集合。浏览 757 个服务器,找到满足您需求的完美 MCP。
Data Analytics Server
Provides a comprehensive data analytics environment, accessible through a web interface, for processing, analyzing, and visualizing data.
Webscan
Analyzes web content, extracts information, and identifies potential issues.
Handaas Enterprise Data
Provides comprehensive enterprise information query and analysis capabilities based on the MCP protocol.
Semantic Scholar
Provides comprehensive access to academic paper data, author information, and citation networks via the Semantic Scholar API.
Screenshot Server
Captures screenshots of web pages and local HTML files through a simple interface.
Perplexity
Enables AI assistants to access and utilize the Perplexity API for search and information retrieval.
XPath
Executes XPath queries on XML and HTML content, including fetching content from URLs.
Barnsworthburning
Searches the barnsworthburning.net website via the Model Context Protocol.
Cloudbrowser
Enables browser automation and data extraction within Claude Desktop using Browserbase API.
Scrapy
Scrapes websites, from basic HTTP requests to dynamic sites requiring JavaScript execution, using TypeScript.
Pagespeed
Analyze webpage performance using Google PageSpeed Insights through a Model Context Protocol (MCP) server.
Google Search
Scrapes Google Search results and provides structured data, including titles, URLs, and snippets.
Model Context
Enables querying real-time weather information and accessing internet usage data using the Model Context Protocol.
Tavily Extract
Extracts web page content using the Tavily API.
Tavily
Enables AI systems to access and interact with real-time web information through search, extraction, mapping, and crawling tools.
Reputation Checker
Validates URLs and checks their reputation to help identify AI hallucinations and verify web page authenticity.
Dominican Congress
Provides access to information about the Dominican Congress, including legislative agendas, legislator activity, and new bills.
Agents
Automate browser interactions using natural language commands, powered by AI.
Access
Extends Model Context Protocol servers with the ability to extract text from web pages and PDFs, and execute predefined commands.
Tripmatch
Query flight and train information using the Variflight API.
Scroll for more results...