Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Explore 990 servers and find the right MCPs for your needs.
Webscan
Scans and analyzes web content, extracting information from web pages.
Puppeteer Real Browser
Provides AI assistants with detection-resistant browser automation capabilities using puppeteer-real-browser.
Read Website Fast
Extracts web content quickly and converts it to clean, token-efficient Markdown for AI agents and LLM pipelines.
Markdownify UTF-8
Converts various file types and web content into Markdown format with enhanced UTF-8 encoding support.
Internetdata
Enables TypeScript projects to retrieve and interact with internet data through a structured API.
Docy
Provides LLMs with access to documentation websites by scraping them with crawl4ai, enabling real-time documentation access for AI tools.
Opengov
Enables Claude Desktop to access and analyze open datasets from Socrata-powered government data portals.
Google Patents
Enables searching Google Patents information via the SerpApi Google Patents API.
Scrapi
Scrapes web pages using the ScrAPI service via a Model Context Protocol (MCP) server.
OpenRegister
Accesses the German company register via the OpenRegister API to search and retrieve company information.
Data Gouv
Enables interaction with the Data.gouv.fr API, specifically the API Recherche Entreprises, via HTTP+SSE transport.
Mult Fetch
Fetches web content in multiple formats, supporting browser and Node.js environments with intelligent proxy detection.
NetworksDB
Enables natural-language queries for IP addresses, organizations, ASNs, and DNS records through the NetworksDB API.
MCP Servers
Centralizes configurations and scripts for Model Context Protocol servers to integrate external tools with language models.
DataForSEO
Connects to the DataForSEO API via Model Context Protocol to retrieve SEO and marketing data.
Akshare
Converts Akshare data interfaces into a standardized MCP tool format.
Inbound
Generates leads for inbound sales efforts by scraping and enriching data from multiple sources.
Cloudflare Browser Rendering
Extracts web content using Cloudflare Browser Rendering for use in LLM context.
Wikipedia Image Crawler
Searches and retrieves images from Wikimedia Commons, ensuring adherence to Creative Commons licenses.
Chrome Debug
Facilitates Chrome browser automation through its debugging protocol, maintaining persistent login sessions for various tasks.