Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1067 servers and find the right MCP servers for your needs.
Master Puppeteer
Orchestrates advanced browser automation tasks using Puppeteer, providing token-efficient and comprehensive web data for AI agents.
Youtube Transcript
Retrieves transcripts from YouTube videos using the Model Context Protocol.
CleanWeb
Extracts and cleans web content, filtering ads and irrelevant elements, and converts it to a clean Markdown format for various applications.
Job Search Node
Scrapes LinkedIn job listings, performs AI-driven analysis against a candidate profile, persistently indexes relevant jobs, and offers an API for management and retrieval.
Query Table
Scrapes tabular data from websites like Eastmoney, Iwencai, and TDX using Playwright.
Marketaux
Integrates the Marketaux API to provide comprehensive news search capabilities based on various criteria.
PDFtotext
Extracts text from PDF documents using the `pdftotext` utility, designed for integration with Model Context Protocol servers.
FoFa
Provides access to FoFa API functionality through the Model Context Protocol, allowing AI assistants to query information about internet-connected devices and services.
ZoomEye
Provides AI assistants with access to the ZoomEye v2 API for querying internet-wide host and web data.