Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1067servers and find the perfect MCPs for your needs.
FastDomainCheck
Checks the registration status of multiple domain names in bulk, providing availability and verification method.
Memory Store
Provides web search capabilities using Puppeteer, delivering structured JSON results.
Go Mdurl
Converts web content to Markdown using a simple MCP server.
Reputation Checker
Validates URLs and checks their reputation to help identify AI hallucinations and verify web page authenticity.
Browser Recorder
Records browsing sessions using MCPML and Playwright.
News Scraper
Programmatically fetches news headlines and article content from Khaleej Times.
Apify
Enables AI applications and agents to access and utilize Apify Actors as external tools for data extraction, web searching, and various automation tasks.
Access
Extends Model Context Protocol servers with the ability to extract text from web pages and PDFs, and execute predefined commands.
Websets
Manages AI-powered web search collections and data using Exa's Websets API.
Youtube Transcripts
Retrieves transcripts from YouTube videos.
Google Search
Integrates Google Search capabilities into a Model Context Protocol server for AI clients and applications.
Webtools
Provides web analysis tools, including HTML extraction, markdown conversion, screenshot capture, performance analysis, and Lighthouse audits.
Hubble
Facilitates data retrieval and analysis from Google Search and other online sources through API integration with Claude Desktop.
Food
Provides comprehensive search for food products, including pricing and nutritional data, for AI agents.
Web Search
Enables AI systems to access real-time web search capabilities.
Job Search Node
Scrapes LinkedIn job listings, performs AI-driven analysis against a candidate profile, persistently indexes relevant jobs, and offers an API for management and retrieval.
PostCrawl
Provides access to the PostCrawl API for searching and extracting content from social media platforms, particularly Reddit, optimized for AI assistants.
Model Context Protocol Servers
Provides a collection of specialized Model Context Protocol (MCP) servers for diverse use cases.
ArXiv Search
Provides search functionality for arXiv.org papers using the official arXiv API.
Youtube Transcript
Retrieves transcripts from YouTube videos, providing direct access to captions and subtitles.
Scroll for more results...