Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 547 servers and find the perfect MCPs for your needs.
Searxng
Runs a Searxng meta-search instance from a remote script.
BuiltWith
Queries the BuiltWith API to retrieve information about website technology stacks.
Prysm
Scrape web content with high accuracy and flexibility for AI assistants.
Playwright
Automates web browser interactions for LLMs using structured accessibility data from Playwright.
OpenFEC
Provides access to Federal Election Commission campaign finance data through the OpenFEC API using the Model Context Protocol.
ScreenshotOne
Connect AI assistants to ScreenshotOne API to capture website screenshots.
ElfProxy
Facilitates secure and scalable web data access for AI systems through dynamic IP rotation and AI-optimized web interaction.
Awesome
Provides a collection of Model Context Protocol (MCP) tools for analyzing crypto market data and Vietnam stock market data.
YouTube
Fetches and extracts transcripts from YouTube videos, enabling AI/LLMs to process video content.
Marginalia
Provides an MCP interface to search the Marginalia Search engine, which focuses on non-commercial content.
UseScraper
Scrapes content from webpages, extracting text, HTML, or markdown.
Substack Fetcher
Fetches and reads articles from a specified Substack publication, formatting the content for use with AI assistants.
Highlight YouTube
Extracts transcript text from YouTube videos using various URL formats.
Web Search
Serves as an MCP server facilitating web search functionalities.
Steel Puppeteer
Automates browser interactions for LLMs using Puppeteer and the Model Context Protocol.
Youtube Transcript
Retrieves transcripts from YouTube videos, providing direct access to captions and subtitles.
Playwright
Provides Playwright web page content retrieval functionality using the Model Context Protocol (MCP).
XPath
Evaluates XPath queries on XML and HTML content, both from strings and URLs.
Vertex
Jumpstarts deco.cx site development with a pre-configured template.
DuckDuckResearch
Enables web searching via DuckDuckGo, content extraction, and screenshot capture.
Scroll for more results...