Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 547 servers and find the perfect MCPs for your needs.

Searxng

Runs a SearXNG metasearch instance from a remote script.

BuiltWith

Queries the BuiltWith API to retrieve information about website technology stacks.

Prysm

Scrapes web content with high accuracy and flexibility for AI assistants.

Playwright

Automates web browser interactions for LLMs using structured accessibility data from Playwright.

OpenFEC

Provides access to Federal Election Commission campaign finance data through the OpenFEC API using the Model Context Protocol.

ScreenshotOne

Connects AI assistants to the ScreenshotOne API to capture website screenshots.

ElfProxy

Facilitates secure and scalable web data access for AI systems through dynamic IP rotation and AI-optimized web interaction.

Awesome

Provides a collection of Model Context Protocol (MCP) tools for analyzing cryptocurrency market data and Vietnamese stock market data.

YouTube

Fetches and extracts transcripts from YouTube videos, enabling AI/LLMs to process video content.

Marginalia

Provides an MCP interface to search the Marginalia Search engine, which focuses on non-commercial content.

UseScraper

Scrapes content from webpages, extracting text, HTML, or markdown.

Substack Fetcher

Fetches and reads articles from a specified Substack publication, formatting the content for use with AI assistants.

Highlight YouTube

Extracts transcript text from YouTube videos using various URL formats.

Web Search

Provides web search functionality as an MCP server.

Steel Puppeteer

Automates browser interactions for LLMs using Puppeteer and the Model Context Protocol.

Youtube Transcript

Retrieves transcripts from YouTube videos, providing direct access to captions and subtitles.

Playwright

Retrieves web page content with Playwright through the Model Context Protocol (MCP).

XPath

Evaluates XPath queries on XML and HTML content, both from strings and URLs.

Vertex

Jumpstarts deco.cx site development with a pre-configured template.

DuckDuckResearch

Enables web searching via DuckDuckGo, content extraction, and screenshot capture.
