Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 3149 servers and find the perfect MCPs for your needs.

Provides access to Reddit's public API, allowing LLMs to interact with and retrieve content from Reddit.

174

ReAct Web Search

Integrates web search capabilities into AI assistant frameworks using the Exa API for real-time, markdown-formatted results.

143

TikTok

Integrates TikTok access into applications via TikNeuron.

151

Crawl4AI

Empowers local developers with efficient internet search and content retrieval for Large Language Models, saving tokens.

140

Website Downloader

Downloads entire websites, preserving structure and converting links for local use.

150

Search1API

Provides search and crawl functionality using Search1API through a Model Context Protocol (MCP) server.

172

AgentQL

Integrates AgentQL's data extraction capabilities as a Model Context Protocol server.

162

Serper Search and Scrape

Enables web search and webpage scraping capabilities using the Serper API within MCP-compatible environments.

153

Fetch

Fetches URLs and YouTube video transcripts through an MCP server.

156

Scrapeless

Enables AI assistants to perform web searches and retrieve data from Google.

161

Web3 Research

Conduct in-depth cryptocurrency research locally using multiple data sources.

148

AkShare One

Provides an interface for retrieving China stock market data based on akshare-one.

158

Datagov Israel

Enables interaction with the Israeli Government Public API (data.gov.il).

143

Bocha Search

Empowers AI applications with high-quality world knowledge from billions of web pages and diverse content sources.

158

DataForSEO

Enables Claude to interact with DataForSEO APIs and obtain SEO data through a standardized Model Context Protocol interface.

184

Pubmearch

Analyzes PubMed medical literature to provide researchers with insights into medical research trends.

149

Read Website Fast

Extracts web content quickly and converts it to clean, token-efficient Markdown for AI agents and LLM pipelines.

145

Content Core

Extracts, cleans, and summarizes content from various media sources using AI-powered processing.

148

Comet

Connects Claude Code to Perplexity Comet browser, enabling agentic web browsing, deep research, and real-time task monitoring.

152

OnionClaw

Empower AI agents with complete access to the Tor network and dark web data through a zero-configuration OpenClaw skill or standalone application.

158

20 results loaded • More available

Scroll for more results...