Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1038servers and find the perfect MCPs for your needs.
PTT
Enables MCP clients to interact with and automate operations on PTT, a popular Taiwanese bulletin board system, via the Model Context Protocol.
Aoai Web Browsing
Automates web browser interactions via Playwright, driven by Azure OpenAI and the Model Context Protocol.
Dataset Viewer
Provides access to browse, filter, analyze, and download datasets hosted on the Hugging Face Hub.
Pdf Reader
Extracts text and images from PDF files, with OCR support for scanned documents.
Binance Cryptocurrency
Enables AI agents to access real-time Binance cryptocurrency market data.
OP.GG
Connects AI agents to OP.GG data via a Model Context Protocol implementation, enabling function calling for data retrieval.
Dappier
Connects LLMs and Agentic AI to real-time, rights-cleared, proprietary data from trusted sources.
Pdf Reader
Extracts text content from PDF files, supporting both local files and URLs.
Rod
Automates browser interactions and provides web interaction capabilities for applications using the Rod browser automation framework.
YouTube Transcript
Extracts transcripts from YouTube videos, enabling content analysis and processing.
Crawl4AI
Enables web scraping and crawling capabilities for Large Language Models.
Clarity Data Export
Fetches Microsoft Clarity analytics data through a Model Context Protocol (MCP) server.
TradingView Chart Image
Fetches TradingView chart images for specified tickers and intervals.
VibeCoding
Automates web search and report generation using a LangGraph-based multi-agent system.
Dumpling AI
Integrates with Dumpling AI to provide data scraping, content processing, knowledge management, AI agent, and code execution capabilities.
Fetch
Enables fetching web content and processing images for use with Claude Desktop or other Model Context Protocol (MCP) clients.
Newsnow
Aggregates trending news and hot topics from multiple platforms via the Newsnow API, leveraging the Model Context Protocol (MCP).
JigsawStack
Enables AI models to interact with JigsawStack models through a Model Context Protocol server.
DocSearch
Crawls websites, generates Markdown documentation, and makes that documentation searchable.
JSer.info
Provides an MCP Server for accessing and searching data related to the JSer.info JavaScript information website.
Scroll for more results...