Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 936 servers and find the perfect MCPs for your needs.
Jina Free
Retrieves webpage content by leveraging Jina AI's reader service.
Browserai
Enables AI agents and applications to access and extract real-time web data.
YTComment
Enables AI systems to download and analyze YouTube video comments without requiring API keys.
Perplexity
Enables web searches using the Perplexity AI API.
Job URL Analyzer
Analyzes job URLs to extract detailed company information, enriching data through intelligent web crawling and external providers.
Tavily Search
Enables client-side internet search functionality via an MCP server.
Defuddle Fetch
Retrieves and cleans web content, optimizing it for consumption by Large Language Models.
Chromedp
Provides a Model Context Protocol server enabling AI assistants to interact with web pages, manage browser instances, and perform various web automation tasks.
Space Flight News
Fetches and searches for spaceflight-related news articles using the Space Flight News API.
LinkedIn Profile Scraper
Scrapes LinkedIn profile data asynchronously using the RapidAPI LinkedIn Profile Scraper API.
Server Fetch TypeScript
Fetches and converts web content, providing functionalities ranging from raw text extraction to rendered HTML retrieval.
Query Table
Scrapes tabular data from websites like Eastmoney, Iwencai, and TDX using Playwright.
Fetch Python
Extracts and transforms web content into various formats, including raw text, rendered HTML, and Markdown.
Bocha
Enables AI agents to perform web searches via the Bocha API using the Model Context Protocol.
RandomUser
Provides enhanced access to the randomuser.me API with custom formatting, password generation, and weighted nationality distribution.
Yanyue
Fetches cigarette data from Yanyue (Yanyue.cn) using an MCP server.
YTT
Retrieves YouTube video transcripts for integration with AI assistants and other applications.
EVE Online Market
Provides a Model Context Protocol server for accessing EVE Online market data via the ESI API.
SearxNG Search
Enables web searches using a SearxNG instance via the Model Context Protocol.
NIH RePORTER
Enables conversational searching of NIH-funded research projects and publications via the NIH RePORTER API.