Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1145 servers and find the perfect MCPs for your needs.
Jina Free
Retrieves webpage content by leveraging Jina AI's reader service.
Webscan
Analyzes web content, extracts information, and identifies potential issues.
SearXNG
Integrates the SearXNG API and Puppeteer to provide powerful web search and live web content processing capabilities for agentic systems.
Video Downloader
Empower intelligent agents with secure video downloading capabilities from over 1000 diverse websites.
NBA Player Stats
Provides comprehensive NBA player statistics from basketball-reference.com via a Model Context Protocol server.
Tavily Search
Enables client-side internet search functionality via an MCP server.
News Instagram
Automates the intelligent scraping, AI-processing, and publishing of Canadian news into engaging Instagram content.
Ticketmaster
Discovers events, venues, and attractions through the Ticketmaster Discovery API.
XPath
Evaluates XPath queries on XML and HTML content, both from strings and URLs.
CleanWeb
Extracts and cleans core web content, filtering ads and converting it into a pristine Markdown format.
MoEngage Documentation
Provides AI assistants with direct access to comprehensive MoEngage documentation for enhanced search and retrieval.
Browserless
Provides a comprehensive interface for Browserless.io browser automation capabilities through Model Context Protocol (MCP) tools.
LangExtract
Enables AI assistants to extract structured information from unstructured text using Google's langextract library through an optimized Model Context Protocol interface.
Social Analytics RapidAPI
Provides comprehensive social media analytics and scraping for LinkedIn, Facebook, Instagram, and web search via RapidAPI integrations.
Puppeteer
Automates browser interactions, enabling LLMs to interact with web pages, capture screenshots, and execute JavaScript.
Yahoo Finance
Provides comprehensive financial data, including real-time market information, historical prices, and economic indicators from Yahoo Finance.
Session Code
Integrates the Tavily API to provide web search capabilities through an MCP server.
Web Tools
Provides web search and intelligent research capabilities through Claude CLI integration for MCP clients.
Tavily
Enables AI systems to access and interact with real-time web information through search, extraction, mapping, and crawling tools.
Hubble
Facilitates data retrieval and analysis from Google Search and other online sources through API integration with Claude Desktop.
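As a rough illustration of what the XPath entry above describes (evaluating an XPath query against XML content held in a string), here is a minimal Python sketch. It uses the standard library's `xml.etree.ElementTree`, which supports only a limited subset of XPath. The sample document and the `evaluate_xpath` helper are hypothetical and do not reflect the listed server's actual interface.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document for illustration only.
SAMPLE_XML = """
<catalog>
  <server category="scraping"><name>XPath</name></server>
  <server category="search"><name>SearXNG</name></server>
  <server category="scraping"><name>CleanWeb</name></server>
</catalog>
"""


def evaluate_xpath(xml_text: str, query: str) -> list[str]:
    """Return the text of every element matched by a (limited) XPath query.

    Assumption: ElementTree's XPath subset is enough for the query; full
    XPath 1.0 would need a third-party library instead.
    """
    root = ET.fromstring(xml_text.strip())
    return [element.text or "" for element in root.findall(query)]


# Select the names of all servers whose category attribute is "scraping".
names = evaluate_xpath(SAMPLE_XML, "./server[@category='scraping']/name")
print(names)
```

The query is evaluated relative to the root `<catalog>` element, so it starts with `./` rather than an absolute path.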