Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 1145servers and find the perfect MCPs for your needs.

Jina Free icon

Jina Free

Retrieves webpage content by leveraging Jina AI's reader service.

Webscan icon

Webscan

Analyzes web content, extracts information, and identifies potential issues.

SearXNG icon

SearXNG

Integrates the SearXNG API and Puppeteer to provide powerful web search and live web content processing capabilities for agentic systems.

Video Downloader icon

Video Downloader

Empower intelligent agents with secure video downloading capabilities from over 1000 diverse websites.

NBA Player Stats icon

NBA Player Stats

Provides comprehensive NBA player statistics from basketball-reference.com via a Model Context Protocol server.

Tavily Search icon

Tavily Search

Enables client-side internet search functionality via an MCP server.

News Instagram icon

News Instagram

Automates the intelligent scraping, AI-processing, and publishing of Canadian news into engaging Instagram content.

Ticketmaster icon

Ticketmaster

Discovers events, venues, and attractions through the Ticketmaster Discovery API.

XPath icon

XPath

Evaluates XPath queries on XML and HTML content, both from strings and URLs.

CleanWeb icon

CleanWeb

Extracts and cleans core web content, filtering ads and converting it into a pristine Markdown format.

MoEngage Documentation icon

MoEngage Documentation

Provides AI assistants with direct access to comprehensive MoEngage documentation for enhanced search and retrieval.

Browserless icon

Browserless

Provides a comprehensive interface for Browserless.io browser automation capabilities through Model Context Protocol (MCP) tools.

LangExtract icon

LangExtract

Enables AI assistants to extract structured information from unstructured text using Google's langextract library through an optimized Model Context Protocol interface.

Social Analytics RapidAPI icon

Social Analytics RapidAPI

Provides comprehensive social media analytics and scraping for LinkedIn, Facebook, Instagram, and web search via RapidAPI integrations.

Puppeteer icon

Puppeteer

Automates browser interactions, enabling LLMs to interact with web pages, capture screenshots, and execute JavaScript.

Yahoo Finance icon

Yahoo Finance

Provides comprehensive financial data, including real-time market information, historical prices, and economic indicators from Yahoo Finance.

Session Code icon

Session Code

Integrates the Tavily API to provide web search capabilities through an MCP server.

Web Tools icon

Web Tools

Provides web search and intelligent research capabilities through Claude CLI integration for MCP clients.

Tavily icon

Tavily

Enables AI systems to access and interact with real-time web information through search, extraction, mapping, and crawling tools.

Hubble icon

Hubble

Facilitates data retrieval and analysis from Google Search and other online sources through API integration with Claude Desktop.

Showing 20 of 1145 results

Scroll for more results...