Web Scraping & Data Collection MCP Servers

A curated collection of MCP servers for web scraping and data collection. Browse 547 servers to find the ones that fit your needs.

Tavily Web Search

Enables AI models to search the web and retrieve up-to-date information using the Tavily API.

Tavily Search

Enables client-side internet search functionality via an MCP server.

Playwright Plus Python

Automates browser interactions and data collection using Playwright within an MCP environment.

Hackernew

Provides an AI-friendly server for interacting with Hacker News data.

Tavily Extract

Extracts web page content using the Tavily API.

Tavily

Provides a remote SSE MCP server for interacting with the Tavily Search API.

Buckeye

Queries all public information across the osu.edu domain and its subdomains.

Screenshot Server

Captures screenshots of web pages and local HTML files through a simple interface.

Browser Recorder

Records browsing sessions using MCPML and Playwright.

Cybersecurity News

Retrieves the latest cybersecurity news from various websites and integrates with Claude Desktop.

Exa

Enables AI assistants like Claude to perform web searches using the Exa AI Search API.

Gralio

Provides access to more than 3 million SaaS reviews, pricing data, and more for over 30,000 software products.

Semantic Scholar

Provides comprehensive access to academic paper data, author information, and citation networks via the Semantic Scholar API.

UseScraper

Extracts content from web pages using the UseScraper API.

Browser Agent

Enables Claude Desktop to autonomously interact with web content through browser automation.

Brave Search

Integrates the Brave Search API to provide web and local search capabilities.

Tracxn

Enables AI models to access Tracxn's database of companies, investors, transactions, and market intelligence via a Model Context Protocol (MCP) server implementation.

Access

Extends Model Context Protocol servers with the ability to extract text from web pages and PDFs, and execute predefined commands.

Server Practice 2

Implements Model Context Protocol (MCP) servers for LinkedIn profile scraping and weather data retrieval.

CGV Cinema API

Provides a Python client for interacting with the CGV Cinema mobile API to access movie listings, locations, schedules, and seat maps.
