Web Scraping & Data Collection MCP Servers
Browse a curated collection of MCP servers for web scraping and data collection. Explore 1322 servers and find the right MCP for your needs.
Browser
Automates web browsing, content extraction, and interactive operations through a Puppeteer-powered server.
OLEXI
Empowers AI chat agents to accurately search and cite Australian legal information from the AustLII database.
Link
Intelligently fetches structured web documentation and manages conversation memory within the Cursor IDE.
LinkedIn Profile Scraper
Scrapes LinkedIn profile data asynchronously using the RapidAPI LinkedIn Profile Scraper API.
Query Table
Scrapes tabular data from websites like Eastmoney, Iwencai, and TDX using Playwright.
FreeCrawl
Provides a self-hosted Model Context Protocol server for JavaScript-enabled web scraping and versatile document processing.
Remote Papers Research
Integrates with arXiv to search academic papers, extract detailed information, organize resources by topic, and generate AI-ready research prompts.
Hacker News
Provides AI agents with access to Hacker News data via the Model Context Protocol.
Technopark Job Search
Enables AI assistants to search and retrieve job listings from the Technopark job portal.
Wikipedia
Provides Wikipedia search and content retrieval tools via a production-ready Model Context Protocol (MCP) server.
Bibextract
Extracts survey content and bibliography in BibTeX format directly from arXiv papers.
Chrome Browser Assistant
Turns the Chrome browser into an AI-controlled automation tool for content analysis, semantic search, and complex web interactions.
Job URL Analyzer
Analyzes job URLs to extract detailed company information, enriching data through intelligent web crawling and external providers.
Video Downloader
Gives intelligent agents secure video downloading capabilities for over 1000 websites.
Google Search
Proxies requests to the Google Programmable Search Engine, providing caching, rate-limiting, and metrics.
Browserbase
Integrates Browserbase's headless browser capabilities with large language models (LLMs) via the Model Context Protocol (MCP).
Selenium Python
Enables large language models or external tools to automate a browser through the standardized Model Context Protocol.
SmartCity
Provides real-time data about traffic, bike-sharing, air quality, and weather in Valencia, Spain.
Georgia Tech
Provides large language models with comprehensive access to Georgia Tech's academic, research, and campus data.
Apify
Enables AI applications and agents to access and utilize Apify Actors as external tools for data extraction, web searching, and various automation tasks.