Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Scrapes Australian creative writing competitions and automatically manages them as structured GitHub issues with intelligent duplicate detection.
Automates the installation and configuration of the Pure.md MCP server for seamless web-to-markdown conversion within Claude Code.
Installs and configures the YouTube Info MCP server to enable automated extraction of video metadata and details within Claude Code.
Conducts enterprise-grade company research, competitive analysis, and market intelligence using professional web scraping and search tools.
Automates information gathering from web searches and authoritative sources to generate and save structured research reports.
Automates the end-to-end lifecycle of discovering, validating, building, and publishing Model Context Protocol (MCP) servers and automation tools.
Analyzes AI tool URLs to extract metadata and automatically categorizes and adds them to the awesome-ai-tools repository.
Searches the internet and converts live webpage content into markdown for real-time information retrieval and analysis.
Queries the Google Places API to retrieve business locations, venue details, and reviews directly through the CLI.
Analyzes PDF documents to extract structured data, including tables, section headers, and metadata, while providing automated summaries.
Extracts, transforms, and structures data from complex Excel files into JSON or CSV formats.
Extracts clean, plain text from EPUB, MOBI, and PDF files for analysis and data processing.
Downloads and converts YouTube videos into high-quality audio files using yt-dlp and ffmpeg.
Empowers Claude with semantic, neural search capabilities and specialized web filtering using the Exa API.
Automates company data enrichment for investment dashboards by fetching employee counts, job postings, and news mentions.
Validates blockchain data collection pipelines using a systematic 5-step empirical workflow to ensure data integrity and storage efficiency.
Performs web searches and extracts readable markdown content using the Brave Search API, without requiring a browser.
Scrapes and extracts post data from Threads profiles using automated browser navigation and authentication.
Fetches and downloads content from a given URL using the wget command-line utility.
Extracts clean, readable text from web articles and blog posts by removing ads, navigation, and clutter.
Powers Claude Code with semantic search, similar content discovery, and structured research capabilities via the Exa API.
Extracts and processes comprehensive data from GitHub repositories for ingestion into RAG pipelines and LLM knowledge bases.
Performs intelligent web searches via the Zhipu search engine with automated relative date resolution.
Curates specialized AI technology news and technical insights using targeted search strategies and quality filtering rules.
Streamlines the development of Python-based video classification systems with optimized scraping and incremental database management.
Crawls entire websites and builds searchable full-text indexes of content converted into Markdown format.
Ensures rigorous factual accuracy through systematic, multi-pass evidence validation and source tiering.
Conducts real-time, AI-optimized web searches and content extraction to provide up-to-date information beyond Claude's knowledge cutoff.
Processes, analyzes, and transforms various file formats into structured data or new document types using a standardized CLI.
Automates video metadata extraction and media downloading by processing structured task lists through MCP services.