Web Scraping & Data Collection Agent Skills

Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.

Competition Scraper & GitHub Persister

Scrapes Australian creative writing competitions and automatically manages them as structured GitHub issues with intelligent duplicate detection.

Pure.md MCP Installer

Automates the installation and configuration of the Pure.md MCP server for seamless web-to-markdown conversion within Claude Code.

YouTube Info MCP Installer

Installs and configures the YouTube Info MCP server to enable automated extraction of video metadata and details within Claude Code.

Competitive Research with Bright Data

Conducts enterprise-grade company research, competitive analysis, and market intelligence using professional web scraping and search tools.

Research Reporter

Automates information gathering from web searches and authoritative sources to generate and save structured research reports.

MCP Opportunity Pipeline

Automates the end-to-end lifecycle of discovering, validating, building, and publishing Model Context Protocol (MCP) servers and automation tools.

Add Awesome Tool

Analyzes AI tool URLs to extract metadata and automatically categorizes and adds them to the awesome-ai-tools repository.

Web Connectivity and Search

Searches the internet and converts live webpage content into markdown for real-time information retrieval and analysis.

Google Places Integration

Queries the Google Places API to retrieve business locations, venue details, and reviews directly through the CLI.

Scanner PDF Analysis

Analyzes PDF documents to extract structured data, including tables, section headers, and metadata, while providing automated summaries.

Scanner Excel Extraction

Extracts, transforms, and structures data from complex Excel files into JSON or CSV formats.

Ebook Text Extractor

Extracts clean, plain text from EPUB, MOBI, and PDF files for analysis and data processing.

YouTube Video to Audio

Downloads and converts YouTube videos into high-quality audio files using yt-dlp and ffmpeg.

Exa Semantic Web Search

Empowers Claude with semantic, neural search capabilities and specialized web filtering using the Exa API.

Hot List Data Enrichment

Automates company data enrichment for investment dashboards by fetching employee counts, job postings, and news mentions.

Blockchain Data Pipeline Validation

Validates blockchain data collection pipelines using a systematic 5-step empirical workflow to ensure data integrity and storage efficiency.

Brave Search Integration

Performs headless web searches and extracts readable markdown content using the Brave Search API without requiring a browser.

Threads Scraper

Scrapes and extracts post data from Threads profiles using automated browser navigation and authentication.

Wget URL Reader

Downloads content from any URL using the wget command-line utility.

Article Extractor

Extracts clean, readable text from web articles and blog posts by removing ads, navigation, and clutter.

Exa Semantic Search

Powers Claude Code with semantic search, similar content discovery, and structured research capabilities via the Exa API.

GitHub Harvester

Extracts and processes comprehensive data from GitHub repositories for ingestion into RAG pipelines and LLM knowledge bases.

Zhipu AI Search

Performs intelligent web searches via the Zhipu search engine with automated relative date resolution.

AI Tech Digest

Curates specialized AI technology news and technical insights using targeted search strategies and quality filtering rules.

Actress Classifier Development Guide

Streamlines the development of Python-based video classification systems with optimized scraping and incremental database management.

Web Crawler & Search Indexer

Crawls entire websites and builds searchable full-text indexes of content converted into Markdown format.

Iterative Fact Verification

Ensures rigorous factual accuracy through systematic, multi-pass evidence validation and source tiering.

Tavily Web Search

Conducts real-time, AI-optimized web searches and content extraction to provide up-to-date information beyond Claude's knowledge cutoff.

Document Processor

Processes, analyzes, and transforms various file formats into structured data or new document types using a standardized CLI.

Video Metadata Parser & Downloader

Automates video metadata extraction and media downloading by processing structured task lists through MCP services.
