Web Scraping & Data Collection Agent Skills

Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.

Person Intelligence Analyzer

Performs deep multi-platform intelligence gathering across LinkedIn, X, Reddit, and GitHub to create actionable networking and sales reports.

Universal Deep Search

Conducts comprehensive cross-platform intelligence gathering with automated cascading research for people, companies, and topics.

Meta Searching

Overcomes access restrictions, rate limits, and validation errors to perform reliable web searches and content extraction when standard tools fail.

Search Plus

Overcomes web access restrictions and rate limits by performing federated searches and intelligent content extraction from blocked or challenging URLs.

AI-Powered PDF Text Extractor

Converts PDF documents into LLM-friendly Markdown while preserving complex structures like tables, headers, and lists.

Crawl4AI

Scrapes websites, extracts structured data, and automates web data collection pipelines using the Crawl4AI library.

AI Web Summarizer & Content Extractor

Summarizes web pages and documents into structured, type-specific Markdown reports using adaptive AI detection.

Extracting Form Fields

Extracts and structures metadata from PDF form fields into JSON format to facilitate automated document processing and form filling.

DocAI Web to Markdown

Converts any web URL into clean, structured Markdown for seamless content extraction and documentation.

Multi-Source Investigation

Conducts systematic, high-integrity research across diverse information sources with rigorous cross-validation and credibility scoring.

Primary Source Researcher

Identifies and captures a subject's authentic voice from social media, blogs, and archives for documentary music projects.

Legal Research Specialist

Extracts narrative-rich facts, quotes, and timelines from court documents and indictments for documentary and creative projects.

Video Downloader

Downloads videos and playlists from YouTube and other platforms in various resolutions and formats for offline viewing and archival.

Investigative Journalism Researcher

Conducts deep investigative research and source verification for documentary-style creative projects and journalism.

Biographical Researcher for Music

Conducts deep biographical research to extract humanizing details, quotes, and life trajectories for documentary-style music production.

Financial Research Specialist

Analyzes SEC filings, earnings calls, and market data to extract deep corporate insights and financial narratives.

Investigative Researcher

Performs journalism-grade investigative research using primary source analysis, triple-source verification, and evidence-chain mapping.

Verified Research

Conducts deep-dive technical research by verifying actual source content across GitHub repositories and web documentation.

Automated Document Hunter

Automates the systematic search, retrieval, and organization of primary source documents from free public archives using browser automation.

Official Government Source Researcher

Researches and extracts factual data from official US government agency statements, press releases, and litigation records.

Gemini Web Search

Gives Claude real-time web search through the Google Gemini CLI for access to up-to-date information and documentation.

Firecrawl Web Scraping & Automation

Automates web scraping, searching, and browser-based data extraction to provide clean, LLM-optimized markdown.

X Content Extraction and Script Generation

Extracts X (Twitter) posts and comments to organize viewpoints and generate professional narration scripts for content production.

YouTube Transcript

Extracts YouTube video transcripts, metadata, and chapters into formatted Markdown files for knowledge management systems.

YouTube Transcript Downloader

Extracts, downloads, and cleans YouTube video transcripts and captions for easy reading and analysis.

Advanced Gemini Web Search

Executes comprehensive web searches using the Gemini command to gather real-time data and detailed information.

School Calendar Data Extractor

Extracts and structures school calendar dates from PDFs and websites to automate camp and childcare planning.

Washington Legislative Tracker

Tracks and analyzes Washington State K-12 education legislation using direct committee-based discovery and automated SOAP API queries.

Web Search Optimizer

Performs intelligent web searches using a prioritized MCP strategy to find the most relevant documentation and live technical data.

YouTube Video Downloader

Downloads YouTube videos and audio with customizable quality and format settings directly through Claude Code.
