Web Scraping & Data Collection Agent Skills

Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.

TripIt Data Exporter

Exports comprehensive TripIt travel data including trips, flights, and lodging into a structured JSON format via browser automation.

Tavily Web Search & Research

Integrates high-performance web search, content extraction, and site crawling capabilities using the Tavily API for real-time information retrieval.

Job Description Parser

Extracts structured requirements and metadata from job descriptions to facilitate automated candidate matching and recruitment analysis.

Playwright Web Scraper

Extracts structured data from complex websites using a robust, three-phase Playwright automation workflow.

Tavily Web Search

Empowers Claude to perform advanced web searches, crawl websites, and extract high-quality content using the Tavily AI search engine.

Exa Semantic Search

Performs neural, semantic web searches and content discovery using the Exa API to find highly relevant data and research.

API Scraper Architect

Provides standardized architectural patterns and Pydantic models for building robust API documentation scrapers.

Tavily Web Search & Extraction

Integrates real-time web search and content extraction capabilities into Claude Code using the Tavily API.

Exa Semantic Search

Performs semantic web searches and similar content discovery using the Exa API to retrieve high-quality research data.

Deep Research & Analysis

Conducts exhaustive, multi-threaded investigations using parallel agent teams to produce high-confidence synthesized reports from 20+ sources.

Firecrawl Scraper

Extracts deep web content, parses PDFs, and captures screenshots using the Firecrawl API directly within Claude Code.

Firecrawl Web Scraper

Extracts structured data, captures screenshots, and parses PDFs from complex websites using the Firecrawl API.

Firecrawl Scraper

Automates deep web scraping, PDF parsing, and visual snapshots using the Firecrawl API.

Exa Semantic Search

Performs high-precision semantic searches and intent-based research using the Exa AI search engine.

Blog & RSS Monitor

Monitors and manages updates from blogs and RSS/Atom feeds directly through the CLI.

YouTube Transcript & Data API

Retrieves YouTube transcripts, searches channels, and extracts video metadata for content analysis and automation.

Valyu Search & Research Toolkit

Empowers Claude with multi-domain search, AI-driven answers, content extraction, and comprehensive deep research reporting.

Media Downloader

Downloads high-quality video and audio from over 1,000 platforms with automated metadata extraction and QR code support.

Doc Site Analysis

Analyzes API documentation structures to identify data extraction patterns for automated scraper generation.

Unified Web Search

Performs simultaneous multi-engine searches using Gemini, Codex, and Claude to provide a consolidated, high-precision information report.

Scraper Code Generator

Generates high-performance, robust Python code for scraping and parsing structured API documentation from HTML.

Smart Link Extractor

Extracts core resources from social media and technical blogs to generate structured Markdown archives with intelligent categorization.

bioRxiv Preprint Search

Accesses and retrieves research papers from the bioRxiv preprint server for literature reviews and trend analysis.

Firecrawl Scraper

Automates complex web data extraction, page crawling, and document parsing using the Firecrawl API.

Firecrawl Web Scraper

Extracts clean web content, captures screenshots, and parses PDFs using the powerful Firecrawl API.

Tavily Web Search

Empowers Claude with real-time web search, content extraction, and advanced crawling capabilities using the Tavily API.

Tavily Web Search

Empowers Claude with real-time web search, automated content extraction, and deep research capabilities using the Tavily API.

Tavily Web Search & Extraction

Provides real-time web search, content extraction, and automated site crawling capabilities powered by the Tavily API.

Obsidian Web Clipper Template Creator

Generates structured JSON templates for the Obsidian Web Clipper by analyzing web page metadata and verifying CSS selectors.

Video & Transcript Downloader

Downloads videos, audio, and clean paragraph-style transcripts from YouTube and other yt-dlp supported platforms.
