Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Exports comprehensive TripIt travel data including trips, flights, and lodging into a structured JSON format via browser automation.
Integrates high-performance web search, content extraction, and site crawling capabilities using the Tavily API for real-time information retrieval.
Extracts structured requirements and metadata from job descriptions to facilitate automated candidate matching and recruitment analysis.
Extracts structured data from complex websites using a robust, three-phase Playwright automation workflow.
Empowers Claude to perform advanced web searches, crawl websites, and extract high-quality content using the Tavily AI search engine.
Performs neural, semantic web searches and content discovery using the Exa API to find highly relevant data and research.
Provides standardized architectural patterns and Pydantic models for building robust API documentation scrapers.
Integrates real-time web search and content extraction capabilities into Claude Code using the Tavily API.
Performs semantic web searches and similar content discovery using the Exa API to retrieve high-quality research data.
Conducts exhaustive, multi-threaded investigations using parallel agent teams to produce high-confidence synthesized reports from 20+ sources.
Extracts deep web content, parses PDFs, and captures screenshots using the Firecrawl API directly within Claude Code.
Extracts structured data, captures screenshots, and parses PDFs from complex websites using the Firecrawl API.
Automates deep web scraping, PDF parsing, and visual snapshots using the Firecrawl API.
Performs high-precision semantic searches and intent-based research using the Exa AI search engine.
Monitors and manages updates from blogs and RSS/Atom feeds directly through the CLI.
Retrieves YouTube transcripts, searches channels, and extracts video metadata for content analysis and automation.
Empowers Claude with multi-domain search, AI-driven answers, content extraction, and comprehensive deep research reporting.
Downloads high-quality video and audio from over 1,000 platforms with automated metadata extraction and QR code support.
Analyzes API documentation structures to identify data extraction patterns for automated scraper generation.
Performs simultaneous multi-engine searches using Gemini, Codex, and Claude to provide a consolidated, high-precision information report.
Generates high-performance, robust Python code for scraping and parsing structured API documentation from HTML.
Extracts core resources from social media and technical blogs to generate structured Markdown archives with intelligent categorization.
Accesses and retrieves research papers from the bioRxiv preprint server for literature reviews and trend analysis.
Automates complex web data extraction, page crawling, and document parsing using the Firecrawl API.
Extracts clean web content, captures screenshots, and parses PDFs using the powerful Firecrawl API.
Empowers Claude with real-time web search, content extraction, and advanced crawling capabilities using the Tavily API.
Empowers Claude with real-time web search, automated content extraction, and deep research capabilities using the Tavily API.
Provides real-time web search, content extraction, and automated site crawling capabilities powered by the Tavily API.
Generates structured JSON templates for the Obsidian Web Clipper by analyzing web page metadata and verifying CSS selectors.
Downloads videos, audio, and clean paragraph-style transcripts from YouTube and other yt-dlp supported platforms.
Scroll for more results...