Web Scraping & Data Collection Agent Skills

Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.

RSS Feed Fetcher

Fetches and parses RSS/Atom feeds to automate news gathering and content monitoring directly within Claude Code.

Supadata Video & Web Extraction

Extracts transcripts from social media videos and scrapes websites into LLM-ready markdown format.

ScrapeNinja Web Scraper

Bypasses anti-bot protections and extracts structured data from complex websites using high-performance Chrome TLS fingerprinting and JS rendering.

Bright Data Web Scraper

Extracts structured data from major social media platforms and websites using the Bright Data Web Scraper API.

Firecrawl Web Scraper

Automates web scraping, site crawling, and structured data extraction from any URL using the Firecrawl API.

SerpApi Search & Scraping

Accesses real-time search engine results from Google, Bing, and YouTube directly within Claude Code using structured JSON.

Video Downloader

Downloads videos and audio from YouTube and other streaming platforms with customizable quality and format options.

Modao Prototype Capture

Automates the extraction of Modao prototype pages, screenshots, and annotations into organized Markdown documentation.

Financial Document Processor

Automates the classification and extraction of data from financial documents while ensuring data integrity through rigorous safety and verification protocols.

Financial Document Processor

Extracts structured data from financial documents using OCR and text extraction while enforcing rigorous data safety and verification protocols.

Extract Moves from Video

Extracts typed commands and sequential text inputs from screen recordings and terminal sessions using optimized OCR workflows.

Extract Moves From Video

Extracts text commands, terminal inputs, and gameplay moves from screen recordings using optimized OCR and image preprocessing techniques.

Code From Image

Extracts and implements code or algorithms from images by utilizing OCR tools, image preprocessing, and systematic verification strategies.

ClinicalTrials.gov Database

Accesses and queries the ClinicalTrials.gov API v2 to retrieve detailed medical study data, recruitment status, and eligibility criteria for clinical research.

PubMed Database Connector

Facilitates direct access to PubMed literature and the NCBI E-utilities API for advanced biomedical research and data extraction.

USPTO Database Access

Accesses official USPTO APIs to perform comprehensive patent and trademark searches, intellectual property analysis, and prosecution history tracking.

Article Extractor

Extracts clean, readable text from web URLs by removing advertisements, navigation menus, and distractions.

BioRxiv Database

Searches and retrieves life sciences preprints from the bioRxiv server using keywords, authors, date ranges, and categories.

App Store Scraper

Scrapes and analyzes iOS and macOS App Store data using iTunes APIs to retrieve structured JSON for apps, reviews, and ratings.

YouTube Video Downloader

Downloads YouTube videos and extracts MP3 audio using optimized quality presets for social sharing and local storage.

Research Archival & GitHub Integration

Automates scraping, markdown archival, and GitHub Issue creation for AI research conversations and web content.

Parallel Web Search

Conducts fast, cost-effective web research and information lookups with automated inline citations and structured data output.

Unified Browser Abstraction

Provides a unified interface for web scraping and content extraction across Playwright, Hyperbrowser, and Antigravity native providers.

Parallel Data Enrichment

Enriches lists and CSV files with web-sourced data like contact info, company funding, and executive details.

Parallel Deep Research

Conducts exhaustive, multi-step web investigations and generates comprehensive reports for complex research topics.

Parallel Web Content Extraction

Extracts high-fidelity, verbatim content from URLs, PDFs, and JavaScript-heavy sites using a token-efficient forked context.

Exa Search CLI

Enables real-time web searching and high-quality programming context retrieval using the Exa neural search engine.

Competitor Intelligence & Analysis

Conducts deep-dive competitive research across web, social media, and professional networks to generate actionable market intelligence.

Anysite Market Research

Conducts deep market analysis and competitive intelligence by aggregating data from Y Combinator, SEC filings, and social media platforms.

30 results loaded • More available

Scroll for more results...

Web Scraping & Data Collection Agent Skills

RSS Feed Fetcher

Supadata Video & Web Extraction

ScrapeNinja Web Scraper

Bright Data Web Scraper

Firecrawl Web Scraper

SerpApi Search & Scraping

Video Downloader

Modao Prototype Capture

Financial Document Processor

Financial Document Processor

Extract Moves from Video

Extract Moves From Video

Code From Image

ClinicalTrials.gov Database

PubMed Database Connector

USPTO Database Access

Article Extractor

BioRxiv Database

App Store Scraper

YouTube Video Downloader

Research Archival & GitHub Integration

Parallel Web Search

Unified Browser Abstraction

Social Media Content Extractor

Parallel Data Enrichment

Parallel Deep Research

Parallel Web Content Extraction

Exa Search CLI

Competitor Intelligence & Analysis

Anysite Market Research

Web Scraping & Data Collection Agent Skills

RSS Feed Fetcher

Supadata Video & Web Extraction

ScrapeNinja Web Scraper

Bright Data Web Scraper

Firecrawl Web Scraper

SerpApi Search & Scraping

Video Downloader

Modao Prototype Capture

Financial Document Processor

Financial Document Processor

Extract Moves from Video

Extract Moves From Video

Code From Image

ClinicalTrials.gov Database

PubMed Database Connector

USPTO Database Access

Article Extractor

BioRxiv Database

App Store Scraper

YouTube Video Downloader

Research Archival & GitHub Integration

Parallel Web Search

Unified Browser Abstraction

Social Media Content Extractor

Parallel Data Enrichment

Parallel Deep Research

Parallel Web Content Extraction

Exa Search CLI

Competitor Intelligence & Analysis

Anysite Market Research