Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Fetches and parses RSS/Atom feeds to automate news gathering and content monitoring directly within Claude Code.
Extracts transcripts from social media videos and scrapes websites into LLM-ready markdown format.
Bypasses anti-bot protections and extracts structured data from complex websites using high-performance Chrome TLS fingerprinting and JS rendering.
Extracts structured data from major social media platforms and websites using the Bright Data Web Scraper API.
Automates web scraping, site crawling, and structured data extraction from any URL using the Firecrawl API.
Accesses real-time search engine results from Google, Bing, and YouTube directly within Claude Code using structured JSON.
Downloads videos and audio from YouTube and other streaming platforms with customizable quality and format options.
Automates the extraction of Modao prototype pages, screenshots, and annotations into organized Markdown documentation.
Automates the classification and extraction of data from financial documents while ensuring data integrity through rigorous safety and verification protocols.
Extracts structured data from financial documents using OCR and text extraction while enforcing rigorous data safety and verification protocols.
Extracts typed commands and sequential text inputs from screen recordings and terminal sessions using optimized OCR workflows.
Extracts text commands, terminal inputs, and gameplay moves from screen recordings using optimized OCR and image preprocessing techniques.
Extracts and implements code or algorithms from images by utilizing OCR tools, image preprocessing, and systematic verification strategies.
Accesses and queries the ClinicalTrials.gov API v2 to retrieve detailed medical study data, recruitment status, and eligibility criteria for clinical research.
Facilitates direct access to PubMed literature and the NCBI E-utilities API for advanced biomedical research and data extraction.
Accesses official USPTO APIs to perform comprehensive patent and trademark searches, intellectual property analysis, and prosecution history tracking.
Extracts clean, readable text from web URLs by removing advertisements, navigation menus, and distractions.
Searches and retrieves life sciences preprints from the bioRxiv server using keywords, authors, date ranges, and categories.
Scrapes and analyzes iOS and macOS App Store data using iTunes APIs to retrieve structured JSON for apps, reviews, and ratings.
Downloads YouTube videos and extracts MP3 audio using optimized quality presets for social sharing and local storage.
Automates scraping, markdown archival, and GitHub Issue creation for AI research conversations and web content.
Conducts fast, cost-effective web research and information lookups with automated inline citations and structured data output.
Provides a unified interface for web scraping and content extraction across Playwright, Hyperbrowser, and Antigravity native providers.
Extracts and organizes content from Threads and Instagram into structured Markdown for knowledge management tools like Obsidian and Notion.
Enriches lists and CSV files with web-sourced data like contact info, company funding, and executive details.
Conducts exhaustive, multi-step web investigations and generates comprehensive reports for complex research topics.
Extracts high-fidelity, verbatim content from URLs, PDFs, and JavaScript-heavy sites using a token-efficient forked context.
Enables real-time web searching and high-quality programming context retrieval using the Exa neural search engine.
Conducts deep-dive competitive research across web, social media, and professional networks to generate actionable market intelligence.
Conducts deep market analysis and competitive intelligence by aggregating data from Y Combinator, SEC filings, and social media platforms.
Scroll for more results...