Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Automates the gathering of AI industry trends, product launches, and developer insights from multiple high-signal web sources.
Extracts web page content and converts it into clean, readable Markdown for seamless AI analysis and data collection.
Extracts YouTube subtitles and generates formatted Chinese transcripts with optional translation and timestamp support.
Manages YouTube channel tracking by automating video content collection, transcript retrieval, and structured summary generation.
Extracts text and structural data from complex Microsoft Word documents containing nested tables, checkboxes, and multi-layered cell layouts.
Conducts deep-dive research into metal bands to identify far-right, NSBM, or fascist ties using a multi-phase investigative methodology.
Extracts clean source code from GitHub file URLs using the GitHub CLI to bypass web scraping noise and HTML clutter.
Automates the collection, filtering, and processing of Twitter search results into structured link lists and databases.
Empowers Claude with real-time internet research capabilities by integrating Gemini's Google Search tool directly into the terminal workflow.
Extracts content from password-protected websites and internal documentation sites by leveraging the Windows Edge browser via Chrome DevTools Protocol.
Extracts and organizes Snowflake documentation into structured Markdown format with intelligent caching and configurable spider depth.
Lists and manages configured event sources for Instagram accounts and web aggregators used in newsletter generation.
Optimizes B2B data enrichment through intelligent provider selection, waterfall logic, and credit-efficient routing.
Extracts data from JavaScript-heavy websites, authenticated pages, and complex documentation using advanced browser automation.
Powers Claude with real-time web searches, deep company research, and high-quality programming documentation retrieval using Exa AI.
Extracts high-quality, LLM-optimized web data and performs advanced crawling through a powerful CLI integration.
Extracts structured content from popular Chinese news platforms and converts it into JSON and Markdown formats.
Integrates real-time Hacker News data streams into AI agents for automated tech news monitoring and community trend analysis.
Monitors near-Earth asteroids and hazardous space objects using real-time NASA NeoWs data and integrated x402 payment processing.
Searches the web using Exa AI to provide real-time information retrieval and up-to-date data for AI coding workflows.
Extracts clean, Markdown-formatted content and metadata from any URL using the Jina Reader API for LLM consumption.
Converts entire websites into LLM-ready Markdown and structured data with advanced anti-bot bypass and JavaScript rendering.
Extracts and summarizes web content using quote-grounding and structured reporting to ensure high technical fidelity.
Downloads and converts public Google Docs, Sheets, and Slides into local formats for direct analysis and integration.
Identifies high-potential trending topics and data gaps on X and the web to surface monetization opportunities for AI agents.
Crawls and extracts website content into structured Markdown files or context-optimized chunks for AI analysis.
Integrates the Tavily API to perform live web searches and structured data retrieval for RAG-augmented workflows.
Fetches and analyzes real-time stories, comments, and user data from Hacker News using the official API.
Automates web data collection and browser tasks using pre-built Actors for popular sites like Amazon, Google, and LinkedIn.
Integrates privacy-focused web, image, video, and news search capabilities directly into Claude Code via the Brave Search API.