Discover Claude skills in the Web Scraping & Data Collection category. Browse 17 skills to find the right capability for your AI workflow.
Conducts deep-dive technical research by verifying actual source content across GitHub repositories and web documentation.
Analyzes SEC filings, earnings calls, and market data to extract deep corporate insights and financial narratives.
Conducts deep investigative research and source verification for documentary-style creative projects and journalism.
Extracts YouTube video transcripts, metadata, and chapters into formatted Markdown files for knowledge management systems.
Automates the systematic search, retrieval, and organization of primary source documents from free public archives using browser automation.
Researches and extracts factual data from official US government agency statements, press releases, and litigation records.
Performs journalism-grade investigative research using primary source analysis, triple-source verification, and evidence-chain mapping.
Conducts deep biographical research to extract humanizing details, quotes, and life trajectories for documentary-style music production.
Executes comprehensive web searches using the Gemini command to gather real-time data and detailed information.
Empowers Claude with real-time web search capabilities using the Google Gemini CLI to access up-to-date information and documentation.
Performs intelligent web searches using a prioritized MCP strategy to find the most relevant documentation and live technical data.
Extracts clean, clutter-free Markdown from web pages to optimize AI context and reduce token usage.
Extracts Twitter posts and comments to organize viewpoints and generate professional narration scripts for content production.
Tracks and analyzes Washington State K-12 education legislation using direct committee-based discovery and automated SOAP API queries.
Extracts, downloads, and cleans YouTube video transcripts and captions for easy reading and analysis.
Extracts and structures school calendar dates from PDFs and websites to automate camp and childcare planning.
Downloads YouTube videos and audio with customizable quality and format settings directly through Claude Code.
Conducts deep, iterative web research to generate comprehensive reports with verified citations and source tracking.
Extracts specific data from JSON files efficiently to minimize token usage and improve processing speed.
Extracts and validates structured data from scientific literature collections to create analysis-ready datasets for systematic reviews and meta-analyses.
Orchestrates a multi-source image pipeline to download, validate, and normalize fighter photos from Wikimedia, Sherdog, and Bing.
Orchestrates the extraction, validation, and database loading of comprehensive fighter data from UFCStats.com using Scrapy spiders.
Converts batches of images and scanned documents into structured markdown files using local DeepSeek-OCR models via Ollama.
Retrieves and manages historical visual snapshots of websites using the Internet Archive's Wayback Machine.
Archives URLs to the Internet Archive's Wayback Machine for permanent digital preservation and snapshot tracking.
Deploys a local, privacy-respecting metasearch engine to aggregate web, package repository, and code results in structured JSON.
Retrieves the earliest archived snapshot of any URL from the Wayback Machine to identify a website's original version.
Retrieves comprehensive GitHub user and organization profile data including repository counts, follower statistics, and account metadata.
Retrieves and calculates the full historical archive span for any URL using the Wayback Machine.
Locates and retrieves the most recent archived version of any URL from the Internet Archive's Wayback Machine.