Discover Claude skills in the Web Scraping & Data Collection category. Browse 17 skills to find the right capabilities for your AI workflow.
Extracts clean, clutter-free Markdown from web pages to optimize token usage and context relevance.
Implements robust rate limiting, exponential backoff, and idempotency patterns for resilient Firecrawl API interactions.
Integrates Exa's neural search API to perform semantic web queries and retrieve contextually relevant data for RAG pipelines.
Monitors Twitter/X for real-time AI news, trending tools, and developer insights using the Bird CLI.
Downloads and converts YouTube videos into high-quality audio files using yt-dlp and ffmpeg.
Extracts clean, plain text from EPUB, MOBI, and PDF files for analysis and data processing.
Extracts, transforms, and structures data from complex Excel files into JSON or CSV formats.
Analyzes PDF documents to extract structured data, including tables, section headers, and metadata, while providing automated summaries.
Identifies and resolves common Firecrawl integration mistakes, anti-patterns, and resource management issues during code reviews.
Analyzes AI tool URLs to extract metadata and automatically categorizes and adds them to the awesome-ai-tools repository.
Automates the end-to-end lifecycle of discovering, validating, building, and publishing Model Context Protocol (MCP) servers and automation tools.
Automates information gathering from web searches and authoritative sources to generate and save structured research reports.
Conducts enterprise-grade company research, competitive analysis, and market intelligence using professional web scraping and search tools.
Installs and configures the YouTube Info MCP server to enable automated extraction of video metadata and details within Claude Code.
Automates the installation and configuration of the Pure.md MCP server for seamless web-to-markdown conversion within Claude Code.
Automates web scraping workflows to collect and analyze job postings from major Korean and international job boards.
Scrapes Australian creative writing competitions and automatically manages them as structured GitHub issues with intelligent duplicate detection.
Conducts deep, multi-level OSINT research on people, companies, domains, and concepts using automated investigative techniques.
Automates LinkedIn profile discovery and data extraction using optimized search queries and structured result parsing.
Downloads audio and video from thousands of websites with advanced control over formats, subtitles, and metadata.
Crawls entire websites and extracts clean, structured content into Markdown files with AI-enriched metadata.
Automates the end-to-end processing and metadata curation of genome assembly datasets for VEuPathDB resources.
Automates LinkedIn job searching and generates ATS-optimized resumes with integrated skill gap analysis and interview preparation.