Descubre Habilidades de Claude para web scraping & data collection. Explora 17 habilidades y encuentra las capacidades perfectas para tus flujos de trabajo de IA.
Converts any website into clean, LLM-ready markdown or structured data using the Firecrawl API.
Extracts clean markdown and structured data from any website, including JavaScript-heavy single-page applications.
Automates the process of crawling, extracting, and converting documentation websites into structured Markdown or Claude Code skills.
Downloads videos, audio, and clean paragraph-style transcripts from YouTube and other yt-dlp supported platforms.
Retrieves real-time information, news, images, and videos from the web using DuckDuckGo to provide up-to-date data and resources.
Extracts and cleans transcripts from global video platforms like YouTube and Bilibili into readable text files.
Generates structured JSON templates for the Obsidian Web Clipper by analyzing web page metadata and verifying CSS selectors.
Downloads YouTube videos and audio with customizable quality and format settings using the powerful yt-dlp engine.
Extracts and cleans transcripts, subtitles, and captions from YouTube videos using yt-dlp and OpenAI Whisper for AI-powered transcription.
Provides real-time web search, content extraction, and automated site crawling capabilities powered by the Tavily API.
Empowers Claude with real-time web search, automated content extraction, and deep research capabilities using the Tavily API.
Empowers Claude with real-time web search, content extraction, and advanced crawling capabilities using the Tavily API.
Extracts clean web content, captures screenshots, and parses PDFs using the powerful Firecrawl API.
Executes multi-perspective web searches and generates structured research reports with source attribution and uncertainty analysis.
Automates complex web data extraction, page crawling, and document parsing using the Firecrawl API.
Extracts clean, readable text from web articles and blog posts by removing ads, navigation, and clutter.
Automates the extraction and parsing of monthly USDA WASDE reports into standardized datasets for agricultural market analysis.
Detects and analyzes Google Trends search spikes and all-time highs using automated browser simulation and statistical modeling.
Aggregates and summarizes real-time China macro-economic news from premium financial sources into professional magazine-style reports.
Automates searching, filtering, and downloading torrents from RuTracker using Playwright and aria2c.
Downloads high-quality video and audio from over 1,000 platforms with automated metadata extraction and QR code support.
Conducts automated community sentiment analysis and market research via Reddit to identify developer pain points and adoption trends.
Converts any URL into clean, structured Markdown with YAML metadata for efficient content consumption and LLM processing.
Monitors the CASS Freight Index to identify economic cycle turns and predict US inflation trends through logistics data analysis.
Analyzes global liquidity and risk-on sentiment by using the Rolex Market Index as a high-beta proxy for financial conditions.
Configures the essential Brave Search MCP integration and environment settings required for automated market research workflows.
Identifies and avoids common Firecrawl integration mistakes and anti-patterns to optimize web scraping efficiency.
Provides standardized architectural patterns and Pydantic models for building robust API documentation scrapers.
Conducts multi-page web research to synthesize information with full source attribution and conflict detection.
Fetches and ranks WeChat articles based on research interests with seamless Obsidian integration.
Scroll for more results...