Discover Claude Skills for web scraping & data collection. Explore 17 skills and find the perfect capabilities for your AI workflows.
Installs and configures the YouTube Info MCP server to enable automated extraction of video metadata and details within Claude Code.
Exports comprehensive TripIt travel data including trips, flights, and lodging into a structured JSON format via browser automation.
Automates searching, filtering, and downloading torrents from RuTracker using Playwright and aria2c.
Aggregates and preprocesses high-quality AI programming tips and best practices from major developer platforms.
Converts web URLs into clean, raw markdown to minimize context window usage and eliminate HTML noise.
Conducts systematic web research by planning subtasks, delegating to research agents, and synthesizing findings into comprehensive reports.
Downloads YouTube videos and audio with customizable quality and format settings directly through the terminal.
Orchestrates systematic web research through automated planning, parallel subagent delegation, and comprehensive report synthesis.
Converts interview recordings, academic PDFs, and various document formats into structured markdown for qualitative analysis.
Orchestrates the discovery, retrieval, and organization of academic literature to facilitate theoretical pattern extraction and qualitative research synthesis.
Performs concurrent, recursive web scraping to extract clean text content while maintaining URL-based directory structures and respecting domain boundaries.
Automates the installation and configuration of the Pure.md MCP server for seamless web-to-markdown conversion within Claude Code.
Converts any URL into clean, structured Markdown with YAML metadata for efficient content consumption and LLM processing.
Fetches web documentation, extracts specific topics using AI subagents, and generates structured markdown summaries.
Converts entire PDF documents into clean, structured Markdown while preserving formatting, tables, and images for seamless context loading.
Integrates Gemini AI models with real-time web content through Google Search grounding to provide verified information and automatic citations.
Extracts and correlates high-quality food images from delivery platforms and restaurant websites to populate digital catalogs and menus.
Configures the essential Brave Search MCP integration and environment settings required for automated market research workflows.
Scrapes web pages and WeChat articles to produce clean, noise-free Markdown content for processing, translation, or archival.
Conducts deep, multi-source web research and synthesis without cluttering the primary conversation context.
Enforces a rigorous, test-driven development workflow for building and maintaining web scrapers and data extraction agents.
Reads and processes RSS/Atom feeds from any blog URL with intelligent automatic feed discovery.
Simplifies the creation of standardized data collection forms in Excel for mobile survey platforms like ODK and KoBoToolbox.
Automates LinkedIn job searching and generates ATS-optimized resumes with integrated skill gap analysis and interview preparation.
Downloads videos and audio from YouTube and other platforms for offline viewing, archiving, and media editing.
Fetches and ranks WeChat articles based on research interests with seamless Obsidian integration.
Downloads high-quality videos and audio from YouTube and other platforms for offline access, archival, and content repurposing.
Scrapes Australian creative writing competitions and automatically manages them as structured GitHub issues with intelligent duplicate detection.
Automates video and audio downloads from YouTube and other platforms with customizable quality and format options.
Orchestrates parallel subagents to conduct systematic, well-documented web research and synthesize findings into comprehensive reports.