Discover Claude Skills for web scraping & data collection. Explore 17 skills and find the perfect capabilities for your AI workflows.
Extracts typed commands and sequential text inputs from screen recordings and terminal sessions using optimized OCR workflows.
Automates the classification and extraction of data from financial documents while ensuring data integrity through rigorous safety and verification protocols.
Facilitates direct access to PubMed literature and the NCBI E-utilities API for advanced biomedical research and data extraction.
Scrapes and analyzes iOS and macOS App Store data using iTunes APIs to retrieve structured JSON for apps, reviews, and ratings.
Extracts and organizes content from Threads and Instagram into structured Markdown for knowledge management tools like Obsidian and Notion.
Provides a unified interface for web scraping and content extraction across Playwright, Hyperbrowser, and Antigravity native providers.
Enables real-time web searching and high-quality programming context retrieval using the Exa neural search engine.
Downloads YouTube videos and extracts audio with optimized quality presets for easy sharing and local playback.
Executes autonomous search missions across local codebases and the web to return structured, attributed data.
Manages and troubleshoots self-hosted Firecrawl instances for high-performance web-to-markdown scraping.
Ensures uninterrupted research capabilities by delegating web searches to autonomous agents when primary search APIs fail or hit limits.
Searches global patent databases using natural language queries to discover prior art and track innovation landscapes.
Orchestrates parallel investigator agents to conduct multi-wave web research and synthesize high-confidence findings.
Streamlines web scraping, data collection, and Actor development using the official Apify JavaScript SDK and platform documentation.
Conducts comprehensive multi-source research, deep content extraction, and intelligent analysis using parallel agents and specialized patterns.
Extracts and organizes technical trading documentation and articles from mql5.com for research and training data collection.
Enhances Claude with real-time web search capabilities using Perplexity models to access current information and scientific citations.
Conducts systematic web searches with source evaluation and hypothesis tracking to synthesize high-quality external information.
Facilitates comprehensive PDF manipulation, including data extraction, document generation, and form handling using Python and CLI tools.
Automates the extraction, structuring, and organization of unstructured data into AI-ready formats from web and local sources.
Extracts and transforms unstructured information from various sources into structured, AI-interpretable formats.
Integrates Anthropic's Claude Agent SDK with You.com's HTTP MCP server to provide real-time web search and content extraction capabilities.
Extracts clean, readable content from web articles and blog posts directly into Markdown format by removing clutter and ads.
Extracts clean web content, crawls entire domains, and searches the web directly through the Firecrawl API using terminal commands.
Optimizes online research by applying structured query patterns and advanced search techniques for precise information retrieval.
Scrapes web content, maps site structures, and extracts structured data using advanced crawling and search capabilities.
Integrates the Perplexity API to conduct deep web research, capture real-time data, and generate structured reports with verifiable citations.
Performs real-time web and local searches using the Brave Search API directly via curl commands to retrieve current information and technical solutions.
Analyzes website structures and debugs web scraping issues using Chrome DevTools to improve data extraction accuracy.
Extracts clean, readable text from blog posts and articles by removing ads, navigation, and clutter.