Scrapy icon

Scrapy

Scrapes websites, from basic HTTP requests to dynamic sites requiring JavaScript execution, using TypeScript.

About

Scrapy is a high-performance web scraping tool built with TypeScript, designed to handle everything from basic HTTP scraping to complex, dynamic websites that rely on JavaScript. It leverages Model Context Protocol (MCP) for enhanced functionality, providing features like simple scraping for static content, Puppeteer integration for dynamic content and browser automation, and batch scraping for processing multiple URLs simultaneously. With support for CSS selectors, proxy configuration, and customizable concurrency, Scrapy offers a robust solution for diverse web scraping needs.

Key Features

  • Handles both basic HTTP scraping and dynamic JavaScript-rendered content.
  • Offers batch scraping with customizable concurrency and retry mechanisms.
  • Includes Puppeteer integration for browser automation and screenshot capturing.
  • 0 GitHub stars
  • Supports CSS selectors for precise element extraction.
  • Provides proxy support for anonymous scraping.

Use Cases

  • Extracting data from static HTML websites using CSS selectors.
  • Scraping dynamic content from JavaScript-heavy Single Page Applications (SPAs).
  • Automating form submissions and interactions on websites.
Craft Better Prompts with AnyPrompt
Sponsored