01 Supports multiple authentication scenarios (forms, API keys, OAuth, 2FA)
02 Extracts diverse content types, including HTML, PDFs, tables, and JSON
03 Adheres to an ethical framework for responsible content access
04 Uses a plugin-based architecture for custom content sources
05 Offers CLI, REST API, and MCP server interfaces for flexible use
06 0 GitHub stars