- Clean Markdown and text output optimized for LLM consumption
- Query-based content reranking to focus on specific page sections
- Configurable timeouts and optional image metadata inclusion
- Advanced extraction depth for JavaScript-heavy and dynamic pages
- Batch processing support for up to 20 URLs in a single request
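
As a rough illustration of how these options might combine on the client side, the sketch below builds request payloads and splits a long URL list to respect the 20-URL batch limit. The function name `build_extract_requests` and every field name (`urls`, `query`, `format`, `depth`, `timeout`, `include_images`) are hypothetical, as are the `"basic"` depth value and the 30-second default timeout; none of them come from the service's documentation, so check the real API reference before relying on them.

```python
from typing import Any, Optional

# Batch limit taken from the feature list above.
MAX_URLS_PER_REQUEST = 20


def build_extract_requests(
    urls: list[str],
    query: Optional[str] = None,
    output_format: str = "markdown",
    depth: str = "basic",
    timeout_s: float = 30.0,
    include_images: bool = False,
) -> list[dict[str, Any]]:
    """Build one payload per batch of at most 20 URLs.

    All field names are hypothetical placeholders, not a documented schema.
    """
    if output_format not in ("markdown", "text"):
        raise ValueError("output_format must be 'markdown' or 'text'")
    if depth not in ("basic", "advanced"):
        raise ValueError("depth must be 'basic' or 'advanced'")
    if timeout_s <= 0:
        raise ValueError("timeout_s must be positive")

    payloads: list[dict[str, Any]] = []
    # Step through the URL list in slices of at most MAX_URLS_PER_REQUEST.
    for start in range(0, len(urls), MAX_URLS_PER_REQUEST):
        payload: dict[str, Any] = {
            "urls": urls[start:start + MAX_URLS_PER_REQUEST],
            "format": output_format,
            "depth": depth,  # "advanced" for JavaScript-heavy pages
            "timeout": timeout_s,
            "include_images": include_images,
        }
        # Only send a query when reranking is actually wanted.
        if query is not None:
            payload["query"] = query
        payloads.append(payload)
    return payloads
```

For example, passing 45 URLs yields three payloads holding 20, 20, and 5 URLs. Batching on the client keeps each request within the stated limit without the caller having to count URLs by hand.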