01 10 GitHub stars
02 Automatic robots.txt compliance and rate limiting
03 Multiple output formats including HTML, summaries, and screenshots
04 Stealth proxy mode to bypass anti-bot protections
05 Clean markdown conversion for LLM processing
06 JavaScript rendering for dynamic content and SPAs
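
The list above does not say how robots.txt compliance and rate limiting are implemented. As a rough illustration of the general idea only, here is a minimal sketch using Python's standard `urllib.robotparser`. The class name `PoliteFetchPolicy`, its methods, the `example-bot` user agent, and the one-second default interval are all assumptions made for this example, not part of the tool's API.

```python
from urllib.robotparser import RobotFileParser


class PoliteFetchPolicy:
    """Hypothetical helper: decides whether and when a URL may be fetched."""

    def __init__(self, robots_txt: str, user_agent: str, min_interval: float = 1.0):
        self.user_agent = user_agent
        self.parser = RobotFileParser()
        # Parse robots.txt content that the caller has already downloaded.
        self.parser.parse(robots_txt.splitlines())
        # Prefer the site's Crawl-delay when one applies to this agent;
        # otherwise fall back to the assumed default interval.
        delay = self.parser.crawl_delay(user_agent)
        self.min_interval = float(delay) if delay is not None else min_interval
        self._last_request = None

    def allowed(self, url: str) -> bool:
        """Return True if robots.txt permits this agent to fetch the URL."""
        return self.parser.can_fetch(self.user_agent, url)

    def wait_time(self, now: float) -> float:
        """Seconds to wait before the next request, given the current time."""
        if self._last_request is None:
            return 0.0
        return max(0.0, self.min_interval - (now - self._last_request))

    def record_request(self, now: float) -> None:
        """Remember when the most recent request was sent."""
        self._last_request = now


robots_txt = "User-agent: *\nDisallow: /private/\nCrawl-delay: 2\n"
policy = PoliteFetchPolicy(robots_txt, user_agent="example-bot")
```

A crawler built on this sketch would call `allowed(url)` before each request, sleep for `wait_time(time.monotonic())` seconds, and then call `record_request(time.monotonic())` once the request is sent. Passing the clock value in explicitly keeps the policy easy to test without real delays.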