014 GitHub stars
02Query selectors directly against rendered DOM snapshots and compare multiple selectors.
03Fetch pages via standard HTTP client or CDP renderer for JavaScript-heavy content.
04Inspect web pages with summaries, parsed DOM trees, prettified HTML, and link extraction.
05Cache HTML documents locally with configurable limits and reuse via document handles.
06Generate reusable `silkworm` spider templates from blueprints and validate them statically.