Crawlforge is an open-source, LLM-native alternative to proprietary SEO spiders for running detailed technical SEO audits. It crawls websites, extracts critical on-page data, and evaluates 269 technical SEO rules across 18 categories. All crawl data is stored as Parquet files in a queryable DuckDB database, enabling custom SQL analysis, crawl diffs, and straightforward integration with data warehouses. A "rules as code" philosophy keeps the checks transparent and extensible. Its key differentiator is a native MCP server that lets AI coding assistants such as Claude Code, Codex, or Cursor drive the crawler directly: running queries, summarizing findings, and generating client-ready reports from natural-language instructions.
Key Features
- Open-source technical SEO spider with 269 built-in rules across 18 categories
- LLM-native design with an included Model Context Protocol (MCP) server for AI integration
- Columnar storage of crawl data in DuckDB + Parquet for SQL querying and custom analysis
- Extensible "rules as code" architecture for adding custom SEO checks with unit tests
- Comprehensive technical SEO audit coverage spanning response codes, metadata, structured data, hreflang, and more
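Crawlforge's actual rule interface isn't shown here, but the "rules as code with unit tests" idea can be sketched with stdlib Python alone. The `Page`, `Issue`, and `check_title_length` names below are hypothetical, chosen only to illustrate the pattern:

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical shapes; Crawlforge's real rule API may differ.
@dataclass
class Page:
    url: str
    title: Optional[str]

@dataclass
class Issue:
    rule_id: str
    url: str
    message: str

def check_title_length(page: Page, max_len: int = 60) -> Optional[Issue]:
    """Flag pages whose <title> is missing or longer than max_len characters."""
    if not page.title:
        return Issue("title-missing", page.url, "Page has no <title>")
    if len(page.title) > max_len:
        return Issue("title-too-long", page.url,
                     f"Title is {len(page.title)} chars (limit {max_len})")
    return None

# Unit-test style assertions, in the spirit of rules shipping with tests.
assert check_title_length(Page("https://example.com/", "Home")) is None
assert check_title_length(Page("https://example.com/x", None)).rule_id == "title-missing"
assert check_title_length(Page("https://example.com/y", "T" * 80)).rule_id == "title-too-long"
```

Keeping each rule as a small pure function like this is what makes a rule set transparent, diffable in code review, and coverable by ordinary unit tests.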
Use Cases
- Automating SEO analysis, reporting, and issue identification through AI assistants using natural language
- Conducting deep technical SEO audits without relying on a GUI or SaaS platform
- Performing custom analysis, joining, and diffing of crawl data with SQL queries