01 Schema-driven extraction using Claude's vision capabilities for complex PDF layouts
02 Automatic JSON repair and validation against domain-specific external APIs
03 Integrated filtering pipeline using local models (Ollama) or Claude (Haiku/Sonnet)
04 Comprehensive quality assurance with precision, recall, and F1 score calculation
05 Multi-format export supporting Python, R, CSV, Excel, and SQLite
06 3 GitHub stars
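
The precision, recall, and F1 calculation named in the quality assurance item can be illustrated with a minimal sketch. It assumes extracted records and a manually verified reference set can both be represented as hashable tuples and compared by exact match. The function name `precision_recall_f1` and the sample records are hypothetical and are not taken from the project itself, which may match records differently (for example, with fuzzy or per-field comparison).

```python
from typing import Hashable, Iterable


def precision_recall_f1(
    extracted: Iterable[Hashable],
    reference: Iterable[Hashable],
) -> dict[str, float]:
    """Score extracted records against a reference set by exact match.

    Hypothetical helper for illustration only; not the project's API.
    """
    extracted_set = set(extracted)
    reference_set = set(reference)

    # Records that appear in both the extraction output and the reference.
    true_positives = len(extracted_set & reference_set)

    # Precision: share of extracted records that are correct.
    precision = true_positives / len(extracted_set) if extracted_set else 0.0
    # Recall: share of reference records that were recovered.
    recall = true_positives / len(reference_set) if reference_set else 0.0

    # F1: harmonic mean of precision and recall (0.0 when both are zero).
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0

    return {"precision": precision, "recall": recall, "f1": f1}


# Made-up sample data: (document, field, value) tuples.
scores = precision_recall_f1(
    extracted=[
        ("paper1", "n", 120),
        ("paper1", "species", "A"),
        ("paper2", "n", 45),
    ],
    reference=[
        ("paper1", "n", 120),
        ("paper1", "species", "B"),
        ("paper2", "n", 45),
        ("paper2", "species", "C"),
    ],
)
print(scores)
```

In this sample, two of three extracted records match the reference (precision 2/3) and two of four reference records are recovered (recall 1/2), giving an F1 of 4/7. Exact-match scoring is the strictest choice; a real pipeline might relax it per field.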