01Pre-commit hook integration for continuous data quality assurance
02Comprehensive coverage analysis across domains and difficulty levels
03Referential integrity checks between query sets and document sections
04Semantic duplicate detection with configurable similarity thresholds
05Automated JSON schema validation for documents and queries
0669 GitHub stars