GoldenCheck revolutionizes data validation by automatically discovering quality rules from your existing data, eliminating the need for manual rule creation. It offers a powerful command-line interface with an interactive Text User Interface (TUI) for reviewing findings, and an optional LLM boost to enhance issue detection, severity assessment, and relationship discovery. Supporting various data sources like CSV, Parquet, and databases, it facilitates continuous monitoring, auto-fixes, and seamless integration into CI/CD pipelines, ensuring high data integrity with minimal setup.
主要功能
01Automated data fixing capabilities (trim, normalize, coerce types)
02Automated rule discovery from diverse data sources
03Interactive TUI and CLI for reviewing data quality findings
04LLM-enhanced detection of subtle data quality issues
050 GitHub stars
06REST API for programmatic data quality scans and monitoring
使用案例
01Automating data validation and quality gates in CI/CD pipelines
02Continuously monitoring data quality in directories, databases, or data lakes
03Interactively scanning, reviewing, and fixing data quality issues in datasets