Performs automated data quality audits including null analysis, duplicate detection, and anomaly screening for Dremio Cloud datasets.
The Dremio Data Quality Investigator skill empowers Claude to proactively monitor and report on the integrity of Dremio Cloud data sources. By automating complex SQL-based checks for schema drift, nullability, data freshness, and statistical outliers, it helps data engineers and analysts identify pipeline failures or data corruption before they impact downstream applications. This skill provides a structured workflow that transitions from initial schema discovery and sampling to a comprehensive, actionable quality report, ensuring data reliability across your lakehouse.
Características Principales
01Schema drift verification and nullability assessment
025 GitHub stars
03Data freshness and update frequency monitoring
04Statistical outlier and numeric range validation
05Automated null and duplicate detection via optimized SQL queries
06Standardized data quality reporting for stakeholder review
Casos de Uso
01Troubleshooting ETL pipeline failures or identifying the source of stale data
02Auditing a new Dremio dataset for production readiness and constraint validation
03Identifying primary key violations and duplicate records in large-scale datasets