01Enforces strict schema consistency for seamless cross-dataset integration
02Converts raw CSV, XML, Parquet, and JSON into standardized SourceSet DataFrames
03Provides interactive subcommands for pipeline orchestration and data inspection
04Automates code generation for new data transformation functions via 'Chef' builders
05Supports diverse domains including CGM, EHR, genomics, and wearable device data
060 GitHub stars