01Load CSV or Parquet files and manage multiple datasets using Pandas or Polars.
02Validate dataset schemas with Pandera and clean data using PyJanitor pipelines.
03Generate comprehensive statistical profiles and insights with YData Profiling.
04Create and save various Seaborn/Matplotlib charts for LLM interpretation.
050 GitHub stars
06Execute complex SQL queries across loaded datasets using an in-memory DuckDB engine.