About
This skill generates test datasets from schema specifications for ETL pipelines and data-driven applications. It produces values for common data types (integers, floats, strings, dates, and enums) and automatically injects edge cases such as null values, boundary limits, and empty strings. Because the data is synthetic but realistic, developers and data engineers can catch pipeline failures and data validation issues before they reach production.
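As a rough illustration of the idea, here is a minimal Python sketch of schema-driven generation with edge-case injection. It does not show the skill's actual implementation or input format: the `SCHEMA` layout and the `edge_cases`, `random_value`, and `generate_rows` helpers are hypothetical names made up for this example, and only the standard library is used.

```python
import random
from datetime import date, timedelta

# Hypothetical schema format (an assumption for this sketch, not the skill's real input).
SCHEMA = {
    "id": {"type": "int", "min": 0, "max": 1000},
    "price": {"type": "float", "min": 0.0, "max": 99.99},
    "name": {"type": "string", "max_len": 8},
    "created": {"type": "date", "start": date(2020, 1, 1), "end": date(2020, 12, 31)},
    "status": {"type": "enum", "values": ["new", "active", "closed"]},
}


def edge_cases(spec):
    """Return edge-case values for one field: null plus type-specific boundaries."""
    kind = spec["type"]
    if kind in ("int", "float"):
        return [None, spec["min"], spec["max"]]
    if kind == "string":
        return [None, "", "x" * spec["max_len"]]
    if kind == "date":
        return [None, spec["start"], spec["end"]]
    return [None]  # enum: only null is treated as an edge case here


def random_value(spec, rng):
    """Draw one ordinary (non-edge) value for a field."""
    kind = spec["type"]
    if kind == "int":
        return rng.randint(spec["min"], spec["max"])
    if kind == "float":
        return round(rng.uniform(spec["min"], spec["max"]), 2)
    if kind == "string":
        length = rng.randint(1, spec["max_len"])
        return "".join(rng.choice("abcdefghijklmnopqrstuvwxyz") for _ in range(length))
    if kind == "date":
        span = (spec["end"] - spec["start"]).days
        return spec["start"] + timedelta(days=rng.randint(0, span))
    if kind == "enum":
        return rng.choice(spec["values"])
    raise ValueError(f"unsupported type: {kind}")


def generate_rows(schema, n_random=5, seed=0):
    """Build one row per edge case per field, followed by n_random ordinary rows."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    rows = []
    for field, spec in schema.items():
        for value in edge_cases(spec):
            row = {name: random_value(s, rng) for name, s in schema.items()}
            row[field] = value  # overwrite exactly one field with the edge case
            rows.append(row)
    rows.extend(
        {name: random_value(s, rng) for name, s in schema.items()}
        for _ in range(n_random)
    )
    return rows
```

Calling `generate_rows(SCHEMA)` with this example schema returns 18 rows: 13 rows that each carry a single edge case (a null, a boundary value, or an empty string) and 5 ordinary rows. Isolating one edge case per row is a design choice for this sketch; it makes it easier to tell which value caused a pipeline failure.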