Introduction
This skill provides a structured framework for consolidating data from heterogeneous formats (JSON, CSV, Parquet, and XML) into a single unified output. It walks through the full ETL process: setting up the environment with libraries such as pandas and PyArrow, normalizing fields across sources, and deduplicating records. Conflicts between sources are resolved with customizable priority rules, and the skill generates detailed reports so that the merge stays transparent and data integrity can be verified. It is aimed at developers who build data pipelines or need to consolidate records from multiple sources.
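As a rough illustration of the normalize, deduplicate, and resolve-by-priority flow described above, here is a minimal sketch. It uses only the Python standard library so that it stays self-contained, rather than the pandas and PyArrow setup the skill itself relies on. The names `SOURCE_PRIORITY`, `FIELD_ALIASES`, `normalize`, and `merge_records`, as well as the sample data, are hypothetical and are not part of the skill.

```python
import csv
import io
import json

# Hypothetical priority order: sources listed earlier win conflicts.
SOURCE_PRIORITY = ["json", "csv"]

# Hypothetical alias map used to normalize field names across sources.
FIELD_ALIASES = {"user_id": "id", "full_name": "name", "e-mail": "email"}


def normalize(record, source):
    """Lowercase field names, apply aliases, strip strings, and tag the source."""
    out = {}
    for key, value in record.items():
        field = key.strip().lower()
        field = FIELD_ALIASES.get(field, field)
        if isinstance(value, str):
            value = value.strip()
        out[field] = value
    out["id"] = str(out["id"])  # compare keys as strings across formats
    out["_source"] = source
    return out


def merge_records(records, key="id"):
    """Deduplicate on `key`; on conflict keep the higher-priority source's value."""
    merged = {}
    conflicts = []  # simple report of every value that was overridden
    ordered = sorted(records, key=lambda r: SOURCE_PRIORITY.index(r["_source"]))
    for rec in ordered:
        target = merged.setdefault(rec[key], {})
        for field, value in rec.items():
            if field == "_source" or value in ("", None):
                continue  # skip bookkeeping fields and empty values
            if field not in target:
                target[field] = value
            elif target[field] != value:
                conflicts.append({
                    "key": rec[key],
                    "field": field,
                    "kept": target[field],
                    "dropped": value,
                    "dropped_source": rec["_source"],
                })
    return list(merged.values()), conflicts


# Made-up sample inputs standing in for two source files.
json_text = '[{"user_id": 1, "full_name": "Ada Lovelace", "email": ""}]'
csv_text = "id,name,email\n1,Ada L.,ada@example.com\n2,Alan Turing,alan@example.com\n"

records = [normalize(r, "json") for r in json.loads(json_text)]
records += [normalize(r, "csv") for r in csv.DictReader(io.StringIO(csv_text))]

merged, conflicts = merge_records(records)
print(merged)
print(conflicts)
```

In this sketch the JSON source outranks the CSV source, so the conflicting name from the CSV row is dropped and logged, while the empty JSON email is filled in from the CSV row. Parquet and XML inputs would need their own readers before being passed through the same `normalize` step.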