关于
This skill provides production-grade patterns for enhancing Apache Spark job performance, helping developers build scalable data pipelines. It enables Claude to implement optimal partitioning strategies, configure fine-grained executor memory settings, and debug performance bottlenecks like data skew or excessive shuffling. By applying industry-standard techniques such as Adaptive Query Execution (AQE), broadcast joins, and efficient serialization, this skill ensures that big data processing remains cost-effective and performant even at a multi-terabyte scale.