About
This skill provides expert-level guidance for scaling and debugging Apache Spark data processing pipelines. It offers production-ready patterns for executor configuration, memory tuning, join optimization (including broadcast joins and salted joins), and data format best practices. Whether you're dealing with slow jobs, data skew, or out-of-memory (OOM) errors, this skill helps you implement efficient execution models that reduce processing time and infrastructure costs.
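As a rough illustration of the salted join idea mentioned above, here is a minimal sketch in plain Python rather than Spark itself. It assumes a large, skewed side and a small side held as in-memory lists; the helper names (`salt_large_side`, `explode_small_side`, `hash_join`) and the `NUM_SALTS` value are hypothetical and chosen only for this example. The point is that appending a random salt to the hot key splits it into several smaller groups, while replicating the small side once per salt keeps the join result unchanged.

```python
import random
from collections import Counter, defaultdict

NUM_SALTS = 4  # hypothetical salt count; in practice this is tuned to the skew


def salt_large_side(rows, num_salts, rng):
    """Attach a random salt to each key on the large, skewed side."""
    return [((key, rng.randrange(num_salts)), value) for key, value in rows]


def explode_small_side(rows, num_salts):
    """Replicate each small-side row once per salt so every salted key can match."""
    return [((key, salt), value) for key, value in rows for salt in range(num_salts)]


def hash_join(left, right):
    """Plain in-memory inner join on whatever key the rows carry."""
    index = defaultdict(list)
    for key, value in right:
        index[key].append(value)
    return [(key, lv, rv) for key, lv in left for rv in index.get(key, [])]


# Skewed input: the "hot" key dominates the large side.
large = [("hot", i) for i in range(1000)] + [("cold", i) for i in range(10)]
small = [("hot", "H"), ("cold", "C")]

rng = random.Random(0)  # fixed seed so the sketch is repeatable
salted_large = salt_large_side(large, NUM_SALTS, rng)
salted_small = explode_small_side(small, NUM_SALTS)

# Join on (key, salt), then drop the salt to recover the original key.
joined = hash_join(salted_large, salted_small)
result = sorted((key, lv, rv) for (key, _salt), lv, rv in joined)

# The unsalted join, for comparison.
baseline = sorted(hash_join(large, small))

# Size of the largest key group before and after salting.
largest_group_before = max(Counter(key for key, _ in large).values())
largest_group_after = max(Counter(key for key, _ in salted_large).values())
```

In Spark the same shape would typically apply to DataFrames, with the salt added as an extra join column. The trade-off is that the small side grows by a factor of `NUM_SALTS`, so salting tends to pay off only when one side is small enough to replicate but a broadcast join is not an option.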