Simplifies the retrieval and analysis of Databricks job logs to diagnose Spark failures and performance issues.
The dbr-logs skill provides a specialized interface for interacting with the dbr-logs CLI, allowing developers to fetch, filter, and analyze Databricks job logs directly within Claude. It automates the identification of root causes for common Spark issues like OutOfMemoryErrors, network timeouts, and data skew by focusing on relevant application logs while suppressing JVM and Spark lifecycle noise. By utilizing structured JSONL output, it enables precise error grouping across drivers and executors, making it an essential tool for data engineers debugging production failures.
주요 기능
01Intelligent noise suppression using the focus flag to hide JVM chatter
02Pattern-based root cause assessment for Spark and Python errors
032 GitHub stars
04Structured JSONL formatting for programmatic error analysis
05Automated log retrieval for driver and specific executor sources
06Direct resolution of job names and Databricks workspace URLs
사용 사례
01Investigating executor-specific memory leaks and OutOfMemoryError patterns
02Debugging a failed Databricks production job using a URL or job name
03Filtering massive logs for specific search terms across large clusters