About
The Braintrust skill adds LLM observability to a Claude Code environment. It provides scripts for querying logs with specialized SQL, running evaluations, and logging input/output data so that model behavior can be inspected. With these scripts, users can analyze performance trends, filter logs by metadata, and iterate on LLM prompts using logged data, which supports the reliability of AI applications in production.