01Manage prompt versions and labels across staging and production
02Identify and debug exceptions with full stacktrace context
0341 GitHub stars
04Create and maintain evaluation datasets and test cases
05Monitor performance metrics including latency and token usage
06Query and inspect AI traces and LLM generations in real-time