About
The AgRAG LLM Judge skill enables developers to systematically assess the performance of Agentic GraphRAG systems, particularly those tailored for complex domains like telecommunications test scope analysis. It automates the process of running batch prompts in headless mode while maintaining thread consistency to ensure reliable evaluations of retrieval quality, grounding, and tool usage. By applying a structured rubric that measures metrics like graph path validity and tool efficiency, the skill generates detailed transcripts and diagnostic reports, helping developers identify failure modes and apply recommended fixes to improve agentic accuracy.