Overview
This framework validates Claude Code skills with a Red-Green-Refactor process before deployment. It runs each skill under simulated pressure, such as tight deadlines, authority constraints, and sunk costs, to find and close loopholes in agent behavior. By producing quantitative Serena compliance scores and tracking loophole counts, it turns qualitative process documentation into measurable implementation patterns that resist rationalization.
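
The bookkeeping behind the two metrics can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `ScenarioResult` structure, the `summarize` helper, and the sample data are all hypothetical, and the score here is a plain pass rate standing in for the Serena compliance score, whose actual formula is not given in this overview.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ScenarioResult:
    """Outcome of one pressure scenario (hypothetical structure)."""
    pressure: str   # e.g. "deadline", "authority", "sunk_cost"
    complied: bool  # did the agent follow the skill under pressure?
    loopholes: list[str] = field(default_factory=list)  # rationalizations observed


def summarize(results: list[ScenarioResult]) -> dict:
    """Return a pass-rate score and the number of distinct loopholes.

    Assumption: a plain pass rate stands in for the real compliance score.
    """
    if not results:
        raise ValueError("no scenario results to summarize")
    passed = sum(r.complied for r in results)
    # Count each distinct rationalization once, even if it recurs.
    distinct = {text for r in results for text in r.loopholes}
    return {
        "compliance_score": passed / len(results),
        "loophole_count": len(distinct),
    }


# Illustrative data only, not measured results.
results = [
    ScenarioResult("deadline", complied=False,
                   loopholes=["skip tests just this once"]),
    ScenarioResult("authority", complied=True),
    ScenarioResult("sunk_cost", complied=False,
                   loopholes=["skip tests just this once", "too late to restart"]),
]
summary = summarize(results)
```

Comparing such a summary before and after a skill revision gives a simple way to check whether the revision raised compliance and reduced the number of open loopholes.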