About
This skill provides a rigorous framework for testing and validating Claude Code skills by leveraging fresh subagent instances to ensure objective evaluation. It helps developers avoid the 'priming problem' where prior conversation history skews results, allowing for precise measurement of a skill's impact on Claude's behavior. By implementing a three-phase TDD approach—Baseline, With-Skill, and Rationalization—users can systematically confirm that their custom skills are effective, reproducible, and resistant to post-hoc justification.