Overview
This skill provides a systematic framework for end-to-end evaluation driven by specific user goals, rather than testing features in isolation. It traces each user journey through the entire stack, covering the frontend (UX), application logic (Code), LLM interactions (AI), and backend persistence (Infrastructure), to pinpoint exactly where a process breaks down. It is suited to debugging complex workflows such as onboarding or multi-step configuration. It also includes a goal library for standardizing assessments and generates reports with prioritized recommendations for improving product success rates.
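As a rough illustration of the idea, the Python sketch below models one goal as an ordered list of steps, each tagged with the layer it exercises, and reports the first step that fails. The names `Layer`, `StepResult`, `first_breakdown`, and the onboarding steps are hypothetical and chosen only for this example; they are not part of the skill itself.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Layer(Enum):
    # The four layers named in the overview (labels are illustrative).
    UX = "frontend"
    CODE = "application logic"
    AI = "LLM interactions"
    INFRA = "backend persistence"


@dataclass
class StepResult:
    step: str       # human-readable name of the journey step
    layer: Layer    # which layer of the stack this step exercises
    passed: bool    # whether the step succeeded during the evaluation
    note: str = ""  # optional detail about what was observed


def first_breakdown(results: List[StepResult]) -> Optional[StepResult]:
    """Return the earliest failed step in journey order, or None if all passed."""
    for result in results:
        if not result.passed:
            return result
    return None


# Hypothetical onboarding journey; the outcomes here are made up for the example.
onboarding = [
    StepResult("submit signup form", Layer.UX, True),
    StepResult("create account", Layer.CODE, True),
    StepResult("generate welcome plan", Layer.AI, False, "empty LLM response"),
    StepResult("save preferences", Layer.INFRA, False, "not reached"),
]

if __name__ == "__main__":
    broken = first_breakdown(onboarding)
    if broken is None:
        print("Goal achieved: every step passed")
    else:
        print(f"Goal failed at '{broken.step}' ({broken.layer.value}): {broken.note}")
```

Keeping the steps in journey order means the first failure is the one to investigate; later failures may simply be consequences of it.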