About
This skill supports building production-ready research tools through a structured experimentation framework aimed at accuracy and reliability. It runs tests at scale, often more than 50 runs per tool, to compare strategic approaches, refine data schemas, and optimize source selection. Each tool moves through four phases (initialization, broad experimentation, deep refinement, and final optimization), so that every capability is empirically validated for accuracy, scalability, and practical utility in grounding AI outputs.
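The phased workflow described above can be pictured with a minimal sketch. This is not the skill's actual implementation: the per-phase run counts, the `RunResult` record, and the `run_experiments` and `best_strategy` helpers are all hypothetical names and values chosen for illustration. Only the four phase names and the "more than 50 runs" scale come from the description.

```python
from dataclasses import dataclass
from typing import Callable

# Phase names come from the description; the run budgets are assumed values
# picked only so that the total exceeds 50 runs per tool.
PHASES = [
    ("initialization", 5),
    ("broad experimentation", 25),
    ("deep refinement", 15),
    ("final optimization", 10),
]


@dataclass
class RunResult:
    """One experiment run (hypothetical record shape)."""
    phase: str
    strategy: str
    accuracy: float


def run_experiments(strategies: dict[str, Callable[[int], float]]) -> list[RunResult]:
    """Walk through every phase, cycling over the candidate strategies.

    Each strategy is a callable that takes a run id and returns an
    accuracy score; a real tool would call its evaluation harness here.
    """
    results: list[RunResult] = []
    names = sorted(strategies)
    run_id = 0
    for phase, budget in PHASES:
        for _ in range(budget):
            name = names[run_id % len(names)]
            results.append(RunResult(phase, name, strategies[name](run_id)))
            run_id += 1
    return results


def best_strategy(results: list[RunResult]) -> str:
    """Return the strategy with the highest mean accuracy across all runs."""
    scores: dict[str, list[float]] = {}
    for r in results:
        scores.setdefault(r.strategy, []).append(r.accuracy)
    return max(scores, key=lambda s: sum(scores[s]) / len(scores[s]))
```

In practice each phase would likely narrow the set of strategies rather than cycle through all of them, but the sketch keeps a fixed set to show how a run budget and an empirical comparison fit together.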