概要
This skill provides a robust testing framework specifically designed for the Kosmos project, an implementation of the AI Scientist autonomous discovery system. It enables developers to run tiered test suites—ranging from quick sanity checks to full research workflows—across multiple environments including Ollama-powered local LLMs, Anthropic/OpenAI APIs, and secure Docker execution environments. By automating environment health checks and provider switching, it ensures the reliability of complex AI-driven research workflows and scientific discovery pipelines within the Claude Code environment.