Introduction
This skill integrates OpenAI's Whisper model into the Claude Code environment so that developers can convert audio recordings into text transcripts programmatically. It supports audio formats such as M4A and OGG, accepts a custom prompt to improve accuracy (for example, one that lists speaker names), and can return either plain text or structured JSON. It is particularly useful for processing voice notes, transcribing meeting recordings, and building automated voice-to-text pipelines within the Clawdis assistant framework.
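
To show how the options above might map onto a transcription request, here is a minimal sketch. It assumes the skill calls OpenAI's hosted transcription endpoint through the official `openai` Python package with the `whisper-1` model, which the text does not confirm. The names `build_request`, `transcribe`, and `SUPPORTED_SUFFIXES` are hypothetical, and the suffix set contains only the two formats named above, so the real list is likely longer.

```python
from pathlib import Path

# Assumption: only the two formats named in the text; the skill likely accepts more.
SUPPORTED_SUFFIXES = {".m4a", ".ogg"}


def build_request(audio_path, speaker_names=(), as_json=False):
    """Assemble keyword arguments for a transcription call (hypothetical helper)."""
    suffix = Path(audio_path).suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        raise ValueError(f"unsupported audio format: {suffix!r}")

    params = {
        "model": "whisper-1",
        # "text" yields a plain string, "json" a structured response.
        "response_format": "json" if as_json else "text",
    }
    if speaker_names:
        # The prompt nudges the model toward the correct spelling of names.
        params["prompt"] = "Speakers: " + ", ".join(speaker_names)
    return params


def transcribe(audio_path, speaker_names=(), as_json=False):
    """Send the audio file to the API and return the transcript (hypothetical helper)."""
    # Imported lazily so build_request stays usable without the SDK installed.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    params = build_request(audio_path, speaker_names, as_json)
    with open(audio_path, "rb") as audio_file:
        return client.audio.transcriptions.create(file=audio_file, **params)
```

A call such as `transcribe("standup.m4a", speaker_names=["Alice", "Bob"], as_json=True)` would request a JSON transcript with both names supplied as a spelling hint. Keeping parameter assembly separate from the network call makes the option handling easy to check without an API key.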