关于
The OpenAI Whisper API skill integrates high-accuracy speech-to-text capabilities directly into your development workflow. It enables the automated transcription of various audio formats such as M4A, OGG, and MP3 using the whisper-1 model. This tool is particularly useful for developers who need to process voice data, as it supports custom prompts for context, language specification for improved accuracy, and flexible output options including plain text or structured JSON for further programmatic manipulation.