关于
Whisper is a robust speech-to-text skill that integrates OpenAI's general-purpose speech recognition model directly into your AI development workflow. It supports 99 languages and provides six different model sizes to balance speed and accuracy, making it ideal for automating podcast transcriptions, generating meeting notes, and translating foreign language audio into English. With features like word-level timestamps and noisy audio handling, it serves as a powerful tool for developers building multimodal applications or processing large-scale audio datasets.