Overview
Whisper Audio Transcription is a skill that connects a local whisper.cpp server to the Claude environment for speech-to-text. It runs the large-v3 model with CUDA GPU acceleration and exposes transcription through a simple HTTP API, so audio never leaves the local machine. It suits developers who want to automate transcription tasks, record and convert voice notes, or add audio processing to Python and shell workflows without relying on external cloud APIs or paying subscription costs.
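As a rough illustration of what calling the HTTP API from Python might look like, here is a minimal sketch using only the standard library. It assumes the server follows the upstream whisper.cpp example server, which accepts a multipart upload at `POST /inference` and returns JSON with a `text` field. The host, port, and the helper names `build_multipart` and `transcribe` are hypothetical and not taken from this skill, so adjust them to match the actual setup.

```python
import json
import uuid
import urllib.request
from pathlib import Path

# Assumed endpoint: the upstream whisper.cpp example server exposes
# POST /inference. Host and port are placeholders, not from this skill.
SERVER_URL = "http://127.0.0.1:8080/inference"


def build_multipart(fields, file_name, file_bytes, boundary=None):
    """Encode text fields plus one file as a multipart/form-data body."""
    boundary = boundary or uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    # The audio is sent under the form field "file".
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'.encode("utf-8")
        + file_bytes
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(audio_path, url=SERVER_URL, timeout=300):
    """POST an audio file to the local server and return the transcript."""
    path = Path(audio_path)
    body, content_type = build_multipart(
        {"response_format": "json", "temperature": "0.0"},
        path.name,
        path.read_bytes(),
    )
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.loads(response.read().decode("utf-8"))
    # Assumption: the JSON response carries the transcript under "text".
    return payload.get("text", "").strip()
```

With the server running, `transcribe("voice_note.wav")` would return the transcript as a string. The file name is only an example. The multipart body is built by hand to avoid third-party dependencies; a library such as `requests` would shorten this considerably if it is available.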