01 Support for various formats including MP3, WAV, MP4, and MKV
02 High-precision word-level timestamp alignment
03 Multiple model sizes ranging from 'tiny' for speed to 'large-v2' for accuracy
04 Multi-language support with automatic speech detection
05 Export options for TXT, SRT, VTT, and structured JSON
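To illustrate the SRT export listed above, here is a minimal sketch of how timestamped segments might be turned into SRT text. The segment layout (a list of dicts with `start`, `end`, and `text` keys, times in seconds) and the function names `format_srt_timestamp` and `segments_to_srt` are assumptions made for this example, not the tool's actual API.

```python
def format_srt_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as SRT cues.

    Assumed input shape (hypothetical): each segment is a dict with
    'start' and 'end' in seconds and a 'text' string.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_srt_timestamp(seg["start"])
        end = format_srt_timestamp(seg["end"])
        # Each cue: sequence number, time range, then the text.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 1.25, "text": " Hello there."},
        {"start": 1.25, "end": 3.0, "text": " This is a test."},
    ]
    print(segments_to_srt(demo))
```

VTT output would differ mainly in its `WEBVTT` header line and in using a period instead of a comma before the milliseconds.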