01Custom prompt injection for better terminology and speaker identification
02High-accuracy transcription using the OpenAI Whisper-1 model
03Support for multiple audio formats including M4A, OGG, and MP3
04Multi-language support with explicit language tagging for improved results
05Flexible output formats including plain text and machine-readable JSON
06432 GitHub stars