01 Provides granular control over pronunciation through phoneme respelling and normalization.
02 0 GitHub stars
03 Includes pre-configured voice character profiles for specialized AI personae.
04 Utilizes expressive v3 audio tags for emotional delivery including whispers, shouts, and sighs.
05 Supports ElevenLabs v3, Multilingual v2, and Flash models for varying speeds and quality.
06 Supports local playback and export to custom MP3 files.
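
The features above can be illustrated with a minimal sketch that calls the ElevenLabs text-to-speech REST endpoint directly. It is not this tool's own interface, which the listing does not show. The respelling table, the helper names (`respell`, `build_request`, `save_mp3`), and the `VOICE_ID` and API key placeholders are all hypothetical, and the model identifiers should be checked against the current ElevenLabs documentation.

```python
import json
import re
import urllib.request

# Model identifiers as published by ElevenLabs; verify against the current docs.
MODELS = {
    "v3": "eleven_v3",
    "multilingual": "eleven_multilingual_v2",
    "flash": "eleven_flash_v2_5",
}

# Hypothetical respelling table: words mapped to easier-to-pronounce spellings.
RESPELLINGS = {"nginx": "engine-ex", "SQL": "sequel"}


def respell(text, table=RESPELLINGS):
    """Replace whole words found in the table with their respelling."""
    for word, spoken in table.items():
        text = re.sub(rf"\b{re.escape(word)}\b", spoken, text)
    return text


def build_request(text, voice_id, model="v3", tag=None, api_key="YOUR_API_KEY"):
    """Build (but do not send) a text-to-speech request."""
    spoken = respell(text)
    # Audio tags such as [whispers] are a v3 feature, so skip them elsewhere.
    if tag and model == "v3":
        spoken = f"[{tag}] {spoken}"
    body = json.dumps({"text": spoken, "model_id": MODELS[model]}).encode("utf-8")
    return urllib.request.Request(
        f"https://api.elevenlabs.io/v1/text-to-speech/{voice_id}",
        data=body,
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def save_mp3(request, path):
    """Send the request and write the returned audio bytes to path."""
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        out.write(response.read())
```

Building the request separately from sending it keeps the text preparation easy to inspect without network access. A call such as `save_mp3(build_request("Restart nginx now", "VOICE_ID", tag="whispers"), "out.mp3")` would then perform the export, assuming a valid key and voice ID.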