01Expressive text-to-speech (TTS) with style prompts and pacing control
02Advanced speaker diarization and sentiment analysis using AssemblyAI
0369 GitHub stars
04Long-form transcription support for up to 9.5 hours of audio with Gemini 2.5 Pro
05Native speech-to-speech patterns for ultra-low latency (<1s) voice agents
06Real-time emotional awareness and barge-in support for natural conversations