01 Built-in Server Voice Activity Detection (VAD) for natural turn-taking
02 Support for Azure Neural, OpenAI, and Custom/Personal voices
03 Bidirectional real-time audio streaming via WebSockets
04 1,777 GitHub stars
05 Integrated Whisper transcription for precise speech-to-text processing
06 Advanced function calling capabilities within live voice sessions
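
To make the list concrete, here is a minimal sketch of how server VAD, voice selection, Whisper transcription, and function calling might be combined in one session-configuration message sent over the WebSocket. The message shape is an assumption modeled on the `session.update` event of the OpenAI Realtime API; the schema this project actually expects may differ. The helper `build_session_update`, the voice name, and the `get_weather` tool are hypothetical and exist only for illustration.

```python
import json


def build_session_update(voice: str, tools: list[dict]) -> str:
    """Serialize a hypothetical session-configuration message as JSON text."""
    event = {
        "type": "session.update",  # assumed event name (OpenAI Realtime style)
        "session": {
            # Assumed field: which synthetic voice the service should speak with.
            "voice": voice,
            # Server-side VAD: the service, not the client, decides when a turn ends.
            "turn_detection": {"type": "server_vad"},
            # Ask the service to transcribe incoming audio with Whisper.
            "input_audio_transcription": {"model": "whisper-1"},
            # Functions the model may call during the live session.
            "tools": tools,
        },
    }
    return json.dumps(event)


# Hypothetical tool definition, described with a JSON Schema parameter object.
weather_tool = {
    "type": "function",
    "name": "get_weather",
    "description": "Look up the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

# Placeholder voice name; substitute whichever voice the deployment offers.
message = build_session_update("en-US-AvaNeural", [weather_tool])
print(message)
```

The resulting string would be sent as a text frame once the WebSocket connection is open. Audio would then flow in both directions on the same connection, which this sketch leaves out.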