01Bidirectional WebSocket communication for real-time, low-latency audio streaming.
021,777 GitHub stars
03Support for GPT-4o Realtime and Phi-4 multimodal models.
04Advanced Turn Detection including Server VAD and Azure Semantic VAD.
05Integrated function calling support for interactive voice-enabled tools.
06Comprehensive audio format support including PCM16 and G.711 for telephony.