01Advanced Voice Activity Detection (VAD) and turn-taking logic
02Barge-in detection for natural conversation interruptions
03Spoken format prompting and response length constraints
04Conversational flow and emotional nuance optimization
051 GitHub stars
06Latency-optimized architecture selection (S2S vs. Pipeline)