Enables real-time, bidirectional voice AI capabilities in .NET applications using Azure AI and WebSocket communication.
This skill provides guidance and implementation patterns for the Azure.AI.VoiceLive SDK, which developers can use to build voice assistants in .NET. It covers session configuration, real-time audio streaming, semantic voice activity detection (VAD), and function calling. It is most useful for developers building low-latency conversational AI in C# on top of Azure's real-time multimodal models, such as GPT-4o Realtime.
Key Features
01 Managed authentication using Microsoft Entra ID and Azure Key Credentials
02 Real-time bidirectional WebSocket communication for low-latency voice interactions
03 Advanced Semantic Voice Activity Detection (VAD) for natural conversation flow
04 Seamless integration with GPT-4o and Phi-4 multimodal real-time models
05 24,382 GitHub stars
06 Support for complex function calling within live voice sessions
Use Cases
01 Creating hands-free industrial or medical voice-activated AI assistants
02 Developing interactive AI customer service voice bots and virtual agents
03 Building real-time accessibility tools and speech-to-speech translators