What audio format does Azure AI VoiceLive Java require?

It requires 16-bit PCM mono audio at a 24kHz sample rate (signed, little-endian format).

Can I use custom voices with this skill?

Yes, it supports OpenAI voices (like Alloy and Ash), Azure Standard voices, Custom voices, and Azure Personal voices.

Is the client synchronous or asynchronous?

The skill uses VoiceLiveAsyncClient, as real-time voice streaming requires reactive programming patterns and non-blocking I/O.

Does it support interruptible speech?

Yes, you can enable the interruptResponse option within the ServerVadTurnDetection settings to allow the AI to be interrupted naturally.

How do I handle authentication?

You can authenticate using either an Azure API Key or the recommended DefaultAzureCredential for secure, identity-based access.

Azure AI VoiceLive for Java

Name: Azure AI VoiceLive for Java
Author: microsoft

bymicrosoft

•

1,777

•

Cloud Infrastructure

Implements real-time, bidirectional voice conversations with AI assistants using Azure AI VoiceLive SDK for Java.

Azure AI VoiceLive for Java enables developers to build high-performance voice-first applications with low-latency WebSocket communication. This skill provides a comprehensive framework for handling real-time audio streaming, speech-to-text transcription using Whisper, and server-side voice activity detection (VAD). It supports both OpenAI and Azure-specific neural voices, allowing for sophisticated function calling and stateful conversation management within Java-based AI agents using reactive programming patterns.

Key Features

01Built-in Server Voice Activity Detection (VAD) for natural turn-taking

02Support for Azure Neural, OpenAI, and Custom/Personal voices

03Bidirectional real-time audio streaming via WebSockets

041,777 GitHub stars

05Integrated Whisper transcription for precise speech-to-text processing

06Advanced function calling capabilities within live voice sessions

Use Cases

01Developing interactive voice-controlled IoT devices and applications

02Building low-latency AI customer service voice bots

03Creating real-time accessibility tools with live AI speech interaction

What are Skills?·How to Install

Install with 🐟 Skill.Fish

npx skillfish add microsoft/skills azure-ai-voicelive-java

For use in Claude.ai and ChatGPT

Download Skill