Unity integration library for Azure AI VoiceLive API. Enables real-time voice conversations with AI models (GPT-4o, etc.) and custom AI agents in Unity applications.
This library wraps the Microsoft Foundry VoiceLive API for easy use from Unity. It abstracts away complex processes such as WebSocket communication, audio capture/playback, and avatar video streaming, exposing them as Inspector-configurable components.
- Real-time Voice Conversations: Voice interactions with Azure AI models (GPT-4o) and custom agents
- Unity Components: MonoBehaviour-based components configurable via Inspector
- WebSocket Integration: Efficient WebSocket communication with main thread event handling
- Audio Processing: Microphone input capture and AudioSource playback (PCM16 format)
- Avatar Support: Avatar video streaming via Unity WebRTC (optional)
- Unity 6000.0 or later
- .NET Standard 2.1
- Microsoft Foundry account with VoiceLive API access
- `com.unity.webrtc` 3.0.0 or later (required for avatar features)
- Open Unity Package Manager (Window → Package Manager)
- Click "+" button → "Add package from git URL"
- Enter one of the following URLs:
Latest version (upm branch):
https://github.com/TakahiroMiyaura/UnityVoiceLiveAPI.git#upm
Specific version (e.g., 1.0.0):
https://github.com/TakahiroMiyaura/UnityVoiceLiveAPI.git#upm@1.0.0
- Clone or download this repository
- Copy the `Unity/UnityVoiceLiveAPI/Assets/Reseul/UnityVoiceLiveAPI` folder to your Unity project's `Packages` directory
- Ensure required dependencies are installed:
  - `com.unity.nuget.newtonsoft-json` (3.2.1 or later)
- Create an empty GameObject in your scene
- Attach the `UnityVoiceLiveClient` component
- Configure connection settings in the Inspector:
- Endpoint: Azure AI endpoint URL
- Access Token: API key or Bearer token
- Connection Mode: Select AIAgent or AIModel
To use a specific microphone device (e.g., XR headset microphone):
- Create an audio capture settings asset:
  - Right-click in the Project window
  - Select Create > VoiceLive API > Audio Capture > Unity Microphone
- Configure the device name and other settings in the Inspector
- Assign the asset to the Audio Capture Settings field in `UnityVoiceLiveClient`
If not configured, the default system microphone will be used.
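To find the exact device name to enter in the settings asset, you can enumerate the capture devices Unity sees via the standard `Microphone.devices` API (this helper script is illustrative and not part of this package):

```csharp
using UnityEngine;

// Logs every microphone device name Unity can see, so you can copy the
// exact string into the Audio Capture settings asset.
public class ListMicrophones : MonoBehaviour
{
    void Start()
    {
        foreach (var device in Microphone.devices)
        {
            Debug.Log($"Microphone device: {device}");
        }
    }
}
```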
```csharp
using Com.Reseul.Azure.AI.VoiceLiveAPI.Unity.Components;
using UnityEngine;

public class VoiceExample : MonoBehaviour
{
    private UnityVoiceLiveClient client;

    void Start()
    {
        client = GetComponent<UnityVoiceLiveClient>();

        // Set up event listeners
        client.OnSessionStarted.AddListener(() =>
        {
            Debug.Log("Session started!");
        });

        client.OnTranscriptReceived.AddListener((transcript) =>
        {
            Debug.Log($"Transcript: {transcript}");
        });

        // Start the connection (fire-and-forget)
        _ = client.Connect();
    }
}
```

The package includes the following samples:
A sample demonstrating voice conversations with avatar video streaming.
How to import:
- Select this package in Unity Package Manager
- Import "Avatar Sample" from the "Samples" section
```
Unity MonoBehaviour Components
        ↓
UnityVoiceLiveClient (Main Thread Queue)
        ↓
VoiceLive API Core (.NET Standard 2.1)
        ↓
Microsoft Foundry WebSocket API
```
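The "Main Thread Queue" step exists because WebSocket callbacks arrive on background threads, while most Unity APIs may only be called from the main thread. A common way to bridge the two is to drain a concurrent queue in `Update()`. The sketch below shows this generic pattern; it is not the package's actual internal implementation:

```csharp
using System;
using System.Collections.Concurrent;
using UnityEngine;

// Generic main-thread dispatcher: background threads enqueue actions,
// and Update() executes them on Unity's main thread.
public class MainThreadDispatcher : MonoBehaviour
{
    private readonly ConcurrentQueue<Action> queue = new ConcurrentQueue<Action>();

    // Safe to call from any thread (e.g., a WebSocket receive callback).
    public void Enqueue(Action action) => queue.Enqueue(action);

    void Update()
    {
        // Execute all pending actions on the main thread.
        while (queue.TryDequeue(out var action))
        {
            action();
        }
    }
}
```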
| Platform | Support | Notes |
|---|---|---|
| Windows (Standalone) | ✅ | |
| Android | ✅ | Microphone permissions required |
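On Android, the RECORD_AUDIO permission must be granted at runtime before the microphone can be captured. Unity's standard `UnityEngine.Android.Permission` API handles the request; the following sketch shows the usual approach, independent of this package:

```csharp
#if UNITY_ANDROID
using UnityEngine;
using UnityEngine.Android;

// Requests microphone permission on Android before starting a voice session.
public class MicPermission : MonoBehaviour
{
    void Start()
    {
        if (!Permission.HasUserAuthorizedPermission(Permission.Microphone))
        {
            Permission.RequestUserPermission(Permission.Microphone);
        }
    }
}
#endif
```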
For detailed documentation, see:
- Package README - Detailed usage and API reference
- CHANGELOG - Release notes
- THIRD-PARTY-NOTICES - Third-party licenses
Takahiro Miyaura
- GitHub: @TakahiroMiyaura
Issues and Pull Requests are welcome. See CONTRIBUTING.md for details.