Support half-duplex mode for Openai Realtime API by toubatbrian · Pull Request #814 · livekit/agents-js

toubatbrian · 2025-11-07T11:06:56Z

Allow openai realtime model to have text output piped with a custom TTS model

changeset-bot · 2025-11-07T11:07:00Z

🦋 Changeset detected

Latest commit: 0cb3f42

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 14 packages

Name	Type
@livekit/agents-plugin-google	Patch
@livekit/agents-plugin-openai	Patch
@livekit/agents	Patch
@livekit/agents-plugin-anam	Patch
@livekit/agents-plugin-cartesia	Patch
@livekit/agents-plugin-elevenlabs	Patch
@livekit/agents-plugin-neuphonic	Patch
@livekit/agents-plugin-resemble	Patch
@livekit/agents-plugin-rime	Patch
@livekit/agents-plugin-bey	Patch
@livekit/agents-plugin-deepgram	Patch
@livekit/agents-plugin-livekit	Patch
@livekit/agents-plugin-silero	Patch
@livekit/agents-plugins-test	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

samuelcastro · 2025-11-07T23:11:47Z

@Shubhrakanti @theomonnom Any idea when we could get this reviewed and deployed? This is blocking us atm. Thank you!

samuelcastro · 2025-11-08T23:34:46Z

@toubatbrian I tested this changes and initially it worked just fine (with some latency) but after follow up questions the agent remains silent, my logs:

{"level":40,"time":1762644702889,"pid":63178,"hostname":"Sams-MacBook-Pro.local","msg":"SegmentSynchronizerImpl text marked as ended in capture text, rotating segment"}
{"level":50,"time":1762644703046,"pid":63178,"hostname":"Sams-MacBook-Pro.local","msg":"Error in SynthesizeStream"}
{"level":30,"time":1762644703049,"pid":63178,"hostname":"Sams-MacBook-Pro.local","ttftMs":302,"input_tokens":854,"cached_input_tokens":832,"output_tokens":17,"total_tokens":871,"tokens_per_second":36.48,"msg":"RealtimeModel metrics"}
{"level":40,"time":1762644715328,"pid":63164,"hostname":"Sams-MacBook-Pro.local","msg":"job is unresponsive"}

toubatbrian · 2025-11-10T09:04:04Z

@samuelcastro, which STT / TTS are you using. I've tested multiple time on my end as well and it worked fine. Could you also post the full logs?

samuelcastro · 2025-11-10T15:14:17Z

@toubatbrian Tested with cartesia sonic 3.

samuelcastro · 2025-11-11T17:42:03Z

@toubatbrian Any idea when we can get this in?

toubatbrian · 2025-11-12T08:50:03Z

@samuelcastro I've tested with sonic-3, and it worked also fine. Have you tested with this example: https://github.com/livekit/agents-js/blob/06eceabc78c2d8b14071e8eef43c0c0e1fe74c78/examples/src/realtime_with_tts.ts?

Any idea when we can get this in?

Let me check with my team and I'll get back to you shortly!

slpn1 · 2025-11-12T09:15:29Z

@toubatbrian I tested this changes and initially it worked just fine (with some latency) but after follow up questions the agent remains silent, my logs:

@toubatbrian You're not encountering the 5 responses limit when using cartesia without an API key are you?

toubatbrian · 2025-11-12T09:37:51Z

@samuelcastro You can also try Livekit Inference Gateway: https://docs.livekit.io/agents/models/tts/inference/cartesia/, with something like:

import { AgentSession } from '@livekit/agents';

session = new AgentSession({
    tts="cartesia/sonic-3:9626c31c-bec5-4cca-baa8-f8ba9e84c8bc",
    // ... tts, stt, vad, turn_detection, etc.
});

or (if you want more custom control):

import { AgentSession } from '@livekit/agents';

session = new AgentSession({
    tts: new inference.TTS({ 
        model: "cartesia/sonic-3", 
        voice: "9626c31c-bec5-4cca-baa8-f8ba9e84c8bc", 
        language: "en",
        modelOptions: {
            speed: 1.5,
            volume: 1.2,
            emotion: "excited"
        }
    }),
    // ... tts, stt, vad, turn_detection, etc.
});

This would be much easier to test and setup

samuelcastro · 2025-11-12T13:44:12Z

ok great @toubatbrian I will test it again.

toubatbrian added 4 commits November 7, 2025 18:45

save interface changes

769e1c2

fix lint

635bb75

fix typing

88e16b3

save testable ckpt

4eb67db

toubatbrian changed the title ~~brianyin/ajs-322-openai-half-duplex-mode~~ [draft] brianyin/ajs-322-openai-half-duplex-mode Nov 7, 2025

toubatbrian added 4 commits November 7, 2025 19:10

add realtime with custom tts example

c615542

Update realtime_with_tts.ts

25277f9

fix bugs

249cdd5

restore debug logs

bbf09ff

toubatbrian changed the title ~~[draft] brianyin/ajs-322-openai-half-duplex-mode~~ brianyin/ajs-322-openai-half-duplex-mode Nov 7, 2025

toubatbrian requested review from Shubhrakanti and theomonnom November 7, 2025 11:52

toubatbrian added 3 commits November 7, 2025 19:54

cleanup

53967d7

Create sour-mugs-lay.md

8634baa

Update realtime_model.ts

cd36c22

Update realtime_with_tts.ts

cb85299

save

06eceab

toubatbrian mentioned this pull request Nov 12, 2025

Realtime with custom tts #772

Closed

Merge branch 'main' into brian/realtime-with-tts

0cb3f42

toubatbrian changed the title ~~brianyin/ajs-322-openai-half-duplex-mode~~ Support half-duplex mode for Openai Realtime API Nov 12, 2025

theomonnom approved these changes Nov 12, 2025

View reviewed changes

toubatbrian merged commit 9a58cd3 into main Nov 13, 2025
8 checks passed

toubatbrian deleted the brian/realtime-with-tts branch November 13, 2025 09:27

github-actions Bot mentioned this pull request Nov 13, 2025

Version Packages #817

Merged

simllll mentioned this pull request Nov 17, 2025

feat: TTS with RealtimeModel #781

Closed

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support half-duplex mode for Openai Realtime API#814

Support half-duplex mode for Openai Realtime API#814
toubatbrian merged 14 commits intomainfrom
brian/realtime-with-tts

toubatbrian commented Nov 7, 2025 •

edited

Loading

Uh oh!

changeset-bot Bot commented Nov 7, 2025 •

edited

Loading

Uh oh!

samuelcastro commented Nov 7, 2025 •

edited

Loading

Uh oh!

samuelcastro commented Nov 8, 2025

Uh oh!

toubatbrian commented Nov 10, 2025

Uh oh!

samuelcastro commented Nov 10, 2025

Uh oh!

samuelcastro commented Nov 11, 2025

Uh oh!

toubatbrian commented Nov 12, 2025

Uh oh!

slpn1 commented Nov 12, 2025

Uh oh!

toubatbrian commented Nov 12, 2025

Uh oh!

samuelcastro commented Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

toubatbrian commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

changeset-bot Bot commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

samuelcastro commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

samuelcastro commented Nov 8, 2025

Uh oh!

toubatbrian commented Nov 10, 2025

Uh oh!

samuelcastro commented Nov 10, 2025

Uh oh!

samuelcastro commented Nov 11, 2025

Uh oh!

toubatbrian commented Nov 12, 2025

Uh oh!

slpn1 commented Nov 12, 2025

Uh oh!

toubatbrian commented Nov 12, 2025

Uh oh!

samuelcastro commented Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

toubatbrian commented Nov 7, 2025 •

edited

Loading

changeset-bot Bot commented Nov 7, 2025 •

edited

Loading

samuelcastro commented Nov 7, 2025 •

edited

Loading