feat(openai): stream input_audio_transcription delta events by longcw · Pull Request #5859 · livekit/agents

longcw · 2026-05-27T02:34:43Z

Summary

Ports livekit/agents-js#1581 — wires conversation.item.input_audio_transcription.delta from the OpenAI Realtime API so user transcripts surface word-by-word as InputTranscriptionCompleted(is_final=False) partials, instead of only firing once on .completed.

Enables streaming user transcripts with gpt-realtime-whisper (and any future delta-emitting transcription model). Previously the .delta branch was a pass because partials weren't useful from the legacy transcription pipeline; now that OpenAI streams them in realtime, we accumulate and emit.

Changes

Add _input_transcript_accumulators: dict[str, dict[int, str]] keyed by (item_id, content_index).
New _handle_conversion_item_input_audio_transcription_delta handler accumulates and emits is_final=False.
_handle_..._completed clears the matching accumulator before emitting the final, so a subsequent delta on the same item_id starts fresh.
_handle_..._failed emits a closing is_final=True with the last accumulated partial so consumers waiting on a final don't hang. No-op when no partials had streamed.
Accumulators are also cleared on conversation.item.deleted and session reconnect.

Wire conversation.item.input_audio_transcription.delta from the OpenAI Realtime API as InputTranscriptionCompleted(is_final=False) partials. Accumulators are keyed per (item_id, content_index) and cleared on .completed, .deleted, session reconnect, and on .failed (which now emits a closing is_final=True when partials had streamed so consumers don't hang).

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 3 additional findings.

chenghao-mou requested a review from a team May 27, 2026 02:34

devin-ai-integration Bot reviewed May 27, 2026

View reviewed changes

test(realtime): wait for final input transcript before asserting

9c38f8b

theomonnom approved these changes May 27, 2026

View reviewed changes

longcw merged commit 541d844 into main May 27, 2026
25 checks passed

longcw deleted the longc/realtime-input-transcription-delta branch May 27, 2026 03:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(openai): stream input_audio_transcription delta events#5859

feat(openai): stream input_audio_transcription delta events#5859
longcw merged 2 commits into
mainfrom
longc/realtime-input-transcription-delta

longcw commented May 27, 2026

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

longcw commented May 27, 2026

Summary

Changes

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants