(gladia & soniox): add translation support by tinalenguyen · Pull Request #5148 · livekit/agents

tinalenguyen · 2026-03-18T21:18:55Z

Add `input_language` and `input_text` to `SpeechData`, and add translation support for the Gladia and Soniox plugins.

to test, iterate through STT node:

# Override on an Agent subclass: wraps the default STT node and logs the
# translation fields whenever the STT plugin populates them.
async def stt_node(
    self, audio: AsyncIterable, model_settings: ModelSettings
) -> AsyncIterable[stt_module.SpeechEvent]:
    async for event in Agent.default.stt_node(self, audio, model_settings):
        if isinstance(event, stt_module.SpeechEvent) and event.alternatives:
            alt = event.alternatives[0]
            # input_language is only set when translation is active
            if alt.input_language:
                logger.info(
                    f"[STT translation] input_language={alt.input_language}, "
                    f"language={alt.language}, "
                    f"input_text={alt.input_text!r}, text={alt.text!r}"
                )
        # always pass the event through unchanged
        yield event

…dpoint Emit END_OF_SPEECH based on speaking state, not final text presence. Previously both were inside the same conditional, so if an error or finished message arrived while speaking but before final tokens accumulated, END_OF_SPEECH was skipped. This left downstream consumers in speaking state with no turn detection triggered. Only affects agents using turn_detection=stt (no VAD). Pre-existing bug also present on main and livekit#5148.
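
For readers following the commit message above, here is a minimal sketch of the control flow it describes. It is not the plugin's code: `StreamState`, `EventType`, and `on_endpoint` are hypothetical names made up for illustration. The only point is that the END_OF_SPEECH check depends on the speaking state alone, not on whether final text has accumulated.

```python
from dataclasses import dataclass, field
from enum import Enum


class EventType(Enum):
    # Hypothetical event names mirroring the ones mentioned in the commit message.
    FINAL_TRANSCRIPT = "final_transcript"
    END_OF_SPEECH = "end_of_speech"


@dataclass
class StreamState:
    """Hypothetical per-stream state; not the plugin's actual class."""

    is_speaking: bool = False
    final_text: str = ""
    emitted: list = field(default_factory=list)


def on_endpoint(state: StreamState) -> None:
    """Handle an endpoint, error, or finished message."""
    # Flush a final transcript only when final tokens have accumulated.
    if state.final_text:
        state.emitted.append((EventType.FINAL_TRANSCRIPT, state.final_text))
        state.final_text = ""
    # END_OF_SPEECH is decided by the speaking state alone, so it is still
    # emitted when the message arrives before any final text exists.
    if state.is_speaking:
        state.emitted.append((EventType.END_OF_SPEECH, None))
        state.is_speaking = False


# Speaking, but no final tokens yet: END_OF_SPEECH is still emitted.
state = StreamState(is_speaking=True)
on_endpoint(state)
```

With both checks nested under `if state.final_text:` (the earlier behavior, as the message describes it), the last call would emit nothing and a consumer relying on `turn_detection=stt` would stay in the speaking state.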

chenghao-mou

lgtm. One small thing I noticed is that the translation works with code-switching, but the source/input language only supports one value.

chenghao-mou · 2026-03-20T11:34:26Z

livekit-agents/livekit/agents/stt/stt.py

+    input_language: LanguageCode | None = None
+    """the detected/input language spoken by the user. populated by STT services that support translation,
+    where `language` holds the target language and `input_language` holds the original spoken language"""
+    input_text: str | None = None


nit: borrowing from machine translation terminology, we should name them source_language and source_text.

I renamed to source_languages and source_texts, for the polyglots
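
A rough sketch of how the renamed fields might be consumed, for anyone reading this thread later. The thread gives only the names `source_languages` and `source_texts`; the list types, the `SpeechDataSketch` stand-in class, and the `describe` helper below are assumptions for illustration, not the merged `SpeechData` definition.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class SpeechDataSketch:
    """Illustrative stand-in for SpeechData; field types are assumptions."""

    language: str  # target language of `text`
    text: str  # transcript in the target language
    # Assumed plural shape, so code-switched speech can report several sources.
    source_languages: list[str] | None = None
    source_texts: list[str] | None = None


def describe(alt: SpeechDataSketch) -> str:
    """Format an alternative, showing source languages when present."""
    if not alt.source_languages:
        return f"[{alt.language}] {alt.text}"
    sources = "+".join(alt.source_languages)
    return f"[{sources} -> {alt.language}] {alt.text}"


plain = SpeechDataSketch(language="en", text="hello")
mixed = SpeechDataSketch(
    language="en",
    text="hello friend",
    source_languages=["fr", "es"],
    source_texts=["bonjour", "amigo"],
)
print(describe(plain))
print(describe(mixed))
```

Keeping both fields optional means plugins that do not translate can leave them unset, matching the `None` defaults in the diff above.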

chenghao-mou · 2026-03-20T11:53:33Z

livekit-plugins/livekit-plugins-soniox/livekit/plugins/soniox/stt.py

                # Reset speaking state, so the next transcript will send START_OF_SPEECH again.
                is_speaking = False
+            else:
+                final_original.reset()


I don't think we need to reset this here

tinalenguyen added 2 commits March 18, 2026 17:12

add input_language and input_text and apply to gladia and soniox

ea4583b

format

83ede96

tinalenguyen linked an issue Mar 18, 2026 that may be closed by this pull request

Add Soniox Real-Time Translation Support to livekit-plugins-soniox #4943

Closed

chenghao-mou requested a review from a team March 18, 2026 21:19

reset

d8f057e

MSameerAbbas mentioned this pull request Mar 20, 2026

feat(gladia, soniox): add translation support with input_language/input_text on SpeechData #5111

Closed

9 tasks

chenghao-mou approved these changes Mar 20, 2026

support multiple source languages

57d2525

fix

435a686

tinalenguyen merged commit 5f13474 into main Mar 20, 2026
14 of 22 checks passed

tinalenguyen deleted the tina/support-translations-speechdata branch March 20, 2026 20:37

tinalenguyen mentioned this pull request Mar 20, 2026

Feature Request: Add Original Language Detection to Gladia STT Plugin #4402

Closed

tinalenguyen merged 5 commits into main from tina/support-translations-speechdata
