fix(nemo): extract Hypothesis.text for TDT/RNNT ASR models by fqscfqj · Pull Request #10012 · mudler/LocalAI

fqscfqj · 2026-05-26T07:24:17Z

Problem

NeMo Parakeet TDT models (e.g. parakeet-tdt-0.6b-v3) deployed via the nemo backend always produce empty transcription output.

Root Cause

NeMo's transcribe() returns different types depending on the model architecture:

CTC models (e.g. Whisper): List[str] — works fine
TDT/RNNT models (e.g. parakeet-tdt-0.6b-v3): List[Hypothesis] — the decoded text lives in the Hypothesis.text attribute

The backend code at backend/python/nemo/backend.py line 105 did:

text = results[0]  # Hypothesis object, not a str!

This assigned the entire Hypothesis dataclass to the protobuf string field. When protobuf tried to serialize it, it either raised a TypeError (caught by the except block → returns empty) or silently converted to an empty string. Either way, the transcript was always blank.

Fix

Check the return type and extract .text from Hypothesis objects when present:

result = results[0]
if isinstance(result, str):
    text = result
elif hasattr(result, 'text'):
    text = result.text if result.text else 
else:
    text = str(result) if result else

This is backward-compatible — CTC models still return strings and take the first branch.

Testing

Verified by reading the NeMo source code:

nemo/collections/asr/parts/submodules/rnnt_decoding.py → rnnt_decoder_predictions_tensor() always returns List[Hypothesis] even when return_hypotheses=False
nemo/collections/asr/models/rnnt_models.py → _transcribe_output_processing() passes this through unchanged

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Updates transcript extraction logic in the NeMo backend to correctly handle different model output types (CTC strings vs. RNNT/TDT Hypothesis objects).

Changes:

Adds type-aware handling for results[0] to extract text from either str or an object with a .text attribute.
Improves inline documentation clarifying expected return types from different model families.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+            elif hasattr(result, 'text'):
+                text = result.text if result.text else ""
+            else:
+                text = str(result) if result else ""


+            else:
+                text = str(result) if result else ""


CTC models (e.g. Whisper) return List[str] from transcribe(), but TDT/RNNT models (e.g. parakeet-tdt-0.6b-v3) return List[Hypothesis] where the decoded text lives in the Hypothesis.text attribute. Previously, results[0] was assigned directly to the protobuf string field, causing silent empty output for non-CTC models. Now checks the return type and extracts .text from Hypothesis objects, with a safe fallback via getattr().

Use single getattr() call instead of hasattr() + double access, and return empty string for unknown types instead of str(result) to avoid leaking internal repr to clients.

mudler · 2026-05-26T20:10:15Z

thanks!

Copilot AI review requested due to automatic review settings May 26, 2026 07:24

Copilot AI reviewed May 26, 2026

View reviewed changes

fqscfqj mentioned this pull request May 26, 2026

bug: ASR backends silently fail on non-string upstream return types (nemo TDT/RNNT, qwen-asr timestamps) #10014

Open

fqscfqj added 2 commits May 26, 2026 08:49

refactor: simplify Hypothesis text extraction per Copilot review

2e07e48

Use single getattr() call instead of hasattr() + double access, and return empty string for unknown types instead of str(result) to avoid leaking internal repr to clients.

fqscfqj force-pushed the fix/nemo-tdt-hypothesis-text branch from da11db9 to 2e07e48 Compare May 26, 2026 08:50

mudler approved these changes May 26, 2026

View reviewed changes

mudler enabled auto-merge (squash) May 26, 2026 20:10

mudler merged commit df7623f into mudler:master May 26, 2026
57 checks passed

BrewTestBot mentioned this pull request May 27, 2026

localai 4.3.2 Homebrew/homebrew-core#285003

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(nemo): extract Hypothesis.text for TDT/RNNT ASR models#10012

fix(nemo): extract Hypothesis.text for TDT/RNNT ASR models#10012
mudler merged 2 commits into
mudler:masterfrom
fqscfqj:fix/nemo-tdt-hypothesis-text

fqscfqj commented May 26, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

mudler commented May 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

fqscfqj commented May 26, 2026

Problem

Root Cause

Fix

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

mudler commented May 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants