[TTS] Refactor BOS and EOS handling to fix codec conversion by rlangman · Pull Request #15054 · NVIDIA-NeMo/NeMo

rlangman · 2025-11-10T21:36:14Z

This PR fixes a bug in MagpieTTS training in which self._codec_converter.convert_original_to_new() was called on audio codec tokens after the Lhotse data loader had already added BOS and EOS tokens to them.

BOS and EOS tokens are now always added inside the model class, after convert_original_to_new() is called. They are also removed before convert_new_to_original() is called, so that they are not mistakenly passed into the codec model.
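The ordering described above can be sketched as follows. This is a minimal illustration, not the NeMo implementation: the two converter functions are stand-ins that only share their names with the methods mentioned in this PR, and CODEBOOK_SIZE, BOS_ID, EOS_ID, add_special_tokens, and remove_special_tokens are hypothetical. It also assumes a single unpadded sequence, whereas a real batch would place EOS at each sample's own length.

```python
import torch
import torch.nn.functional as F

# Hypothetical values for illustration only (not taken from NeMo).
CODEBOOK_SIZE = 8
BOS_ID = CODEBOOK_SIZE        # special ids sit outside the codec's index range
EOS_ID = CODEBOOK_SIZE + 1


def _check_codec_range(codes: torch.Tensor) -> None:
    # A converter is only defined for real codec indices, never BOS/EOS.
    assert int(codes.min()) >= 0 and int(codes.max()) < CODEBOOK_SIZE


def convert_original_to_new(codes: torch.Tensor) -> torch.Tensor:
    """Stand-in converter: an invertible remapping of codec indices."""
    _check_codec_range(codes)
    return (CODEBOOK_SIZE - 1) - codes


def convert_new_to_original(codes: torch.Tensor) -> torch.Tensor:
    """Stand-in inverse of convert_original_to_new."""
    _check_codec_range(codes)
    return (CODEBOOK_SIZE - 1) - codes


def add_special_tokens(codes: torch.Tensor) -> torch.Tensor:
    # codes: (batch, num_codebooks, time); prepend BOS, then append EOS.
    codes = F.pad(codes, (1, 0), value=BOS_ID)
    return F.pad(codes, (0, 1), value=EOS_ID)


def remove_special_tokens(codes: torch.Tensor) -> torch.Tensor:
    # Drop the first (BOS) and last (EOS) frame along the time axis.
    return codes[..., 1:-1]


raw = torch.tensor([[[0, 3, 7]]])

# Training direction: convert first, then add BOS/EOS.
model_input = add_special_tokens(convert_original_to_new(raw))

# Decoding direction: strip BOS/EOS first, then convert back.
restored = convert_new_to_original(remove_special_tokens(model_input))
assert torch.equal(restored, raw)
```

In this sketch, calling either converter on model_input directly fails the range check, which mirrors the class of bug the PR describes: special tokens reaching code that expects only codec indices.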

There are also a few other minor changes, including:

Fix data loading field mismatch when doing on-the-fly codec extraction (we now use target_audio instead of recording and context_audio instead of context_recording)
Fix bug with inconsistent padding in spectral codec encoder
Replace some manual padding done with torch.cat with a simpler call to torch.nn.functional.pad
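As an illustration of the last item, the sketch below compares right-padding the final dimension with torch.cat against the equivalent torch.nn.functional.pad call. The function names and tensor shapes are hypothetical and are not taken from the PR's diff.

```python
import torch
import torch.nn.functional as F


def pad_right_with_cat(x: torch.Tensor, pad_len: int) -> torch.Tensor:
    # Manual approach: build a zero block with matching leading dims, then concatenate.
    padding = x.new_zeros((*x.shape[:-1], pad_len))
    return torch.cat([x, padding], dim=-1)


def pad_right_with_functional(x: torch.Tensor, pad_len: int) -> torch.Tensor:
    # (left, right) padding amounts for the last dimension; constant zeros by default.
    return F.pad(x, (0, pad_len))


audio = torch.arange(6, dtype=torch.float32).reshape(2, 3)
padded_cat = pad_right_with_cat(audio, 2)
padded_functional = pad_right_with_functional(audio, 2)
assert torch.equal(padded_cat, padded_functional)
```

Both versions produce the same tensor; the functional call avoids allocating the padding block by hand and keeps dtype and device handling implicit.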

Signed-off-by: Ryan <rlangman@nvidia.com>

blisc · 2025-12-10T15:46:06Z

Closing, please make a new PR into main

github-actions bot added the TTS label Nov 10, 2025

rlangman force-pushed the magpietts_2508_eos_fix branch from c87e39e to 620903a on November 10, 2025 21:49

[TTS] Refactor BOS and EOS handling to fix codec conversion

fba3760

Signed-off-by: Ryan <rlangman@nvidia.com>

rlangman force-pushed the magpietts_2508_eos_fix branch from 620903a to 5fb564c on December 5, 2025 23:05

Modify codec resampling interface

cd463a8

rlangman force-pushed the magpietts_2508_eos_fix branch from d6f3a89 to cd463a8 on December 5, 2025 23:12

blisc closed this Dec 10, 2025

rlangman wants to merge 2 commits into NVIDIA-NeMo:magpietts_2508 from rlangman:magpietts_2508_eos_fix
