prevent double-encoding of captions for audio auto-split dataset #2543

bghira · 2026-02-01T03:29:42Z

This pull request introduces a safeguard to the text embeddings processing logic for audio datasets that source their data from video. Specifically, it ensures that text embedding processing is skipped for these datasets, as they inherit captions from their parent video dataset during training.

Text embedding processing logic:

In simpletuner/helpers/data_backend/factory.py, the _process_text_embeddings function now checks if the audio dataset is configured with source_from_video=True and skips text embedding processing in that case, logging an informational message. This prevents redundant processing since captions are inherited from the parent dataset.

prevent double-encoding of captions for audio auto-split dataset

de0b6d0

bghira merged commit 3722deb into main Feb 1, 2026
2 checks passed

bghira deleted the bugfix/double-encode-captions branch February 1, 2026 03:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

prevent double-encoding of captions for audio auto-split dataset #2543

prevent double-encoding of captions for audio auto-split dataset #2543

Uh oh!

bghira commented Feb 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

prevent double-encoding of captions for audio auto-split dataset #2543

prevent double-encoding of captions for audio auto-split dataset #2543

Uh oh!

Conversation

bghira commented Feb 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants