ui: add audio options for video datasets on models which support a+v #2545

bghira · 2026-02-01T04:26:56Z

This pull request adds support for video models that use audio conditioning by introducing new capabilities for handling audio inputs and S2V (sound-to-video) requirements. It updates both backend and frontend logic to detect and expose these capabilities, enhances the dataset wizard UI to allow audio settings for video datasets, and adds comprehensive frontend tests to ensure correct behavior.

Backend: Model capability detection

Added logic in models_service.py to detect if a model supports audio inputs (supports_audio_inputs) and if it requires S2V datasets (requires_s2v_datasets). These are now included in the model's reported capabilities.

Frontend: Capability exposure and UI logic

Updated dataloader-section-component.js and dataset-wizard.js to expose supportsAudioInputs and requiresS2VDatasets getters, making these capabilities available to Alpine.js components and UI logic.
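As a rough illustration of what such getters might look like, here is a minimal sketch. The factory name `createDataloaderSection` and the `modelCapabilities` property are hypothetical. It also assumes the backend flags reach the frontend under the same snake_case names, which the PR description does not state.

```javascript
// Hypothetical sketch: the real component has many more fields and may
// store the selected model's capabilities under a different property.
function createDataloaderSection(modelCapabilities) {
  return {
    // Assumed shape: the capabilities object reported by the backend.
    modelCapabilities: modelCapabilities || {},

    // True only when the backend explicitly reports audio input support.
    get supportsAudioInputs() {
      return Boolean(this.modelCapabilities.supports_audio_inputs);
    },

    // True only when the backend reports that S2V datasets are required.
    get requiresS2VDatasets() {
      return Boolean(this.modelCapabilities.requires_s2v_datasets);
    },
  };
}
```

Coercing with `Boolean(...)` means a missing or undefined flag reads as `false`, so models that report neither capability keep their existing UI.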

UI: Audio settings for video datasets

Modified the dataset modal in dataset_modal.html so that the Audio tab is shown for video datasets if the selected model supports audio inputs.
Enhanced the audio settings section in audio_body.html to display auto-split and audio format options for video datasets, including toggles and advanced configuration for audio extraction from videos.
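The tab visibility rule described above could be expressed roughly as follows. `shouldShowAudioTab` is a hypothetical helper, not a function from the PR, and the rule that audio datasets always show the tab is an assumption about pre-existing behavior.

```javascript
// Hypothetical helper; in the real template this is presumably an inline
// Alpine.js expression rather than a standalone function.
function shouldShowAudioTab(datasetType, supportsAudioInputs) {
  if (datasetType === "audio") {
    // Assumption: audio datasets always expose the Audio tab.
    return true;
  }
  // Video datasets show the tab only when the model accepts audio inputs.
  return datasetType === "video" && Boolean(supportsAudioInputs);
}
```

Under these assumptions, a video dataset paired with a model that lacks audio support keeps the tab hidden, which matches the condition stated in the description.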

Testing: Frontend logic

Added a new test suite dataloader_audio_capabilities.test.js covering the new getters and their combinations, ensuring that audio-related UI logic behaves correctly for various model capabilities.
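A table-driven check over the capability combinations might look like the sketch below. It does not reproduce the actual suite: `makeComponent` is a hypothetical stand-in for the real component, the snake_case flag names on the capabilities object are assumed, and the test framework used by the repository is not stated in this description.

```javascript
// Hypothetical stand-in for the component under test; the real suite
// presumably loads the actual component from dataloader-section-component.js.
function makeComponent(capabilities) {
  return {
    capabilities,
    get supportsAudioInputs() {
      return Boolean(this.capabilities && this.capabilities.supports_audio_inputs);
    },
    get requiresS2VDatasets() {
      return Boolean(this.capabilities && this.capabilities.requires_s2v_datasets);
    },
  };
}

// Every combination of the two flags, plus missing and empty capabilities.
const cases = [
  { caps: null, audio: false, s2v: false },
  { caps: {}, audio: false, s2v: false },
  { caps: { supports_audio_inputs: true }, audio: true, s2v: false },
  { caps: { requires_s2v_datasets: true }, audio: false, s2v: true },
  { caps: { supports_audio_inputs: true, requires_s2v_datasets: true }, audio: true, s2v: true },
];

// Returns the capability objects (as JSON) whose getters did not match.
function runCapabilityCases() {
  const failures = [];
  for (const { caps, audio, s2v } of cases) {
    const component = makeComponent(caps);
    if (component.supportsAudioInputs !== audio || component.requiresS2VDatasets !== s2v) {
      failures.push(JSON.stringify(caps));
    }
  }
  return failures;
}
```

Listing the cases as data keeps each combination visible at a glance and makes it cheap to add a row if another capability flag is introduced later.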

ui: add audio options for video datasets on models which support a+v

4f73a9c

bghira merged commit a382691 into main Feb 1, 2026
2 checks passed

bghira deleted the ui/video-audio-split-options branch February 1, 2026 04:38
