@bghira bghira commented Jan 29, 2026

This pull request improves the dataloader UI and configuration, focusing on audio and video dataset support, advanced options, and user experience. The main changes are new configuration options for audio and video datasets, advanced settings for destructive operations and metadata refresh, and more consistent handling of dataset types in the UI.

Audio & Video Dataset Enhancements:

  • Added audio configuration options in audio_body.html, including duration bucketing, a bucket strategy, and a clearer field structure for audio settings. Also added audio-specific HuggingFace dataset options and synchronization logic for caption fields. [1] [2] [3] [4] [5]
  • Introduced video bucketing options, allowing users to select a bucket strategy and frame interval for video datasets in video_body.html.
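
For readers unfamiliar with duration bucketing, the idea can be sketched roughly as follows. This is a generic illustration, not the project's implementation: the function name `bucket_by_duration`, the `bucket_seconds` parameter, and the fixed-width strategy are all assumptions, since the description above does not say how the bucket strategy is computed.

```python
from collections import defaultdict


def bucket_by_duration(durations, bucket_seconds=5.0):
    """Group sample names into fixed-width duration buckets (hypothetical helper).

    `durations` maps a sample name to its length in seconds. Each sample goes
    into the bucket whose lower edge is the largest multiple of
    `bucket_seconds` that does not exceed the sample's duration.
    """
    if bucket_seconds <= 0:
        raise ValueError("bucket_seconds must be positive")
    buckets = defaultdict(list)
    for name, seconds in durations.items():
        lower_edge = int(seconds // bucket_seconds) * bucket_seconds
        buckets[lower_edge].append(name)
    return dict(buckets)


# Illustrative durations only; these are not taken from any real dataset.
example = bucket_by_duration(
    {"a.wav": 3.2, "b.wav": 4.9, "c.wav": 12.0, "d.wav": 7.5}
)
```

Grouping clips of similar length keeps padding within a batch small; the same reasoning would apply to a frame-count bucket for video, with frames in place of seconds.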

Advanced & Destructive Options:

  • Added a "Metadata Refresh" section to advanced settings, allowing users to set a metadata update interval for applicable dataset types.
  • Introduced "Destructive Options" in advanced settings, providing checkboxes to delete unwanted or problematic images with clear warnings.

UI & Usability Improvements:

  • Added a footer to expanded dataset list items with a "Disable dataset" toggle for quick enable/disable actions. [1] [2]
  • Added a "Max Samples" field in the basic section to limit dataset size.
  • Added "Text File Options" for caption strategy, allowing users to disable multiline split and preserve line breaks.
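
One plausible reading of the two text file options is sketched below. The function `parse_caption_text` and its parameters are hypothetical, and the exact semantics in the project may differ; the sketch assumes that a multiline caption file is split into one caption per line by default, and that disabling the split yields a single caption whose line breaks are either kept or collapsed into spaces.

```python
def parse_caption_text(text, split_multiline=True, preserve_line_breaks=False):
    """Turn the contents of a caption text file into a list of captions.

    Hypothetical helper: the option names mirror the UI labels, not any
    confirmed configuration keys.
    """
    # Drop blank lines and surrounding whitespace on each line.
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if split_multiline:
        # Default assumption: every non-empty line is its own caption.
        return lines
    # Split disabled: one caption, with or without the original line breaks.
    joiner = "\n" if preserve_line_breaks else " "
    return [joiner.join(lines)]


raw = "a cat on a sofa\n\nsoft window light\n"
split_captions = parse_caption_text(raw)
single_caption = parse_caption_text(raw, split_multiline=False)
kept_breaks = parse_caption_text(
    raw, split_multiline=False, preserve_line_breaks=True
)
```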

Consistency & Restriction Updates:

  • Updated the logic and UI that hide the "Basic" tab for dataset types that do not support it. "audio" was removed from the exclusion list, so audio datasets now show the Basic tab. [1] [2] [3]
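
The exclusion-list check described above might look something like this minimal sketch. The set contents, tab names, and the function `visible_tabs` are placeholders chosen for illustration; the only detail taken from the description is that "audio" is no longer in the exclusion list.

```python
# Hypothetical exclusion list: dataset types that do not get a "Basic" tab.
# The entries are placeholders; only the absence of "audio" comes from the PR.
TYPES_WITHOUT_BASIC_TAB = {"text_embeds", "image_embeds"}

# Hypothetical tab names used only for this sketch.
ALL_TABS = ["basic", "storage", "advanced"]


def visible_tabs(dataset_type):
    """Return the tabs to show for a dataset type, hiding "basic" if excluded."""
    if dataset_type in TYPES_WITHOUT_BASIC_TAB:
        return [tab for tab in ALL_TABS if tab != "basic"]
    return list(ALL_TABS)


audio_tabs = visible_tabs("audio")
embed_tabs = visible_tabs("text_embeds")
```

Keeping the restriction in a single set means the UI and any validation logic can share one source of truth when a dataset type gains or loses Basic tab support.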

These changes collectively improve the flexibility and safety of dataset configuration, especially for audio and video datasets, and provide users with more control and better feedback in the UI.

@bghira bghira merged commit ed22154 into main Jan 29, 2026
2 checks passed
@bghira bghira deleted the bugfix/missing-audio-options branch January 29, 2026 18:53