feat: Integrate Wan with multi-resolution DL by pthombre · Pull Request #1475 · NVIDIA-NeMo/Automodel

pthombre · 2026-03-06T17:30:57Z

Add text-to-video multiresolution dataloader: Introduces TextToVideoDataset and build_video_multiresolution_dataloader to support video training (Wan, HunyuanVideo) with bucket-based multiresolution sampling, replacing the legacy build_dataloader / MetaFilesDataset path.
Refactor diffusion datasets with shared base class: Extracts common multiresolution logic (metadata loading, bucket grouping, dynamic batch sizing) into BaseMultiresolutionDataset, with TextToImageDataset and TextToVideoDataset as concrete implementations.
Refactor and rename dataloader APIs: Renames collate_fn_flux → collate_fn_text_to_image, build_flux_multiresolution_dataloader → build_text_to_image_multiresolution_dataloader, and extracts a shared _build_multiresolution_dataloader_core helper. Moves collate_fn_production and dataloader builders from sampler.py
to collate_fns.py.
Add video collate function (collate_fn_video) with support for model-specific optional fields (e.g. text_mask, image_embeds).
Update example configs: Migrate Wan 2.1 and HunyuanVideo YAML configs to use the new video multiresolution dataloader; update Flux configs to use renamed image dataloader API.
Remove obsolete CI/CD nightly scripts and configs for Wan 2.1 pretrain.
Fix Flux attention backend configuration (use "flash" instead of "_flash_3_hub").
Fix and update unit tests for renamed APIs and new video dataloader.

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

copy-pr-bot · 2026-03-06T17:31:01Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: pthombre <pthombre@users.noreply.github.com>

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

pthombre · 2026-03-08T22:35:33Z

/ok to test a31c392

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

Signed-off-by: pthombre <pthombre@users.noreply.github.com>

pthombre · 2026-03-08T22:40:47Z

/ok to test fa92b0c

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

pthombre · 2026-03-08T22:47:07Z

/ok to test 755c134

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

Signed-off-by: pthombre <pthombre@users.noreply.github.com>

pthombre · 2026-03-09T23:15:13Z

/ok to test 4a891b8

feat: Integrate Wan with multi-resolution DL

a0f1e31

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

pthombre and others added 4 commits March 6, 2026 17:31

Update uv lock

9f57f48

Signed-off-by: pthombre <pthombre@users.noreply.github.com>

Required changes for compatability with AM container

108e4c1

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

Fix attention backend for flux

a00d334

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

Merge branch 'main' into pranav/text_to_video_dataloader

a31c392

copy-pr-bot Bot had a problem deploying to nemo-ci March 8, 2026 22:35 Error

copy-pr-bot Bot had a problem deploying to nemo-ci March 8, 2026 22:35 Failure

copy-pr-bot Bot had a problem deploying to nemo-ci March 8, 2026 22:35 Error

copy-pr-bot Bot had a problem deploying to nemo-ci March 8, 2026 22:35 Failure

copy-pr-bot Bot had a problem deploying to test March 8, 2026 22:35 Error

pthombre and others added 2 commits March 8, 2026 15:39

Fix overrides

88c6901

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

Update uv lock

fa92b0c

Signed-off-by: pthombre <pthombre@users.noreply.github.com>

copy-pr-bot Bot temporarily deployed to test March 8, 2026 22:41 Inactive

copy-pr-bot Bot had a problem deploying to nemo-ci March 8, 2026 22:41 Error

copy-pr-bot Bot temporarily deployed to nemo-ci March 8, 2026 22:41 Inactive

copy-pr-bot Bot had a problem deploying to nemo-ci March 8, 2026 22:45 Error

Fix linting errors

755c134

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

copy-pr-bot Bot temporarily deployed to test March 8, 2026 22:47 Inactive

copy-pr-bot Bot had a problem deploying to nemo-ci March 8, 2026 22:48 Failure

copy-pr-bot Bot temporarily deployed to nemo-ci March 8, 2026 22:48 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci March 9, 2026 18:33 Inactive

copy-pr-bot Bot temporarily deployed to test March 9, 2026 18:33 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci March 9, 2026 18:58 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci March 9, 2026 19:20 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci March 9, 2026 19:36 Inactive

pthombre and others added 3 commits March 9, 2026 16:13

Restore uv lock to original status

4668bc6

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>

Update uv lock

67df67a

Signed-off-by: pthombre <pthombre@users.noreply.github.com>

Merge branch 'main' into pranav/text_to_video_dataloader

4a891b8

copy-pr-bot Bot temporarily deployed to test March 9, 2026 23:15 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci March 9, 2026 23:15 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci March 9, 2026 23:29 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci March 9, 2026 23:59 Inactive

akoumpa reviewed Mar 10, 2026

View reviewed changes

Comment thread nemo_automodel/_diffusers/auto_diffusion_pipeline.py

akoumpa approved these changes Mar 12, 2026

View reviewed changes

chtruong814 approved these changes Mar 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Integrate Wan with multi-resolution DL#1475

feat: Integrate Wan with multi-resolution DL#1475
pthombre merged 13 commits intomainfrom
pranav/text_to_video_dataloader

pthombre commented Mar 6, 2026 •

edited

Loading

Uh oh!

copy-pr-bot Bot commented Mar 6, 2026

Uh oh!

pthombre commented Mar 8, 2026

Uh oh!

pthombre commented Mar 8, 2026

Uh oh!

pthombre commented Mar 8, 2026

Uh oh!

pthombre commented Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pthombre commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

copy-pr-bot Bot commented Mar 6, 2026

Uh oh!

pthombre commented Mar 8, 2026

Uh oh!

pthombre commented Mar 8, 2026

Uh oh!

pthombre commented Mar 8, 2026

Uh oh!

pthombre commented Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pthombre commented Mar 6, 2026 •

edited

Loading