Fix SongUNet with ShardTensor when using zero embedding #1432
Merged
pzharrington merged 12 commits into NVIDIA:2.0.0-rc on Feb 19, 2026
Contributor
Greptile Summary
This PR fixes a tensor type mismatch error that occurred when training regression models with [...]. The fix adds a conditional check: when [...]
Last reviewed commit: 8db65bf
Collaborator
/blossom-ci
pzharrington approved these changes Feb 19, 2026
ktangsali pushed a commit that referenced this pull request Feb 25, 2026
* Bug fixes for ShardTensor+SongUNet
* Handle dtensor spec in sharded view
* Fix SongUNet with ShardTensor when using zero embedding
* Use buffer for zero embed
---------
Co-authored-by: Peter Harrington <48932392+pzharrington@users.noreply.github.com>
Co-authored-by: Peter Harrington <pharrington@nvidia.com>
PhysicsNeMo Pull Request
When training regression models (i.e. no time step embedding,
embedding_type == "zero") using ShardTensor, SongUNet was giving an error caused by the
emb tensor being a plain torch.Tensor:
physicsnemo/physicsnemo/models/diffusion_unets/song_unet.py, lines 627 to 632 in 70b06ed
I added a conversion of emb to ShardTensor if x is a ShardTensor. With this fix, it is possible to train the StormCastUNet type regression models with ShardTensor.
Description
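The conversion described above can be sketched roughly as follows. This is a pure-Python illustration of the pattern only, not the code in song_unet.py: PlainStub, ShardStub, add, and zero_emb_like are hypothetical stand-ins for torch.Tensor, ShardTensor, a mixed-operand op, and the zero-embedding branch, and the real ShardTensor construction call is not shown here.

```python
from dataclasses import dataclass


@dataclass
class PlainStub:
    """Hypothetical stand-in for a plain torch.Tensor."""
    data: list


@dataclass
class ShardStub(PlainStub):
    """Hypothetical stand-in for a ShardTensor (data plus mesh metadata)."""
    mesh: str = "mesh0"


def add(a, b):
    """Assumed failure mode: an op that rejects mixed sharded/plain operands."""
    if isinstance(a, ShardStub) != isinstance(b, ShardStub):
        raise TypeError("cannot mix sharded and plain operands")
    summed = [p + q for p, q in zip(a.data, b.data)]
    if isinstance(a, ShardStub):
        return ShardStub(summed, mesh=a.mesh)
    return PlainStub(summed)


def zero_emb_like(x, width):
    """Build the zero embedding, promoting it to the sharded type if x is sharded."""
    emb = PlainStub([0.0] * width)
    if isinstance(x, ShardStub):
        # The conditional conversion: match the type (and mesh) of the input.
        emb = ShardStub(emb.data, mesh=x.mesh)
    return emb


x = ShardStub([1.0, 2.0], mesh="mesh0")
out = add(x, zero_emb_like(x, 2))  # works because both operands are sharded
```

Without the isinstance check in zero_emb_like, the add call in this sketch would raise the TypeError, which mirrors the type mismatch the PR addresses.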
Checklist
Dependencies
Review Process
All PRs are reviewed by the PhysicsNeMo team before merging.
Depending on which files are changed, GitHub may automatically assign a maintainer for review.
We are also testing AI-based code review tools (e.g., Greptile), which may add automated comments with a confidence score.
This score reflects the AI’s assessment of merge readiness and is not a qualitative judgment of your work, nor is
it an indication that the PR will be accepted / rejected.
AI-generated feedback should be reviewed critically for usefulness.
You are not required to respond to every AI comment, but they are intended to help both authors and reviewers.
Please react to Greptile comments with 👍 or 👎 to provide feedback on their accuracy.