Skip to content

feat: Create DTensorPolicyWorkerV2 to integrate nemo-automodel apis#794

Merged
terrykong merged 8 commits intomainfrom
ffrujeri/automodel_integration_dtensor_policy_worker
Aug 25, 2025
Merged

feat: Create DTensorPolicyWorkerV2 to integrate nemo-automodel apis#794
terrykong merged 8 commits intomainfrom
ffrujeri/automodel_integration_dtensor_policy_worker

Conversation

@ffrujeri
Copy link
Contributor

@ffrujeri ffrujeri commented Jul 29, 2025

What does this PR do ?

This PR adds automodel repo as a submodule and creates DTensorPolicyWorkerV2 to consume parallelize functionality from nemo_automodel apis recently upstreamed.

Issues

List issues that this PR closes (syntax):
#578

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
  • Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

  • ...

@ffrujeri ffrujeri marked this pull request as ready for review July 30, 2025 00:57
@ffrujeri ffrujeri force-pushed the ffrujeri/automodel_integration_dtensor_policy_worker branch 3 times, most recently from a040061 to e565216 Compare July 31, 2025 19:11
@github-actions github-actions bot added the CI Relating to CI label Aug 1, 2025
@ffrujeri ffrujeri force-pushed the ffrujeri/automodel_integration_dtensor_policy_worker branch 2 times, most recently from cdf4cad to ddffb89 Compare August 1, 2025 20:20
Copy link
Collaborator

@terrykong terrykong left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks @ffrujeri for beginning this migration!

@ffrujeri ffrujeri requested a review from terrykong August 4, 2025 18:15
@ffrujeri ffrujeri force-pushed the ffrujeri/automodel_integration_dtensor_policy_worker branch from 327a2b9 to f47ab8d Compare August 5, 2025 04:54
@ffrujeri ffrujeri changed the title feat: Create DTensorPolicyWorkerV2 to integrate nemo_automodel apis feat: Create DTensorPolicyWorkerV2 to integrate nemo-automodel apis Aug 5, 2025
@ffrujeri ffrujeri force-pushed the ffrujeri/automodel_integration_dtensor_policy_worker branch from f47ab8d to 293d3c9 Compare August 5, 2025 16:33
@ffrujeri ffrujeri force-pushed the ffrujeri/automodel_integration_dtensor_policy_worker branch 2 times, most recently from 193d7e2 to 039adbd Compare August 14, 2025 18:11
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@ffrujeri ffrujeri force-pushed the ffrujeri/automodel_integration_dtensor_policy_worker branch from 7b75e42 to 388ad40 Compare August 15, 2025 23:33
@github-actions
Copy link

ℹ️ File Synchronization Check

Check based on commit: 388ad40 (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

✅ DTensor Policy Worker Synchronization Check

Both DTensor policy worker files were modified in this PR:

  • nemo_rl/models/policy/dtensor_policy_worker.py
  • nemo_rl/models/policy/dtensor_policy_worker_v2.py

Please ensure that the changes are consistent between both files where applicable.


This check ensures that related file implementations remain synchronized across the codebase. If you believe this warning is incorrect or the files should intentionally differ, please add a comment explaining the reasoning.

@github-actions
Copy link

✅ Submodule Fast-Forward Check Results

Check based on commit: 388ad40 (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

✅ Submodules that are properly updated:

Automodel: ✅ New submodule being added

All submodule changes look good! ✨

@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
@NVIDIA-NeMo NVIDIA-NeMo deleted a comment from github-actions bot Aug 15, 2025
Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
@ffrujeri ffrujeri force-pushed the ffrujeri/automodel_integration_dtensor_policy_worker branch from 181b9bb to 5eacd3f Compare August 23, 2025 05:27
@github-actions
Copy link

⚠️ File Synchronization Check

Check based on commit: 5eacd3f (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

⚠️ Parallel Plans Synchronization Warning

The file nemo_rl/models/dtensor/parallelize.py was modified in this PR, but 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py was not updated.

Why this matters:
These files contain similar parallel plan implementations that should be kept synchronized to ensure consistency across the codebase.

Action required:

  • Please review if the changes in nemo_rl/models/dtensor/parallelize.py should also be applied to 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py
  • Update 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py if necessary to maintain synchronization
  • If the files are intentionally different, please add a comment in the PR explaining why

Files to check:

  • Modified: nemo_rl/models/dtensor/parallelize.py
  • Not modified: 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py

✅ DTensor Policy Worker Synchronization Check

Both DTensor policy worker files were modified in this PR:

  • nemo_rl/models/policy/dtensor_policy_worker.py
  • nemo_rl/models/policy/dtensor_policy_worker_v2.py

Please ensure that the changes are consistent between both files where applicable.


This check ensures that related file implementations remain synchronized across the codebase. If you believe this warning is incorrect or the files should intentionally differ, please add a comment explaining the reasoning.

@github-actions
Copy link

✅ Submodule Fast-Forward Check Results

Check based on commit: 5eacd3f (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

✅ Submodules that are properly updated:

Automodel: ✅ New submodule being added

All submodule changes look good! ✨

@ffrujeri ffrujeri added the CI:L1 Run doctests, unit tests, and functional tests label Aug 23, 2025
@github-actions
Copy link

⚠️ File Synchronization Check

Check based on commit: 5eacd3f (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

⚠️ Parallel Plans Synchronization Warning

The file nemo_rl/models/dtensor/parallelize.py was modified in this PR, but 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py was not updated.

Why this matters:
These files contain similar parallel plan implementations that should be kept synchronized to ensure consistency across the codebase.

Action required:

  • Please review if the changes in nemo_rl/models/dtensor/parallelize.py should also be applied to 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py
  • Update 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py if necessary to maintain synchronization
  • If the files are intentionally different, please add a comment in the PR explaining why

Files to check:

  • Modified: nemo_rl/models/dtensor/parallelize.py
  • Not modified: 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py

✅ DTensor Policy Worker Synchronization Check

Both DTensor policy worker files were modified in this PR:

  • nemo_rl/models/policy/dtensor_policy_worker.py
  • nemo_rl/models/policy/dtensor_policy_worker_v2.py

Please ensure that the changes are consistent between both files where applicable.


This check ensures that related file implementations remain synchronized across the codebase. If you believe this warning is incorrect or the files should intentionally differ, please add a comment explaining the reasoning.

@github-actions
Copy link

✅ Submodule Fast-Forward Check Results

Check based on commit: 5eacd3f (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

✅ Submodules that are properly updated:

Automodel: ✅ New submodule being added

All submodule changes look good! ✨

@terrykong terrykong enabled auto-merge August 24, 2025 21:41
@terrykong terrykong added CI:docs Run doctest and removed CI:L0 Run doctests and unit tests CI:L1 Run doctests, unit tests, and functional tests labels Aug 24, 2025
@github-actions
Copy link

⚠️ File Synchronization Check

Check based on commit: 5eacd3f (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

⚠️ Parallel Plans Synchronization Warning

The file nemo_rl/models/dtensor/parallelize.py was modified in this PR, but 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py was not updated.

Why this matters:
These files contain similar parallel plan implementations that should be kept synchronized to ensure consistency across the codebase.

Action required:

  • Please review if the changes in nemo_rl/models/dtensor/parallelize.py should also be applied to 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py
  • Update 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py if necessary to maintain synchronization
  • If the files are intentionally different, please add a comment in the PR explaining why

Files to check:

  • Modified: nemo_rl/models/dtensor/parallelize.py
  • Not modified: 3rdparty/Automodel-workspace/Automodel/nemo_automodel/components/distributed/optimized_tp_plans.py

✅ DTensor Policy Worker Synchronization Check

Both DTensor policy worker files were modified in this PR:

  • nemo_rl/models/policy/dtensor_policy_worker.py
  • nemo_rl/models/policy/dtensor_policy_worker_v2.py

Please ensure that the changes are consistent between both files where applicable.


This check ensures that related file implementations remain synchronized across the codebase. If you believe this warning is incorrect or the files should intentionally differ, please add a comment explaining the reasoning.

@github-actions
Copy link

✅ Submodule Fast-Forward Check Results

Check based on commit: 5eacd3f (PR #794 from ffrujeri/automodel_integration_dtensor_policy_worker)

✅ Submodules that are properly updated:

Automodel: ✅ New submodule being added

All submodule changes look good! ✨

@terrykong terrykong added this pull request to the merge queue Aug 24, 2025
Merged via the queue into main with commit e861b94 Aug 25, 2025
51 of 56 checks passed
@terrykong terrykong deleted the ffrujeri/automodel_integration_dtensor_policy_worker branch August 25, 2025 01:43
jveronvialard pushed a commit that referenced this pull request Aug 27, 2025
…794)

Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Julien Veron Vialard <jveronvialar@nvidia.com>
soodoshll pushed a commit to soodoshll/RL that referenced this pull request Aug 28, 2025
…VIDIA-NeMo#794)

Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Qidong Su <qidongs@nvidia.com>
skirdey-inflection pushed a commit to skirdey-inflection/RL that referenced this pull request Aug 30, 2025
…VIDIA-NeMo#794)

Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Stanislav Kirdey <stan@inflection.ai>
soodoshll pushed a commit to soodoshll/RL that referenced this pull request Sep 4, 2025
…VIDIA-NeMo#794)

Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Signed-off-by: Qidong Su <qidongs@nvidia.com>
PrinsYin pushed a commit to PrinsYin/RL that referenced this pull request Nov 30, 2025
…VIDIA-NeMo#794)

Signed-off-by: Felipe Vieira Frujeri <ffrujeri@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI:docs Run doctest CI Relating to CI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants