[Bugfix] Fix CUDA/CPU mismatch in threaded training #6245
Conversation
Updated tensor creation in torch_policy.py and utils.py to explicitly use the default device, ensuring consistency across devices (CPU/GPU). Also set torch config in TrainerController to use the default device. This improves device management and prevents potential device mismatch errors.
Updated tensor creation in optimizers, reward providers, and network normalization to explicitly use the configured default_device. Removed redundant set_torch_config call in trainer_controller to avoid interfering with PyTorch's global device context. These changes improve device consistency and prevent device mismatch errors in multi-threaded or multi-device training scenarios.
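Below is a minimal sketch of the explicit-device pattern described above. The helper name `default_device()` matches the utility referenced in the PR, but the body shown here is an illustrative stand-in, not the project's actual implementation:

```python
# Sketch of the explicit-device pattern (illustrative, not the exact ml-agents source).
from typing import List
import numpy as np
import torch


def default_device() -> torch.device:
    # Stand-in for the trainer's configured device accessor.
    return torch.device("cuda" if torch.cuda.is_available() else "cpu")


def list_to_tensor(
    ndarray_list: List[np.ndarray], dtype: torch.dtype = torch.float32
) -> torch.Tensor:
    # Before the fix, torch.as_tensor(...) inherited whatever device the calling
    # thread happened to default to; now the device is passed explicitly.
    return torch.as_tensor(
        np.asanyarray(ndarray_list), dtype=dtype, device=default_device()
    )
```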
Pull Request Overview
This PR fixes CUDA/CPU device mismatch errors that occur during threaded training on Windows by making tensor device placement explicit and consistent across the codebase.
- Ensures all tensor creation operations use default_device() to maintain device consistency
- Fixes issues where tensors were implicitly created on different devices in multi-threaded environments
- Updates utilities, networks, optimizers, and reward providers to use explicit device placement
Reviewed Changes
Copilot reviewed 6 out of 6 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| torch_entities/utils.py | Updates tensor creation utilities to allocate on the default device |
| torch_entities/networks.py | Fixes device placement in vector input normalization |
| torch_entities/components/reward_providers/gail_reward_provider.py | Ensures GAIL reward provider tensors use the correct device |
| policy/torch_policy.py | Makes device placement explicit for masks, observations, and RNN memories |
| poca/optimizer_torch.py | Initializes zero RNN memories on the default device |
| optimizer/torch_optimizer.py | Fixes RNN memory initialization device placement |
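As a hedged illustration of the RNN-memory change listed for poca/optimizer_torch.py and optimizer/torch_optimizer.py, the zero memories are allocated on the configured device rather than the calling thread's implicit default. The shape and `memory_size` parameter below are assumptions for the sketch:

```python
import torch


def make_initial_memories(batch_size: int, memory_size: int) -> torch.Tensor:
    # Allocating the zero memories on the configured device keeps them on the
    # same device as the network parameters, even when this runs in a worker
    # thread whose implicit default would otherwise be the CPU.
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    return torch.zeros((1, batch_size, memory_size), device=device)
```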
Hi, please fix the black reformatting issue and then you should be good to merge the PR. Thanks!
Proposed change(s)
On Windows, running with `threaded: true` produced "tensors on different devices" errors. Threaded trainers create tensors from multiple threads, so implicit CPU allocations (or per-thread changes to the default device) led to CPU/CUDA mixing and PyTorch mode-stack corruption. Making device placement explicit and consistent prevents both classes of errors.
- Create action masks, observations, and RNN memories on default_device() during inference.
- ModelUtils.list_to_tensor() and list_to_tensor_list() now allocate on default_device().
- VectorInput.update_normalization() now uses device-correct tensors (see the sketch after this list).
- Initialize zero RNN memories on default_device().
- Ensure DONE tensors, epsilons, and accumulators allocate on the correct device.
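A minimal sketch of the update_normalization() device handling mentioned above. Buffer names mirror the PR description, but the exact ml-agents internals may differ and the running-statistics math is simplified:

```python
import torch


class VectorInputSketch(torch.nn.Module):
    def __init__(self, input_size: int):
        super().__init__()
        self.register_buffer("running_mean", torch.zeros(input_size))
        self.register_buffer("normalization_steps", torch.tensor(1.0))

    def update_normalization(self, inputs: torch.Tensor) -> None:
        # The device fix: move the incoming batch onto the buffers' device
        # rather than relying on the calling thread's implicit default device.
        inputs = inputs.to(self.running_mean.device)
        batch_mean = inputs.mean(dim=0)
        # Incremental running-mean update (variance update omitted; the device
        # handling is the point of this sketch).
        total = float(self.normalization_steps + inputs.shape[0])
        self.running_mean.lerp_(batch_mean, inputs.shape[0] / total)
        self.normalization_steps += inputs.shape[0]
```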
Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)
Types of change(s)