[Feature] Feat/diffusion bc loss by theap06 · Pull Request #3604 · pytorch/rl

theap06 · 2026-04-08T16:20:57Z

[Feature] Add DiffusionBCLoss objective and Pendulum BC example" --body

Summary

Fixes #3149

Implements DiffusionBCLoss in torchrl/objectives/diffusion_bc.py — a LossModule subclass that computes the ε-prediction (noise-prediction) denoising loss from Diffusion Policy (Chi et al., RSS 2023)
Completes Phase-1 of the diffusion policy feature alongside DiffusionActor ([Feature] Diffusion Actor DDPMModule #3596)
Adds 17 unit tests and an end-to-end Pendulum-v1 BC example

Design

The loss:

Samples a random timestep t per batch element
Corrupts the clean demonstration action via _DDPMModule.add_noise(clean_action, t) (forward diffusion)
Runs the score network on (noisy_action || observation || t)
Returns MSE between predicted noise and actual noise as loss_diffusion_bc

Supports set_keys() for observation/action key remapping and configurable reduction.

Files changed

File	Description
`torchrl/objectives/diffusion_bc.py`	`DiffusionBCLoss` implementation
`torchrl/objectives/__init__.py`	Register `DiffusionBCLoss`
`test/objectives/test_diffusion_bc.py`	17 tests (output keys, backward, gradient flow, custom keys, convergence)
`examples/diffusion_bc_pendulum.py`	End-to-end BC training on Pendulum-v1

Test plan

pytest test/objectives/test_diffusion_bc.py — 17/17 passing
pre-commit run — all hooks passing
Forward + backward smoke tested locally

…tioned on observations using a fixed linear-beta DDPM scheduler, following Diffusion Policy (Chi et al., RSS 2023).

… into feat/diffusion-actor

Implements the ε-prediction denoising loss from Diffusion Policy (Chi et al., RSS 2023) as a TorchRL LossModule, completing Phase-1 of the diffusion policy feature alongside DiffusionActor (pytorch#3596). - torchrl/objectives/diffusion_bc.py: DiffusionBCLoss subclassing LossModule, uses _DDPMModule.add_noise() for the forward diffusion step and computes MSE between predicted and actual noise. Supports configurable reduction and set_keys() for observation/action key remapping. - torchrl/objectives/__init__.py: register DiffusionBCLoss in alphabetical order. - test/objectives/test_diffusion_bc.py: 17 tests covering output keys, scalar loss, backward, gradient flow, reduction modes, custom keys, and a training convergence check. - examples/diffusion_bc_pendulum.py: end-to-end BC training on Pendulum-v1 with expert data collection, training loop, and evaluation.

pytorch-bot · 2026-04-08T16:21:02Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3604

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

⚠️ 17 Awaiting Approval

As of commit ef2554c with merge base f54a7c7 ():

AWAITING APPROVAL - The following workflows need approval before CI can run:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

theap06 and others added 7 commits April 5, 2026 03:10

Implements a diffusion-based actor that denoises latent actions condi…

e875678

…tioned on observations using a fixed linear-beta DDPM scheduler, following Diffusion Policy (Chi et al., RSS 2023).

Merge remote-tracking branch 'origin/main' into feat/diffusion-actor

4fac135

linter

bef0729

encorporated the changes for the repetitive code

b17f523

Merge branch 'feat/diffusion-actor' of https://github.com/achintya-p/rl…

15cea02

… into feat/diffusion-actor

fixed linting

ef3a480

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 8, 2026

github-actions bot added Feature New feature Examples Objectives Modules labels Apr 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Feat/diffusion bc loss#3604

[Feature] Feat/diffusion bc loss#3604
theap06 wants to merge 7 commits intopytorch:mainfrom
theap06:feat/diffusion-bc-loss

theap06 commented Apr 8, 2026

Uh oh!

pytorch-bot bot commented Apr 8, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

theap06 commented Apr 8, 2026

Summary

Design

Files changed

Test plan

Uh oh!

pytorch-bot bot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3604

⚠️ 17 Awaiting Approval

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot bot commented Apr 8, 2026 •

edited

Loading