Pull requests: pytorch/rl
[Refactor] Pass all keys at reset (prototype)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2956
opened May 15, 2025 by
vmoens
10 tasks
[Feature] empty_lazy for lazy tensor storages
CLA Signed
enhancement
New feature or request
#2955
opened May 14, 2025 by
vmoens
[BugFix] Base transform applies _call on reset
bug
Something isn't working
CLA Signed
#2913
opened Apr 22, 2025 by
louisfaury
2 of 9 tasks
[Bugfix] Fix VecNorm eps usage
CLA Signed
#2866
opened Mar 22, 2025 by
lin-erica
2 of 10 tasks
v0 param server (using collectives not object store)
CLA Signed
#2865
opened Mar 21, 2025 by
mikaylagawarecki
Draft
[Test] Add PEnv tests for devices
CLA Signed
#2843
opened Mar 10, 2025 by
vmoens
[DEBUG] ppo compile
CLA Signed
#2814
opened Feb 27, 2025 by
IvanKobzarev
10 tasks
[Feature,Deprecation] Split KLRewardTransform in more modules
CLA Signed
#2813
opened Feb 27, 2025 by
vmoens
[Feature,Example] Add MCTS algorithm and example
CLA Signed
Examples
#2796
opened Feb 19, 2025 by
kurtamohler
[DRAFT] ppo chess with llm and ConditionalPolicySwitch to sunfish bot
CLA Signed
#2763
opened Feb 5, 2025 by
mikaylagawarecki
Draft
[Example] Self-play chess PPO example
CLA Signed
Examples
#2709
opened Jan 21, 2025 by
vmoens
[WIP] Compute lp during loss execution
CLA Signed
#2688
opened Jan 10, 2025 by
vmoens
[Tutorial] MCTS
CLA Signed
#2673
opened Dec 19, 2024 by
vmoens
First draft for modular Hindsight Experience Replay Transform
CLA Signed
enhancement
[Tutorial] Beam search with GPT models
CLA Signed
tutorials
#2623
opened Dec 2, 2024 by
vmoens
[Feature] PPOTrainer
CLA Signed
#2550
opened Nov 11, 2024 by
vmoens
[Feature] habitat env from config
bug
CLA Signed
enhancement
#2539
opened Nov 6, 2024 by
vmoens
10 tasks
[Examples] boiler plate code for multi-turn reward for RLHF
CLA Signed
enhancement
#2467
opened Oct 5, 2024 by
rghosh08
3 of 10 tasks
[Algorithm] Update scripts with compile
CLA Signed
#2449
opened Sep 23, 2024 by
vmoens
[Feature] RB compability with compile
CLA Signed
enhancement
#2426
opened Sep 9, 2024 by
vmoens
[CI] Add benchmarks to test runs
CI
Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed
#2410
opened Sep 2, 2024 by
vmoens
[Feature] non-functional SAC loss
CLA Signed
#2393
opened Aug 13, 2024 by
vmoens
[Feature] use_vmap=False for SAC
CLA Signed
enhancement
#2392
opened Aug 13, 2024 by
vmoens
[Algorithm] TD3 fast
CLA Signed
#2389
opened Aug 10, 2024 by
vmoens
[Doc] Better doc for distributed RBs
CLA Signed
#2378
opened Aug 7, 2024 by
vmoens