Implement PPO MPI (SB2 PPO1) #11

araffin · 2020-11-26T10:25:19Z

MPI can be quite useful to use multiprocessing full potential
but it is dependency that can be tricky to install.

vwxyzjn · 2022-04-19T20:13:53Z

Hey @araffin I prototyped multi-GPU support with torch.distributed vwxyzjn/cleanrl#162. Preliminary experiments seem successful when controlling torch thread number to 1 per process and use SyncVecEnv:

ppo_atari_multigpu_batch_reduce.py was able to obtain 20-30% speed up at no cost of sample efficiency by leveraging data parallelism given by torch.distributed.

araffin · 2022-04-20T08:59:50Z

@vwxyzjn ooh nice =)
I didn't know you could do that with PyTorch (it looks like they included MPI but for GPU).
I will try to have a look later this week.
Does this work also with cpu only?

araffin added enhancement New feature or request help wanted Help from contributors is needed labels Nov 26, 2020

araffin mentioned this issue Jan 18, 2021

[question] EvalCallback using MPI hill-a/stable-baselines#1069

Open

araffin mentioned this issue Feb 1, 2021

[Question] How to use multiple cores with PPO? DLR-RM/stable-baselines3#217

Closed

araffin mentioned this issue Mar 30, 2021

[Question] Is there a way to parallelise PPO so that it can run on GPU? DLR-RM/stable-baselines3#374

Closed

araffin mentioned this issue Jun 20, 2021

Any limitation on number of environments? DLR-RM/stable-baselines3#482

Closed

araffin mentioned this issue Sep 16, 2021

[Question] PPO multi-processing in SpinUp vs SB3 DLR-RM/stable-baselines3#571

Closed

2 tasks

sgillen mentioned this issue Sep 22, 2021

Augmented Random Search (ARS) #42

Merged

15 tasks

araffin mentioned this issue Sep 28, 2021

MPIVecEnv #45

Open

araffin mentioned this issue Nov 30, 2021

Questions about CPU utilization DLR-RM/stable-baselines3#682

Closed

araffin mentioned this issue Aug 21, 2023

[Question] Miscellaneous questions DLR-RM/stable-baselines3#1650

Closed

4 tasks

araffin mentioned this issue May 22, 2024

[Question] Running Multi-threaded PPO training independently with no interference DLR-RM/stable-baselines3#1931

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement PPO MPI (SB2 PPO1) #11

Implement PPO MPI (SB2 PPO1) #11

araffin commented Nov 26, 2020 •

edited

Loading

vwxyzjn commented Apr 19, 2022 •

edited

Loading

araffin commented Apr 20, 2022

Implement PPO MPI (SB2 PPO1) #11

Implement PPO MPI (SB2 PPO1) #11

Comments

araffin commented Nov 26, 2020 • edited Loading

vwxyzjn commented Apr 19, 2022 • edited Loading

araffin commented Apr 20, 2022

araffin commented Nov 26, 2020 •

edited

Loading

vwxyzjn commented Apr 19, 2022 •

edited

Loading