parallel running #3

waynezw0618 · 2020-02-12T15:29:08Z

Hi Antymon:
Your PPO code is interesting. I am wondering whether it can be used for parallel training, where each episode is a mpi/openmp solver. I would like to perform simulations on a cluster with each episode as a single node openmp based simulation. do you have any suggestion?

Yours Sincerely

Wei

Antymon · 2020-02-15T14:00:04Z

Hi, this implementation is meant for a single multicore node and even with that respect is rather simple - with multiple environments being called in parallel on each step and centralized network processing resumed once all environments delivered results. TF's graph portability will be useful for you, of course, and perhaps some snippets of my code. I vaguely recall that OpenAI Baselines had MPI implementations in Python that you can look into to port or reuse (there were 2 implementations of PPO initially PPO and PPO2... ). Hope that helps.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

parallel running #3

parallel running #3

waynezw0618 commented Feb 12, 2020

Antymon commented Feb 15, 2020 •

edited

Loading

parallel running #3

parallel running #3

Comments

waynezw0618 commented Feb 12, 2020

Antymon commented Feb 15, 2020 • edited Loading

Antymon commented Feb 15, 2020 •

edited

Loading