Use poetry to lock dependencies (#157)

* Use poetry to lock dependencies * Add mujoco support * Quikc fix * Update poetry * Add horovod * add wandb * update poetry file * Add envpool and documentation * Update brax documentation Co-authored-by: Costa Huang <costah@nvidia.com>
Denys88 · May 26, 2022 · adac4da · adac4da
1 parent 106cea3
commit adac4da
Show file tree

Hide file tree

Showing 7 changed files with 2,447 additions and 9 deletions.
diff --git a/.gitignore b/.gitignore
@@ -1,3 +1,4 @@
+wandb
 # Byte-compiled / optimized / DLL files
 __pycache__/
 *.py[cod]

diff --git a/README.md b/README.md
@@ -65,7 +65,16 @@ To run Mujoco, Atari games or Box2d based environments training they need to be
 To run Atari also ```pip install opencv-python``` is required. In addition installation of envpool for maximum perf is highly recommended: ```pip install envpool```
 
 
-## Training
+# Development setup
+
+
+```bash
+poetry install
+# install cuda related dependencies
+poetry run pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html
+```
+
+# Training
 **NVIDIA Isaac Gym**
 
 Download and follow the installation instructions of Isaac Gym: https://developer.nvidia.com/isaac-gym  
@@ -90,13 +99,20 @@ And IsaacGymEnvs: https://github.com/NVIDIA-Omniverse/IsaacGymEnvs
 
 *Atari Pong*
 
-```python runner.py --train --file rl_games/configs/atari/ppo_pong.yaml```
-```python runner.py --play --file rl_games/configs/atari/ppo_pong.yaml --checkpoint nn/PongNoFrameskip.pth```
+```bash
+poetry install -E atari
+poetry run python runner.py --train --file rl_games/configs/atari/ppo_pong.yaml
+poetry run python runner.py --play --file rl_games/configs/atari/ppo_pong.yaml --checkpoint nn/PongNoFrameskip.pth
+```
 
 *Brax Ant*
 
-```python runner.py --train --file rl_games/configs/brax/ppo_ant.yaml```
-```python runner.py --play --file rl_games/configs/brax/ppo_ant.yaml --checkpoint runs/Ant_brax/nn/Ant_brax.pth```
+```bash
+poetry install -E brax
+poetry run pip install --upgrade "jax[cuda]==0.3.13" -f https://storage.googleapis.com/jax-releases/jax_releases.html
+poetry run python runner.py --train --file rl_games/configs/brax/ppo_ant.yaml
+poetry run python runner.py --play --file rl_games/configs/brax/ppo_ant.yaml --checkpoint runs/Ant_brax/nn/Ant_brax.pth
+```
 
 
 ## Config Parameters

diff --git a/docs/ATARI_ENVPOOL.md b/docs/ATARI_ENVPOOL.md
@@ -1,7 +1,13 @@
 # Atari with Envpool (https://envpool.readthedocs.io/en/latest/)  
 
 ## How to run:  
-* **Pong** ```python runner.py --train --file rl_games/configs/atari/ppo_pong_envpool.yaml```
+* **Pong** 
+
+```
+poetry install -E envpool
+poetry run pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html
+poetry run python runner.py --train --file rl_games/configs/atari/ppo_pong_envpool.yaml
+```
 
 ## Results:  
 * **Pong-v5** 2 minutes training time to achieve 20+ score.

diff --git a/docs/BRAX.md b/docs/BRAX.md
@@ -1,8 +1,18 @@
 # Brax (https://github.com/google/brax)  
 
 ## How to run:  
-* **Ant** ```python runner.py --train --file rl_games/configs/brax/ppo_ant.yaml```
-* **Humanoid** ```python runner.py --train --file rl_games/configs/brax/ppo_humanoid.yaml```
+
+* **Setup**
+
+```bash
+poetry install -E brax
+poetry run pip install --upgrade "jax[cuda]==0.3.13" -f https://storage.googleapis.com/jax-releases/jax_releases.html
+poetry run pip install torch==1.10.2+cu113 torchvision==0.11.3+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html
+```
+
+* **Ant** ```poetry run python runner.py --train --file rl_games/configs/brax/ppo_ant.yaml```
+* **Humanoid** ```poetry run python runner.py --train --file rl_games/configs/brax/ppo_humanoid.yaml```
+
 ## Visualization of the trained policy:  
 * **brax_visualization.ipynb**
 

diff --git a/docs/MUJOCO_ENVPOOL.md b/docs/MUJOCO_ENVPOOL.md
@@ -1,7 +1,12 @@
 # Mujoco (https://github.com/deepmind/mujoco)  
 
 ## How to run:
-* **Humanoid** ```python runner.py --train --file rl_games/configs/mujoco/humanoid_envpool.yaml```
+* **Humanoid** 
+```
+poetry install -E envpool
+poetry run pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html
+poetry run python runner.py --train --file rl_games/configs/mujoco/humanoid_envpool.yaml
+```
 
 ## Results:
 * **HalfCheetah-v4**