GitHub

This repo contains implementation of RL algorithm PPO, PAIR.

Installation

Clone the environment and algorithm repos

git clone git@github.com:IrisLi17/stacking_env.git
git clone git@github.com:IrisLi17/onpolicy_algorithm.git

Download docker image

docker pull irisli20/bullet_torch

Create a container

docker run -it --gpus all -e NVIDIDA_DRIVER_CAPABILITIES=all -v /absolute/path/to/onpolicy_algorithm:/projects/onpolicy_algorithm -v /absolute/path/to/stacking_env:/projects/stacking_env --name your_container_name irisli20/bullet_torch bash

Run experiments inside the container.

cd /projects/onpolicy_algorithm
tmux new -s test
# Train a pick-and-place policy with PPO
python train.py --config config.pick_and_place
# Record video of a trained agent. Images will be saved to tmp/
python train.py --config config.pick_and_place --load_path pretrained_model/pick_and_place.pt --play
# Train stack-6 from a pretrained stack-1 model
python train.py --config config.stacking_pair --load_path pretrained_model/stack1_model.pt

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
config		config
onpolicy		onpolicy
plot		plot
policies		policies
pretrained_model		pretrained_model
utils		utils
vec_env		vec_env
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
convert_jit.py		convert_jit.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Installation

About

Uh oh!

Releases

Packages

Languages

IrisLi17/onpolicy_algorithm

Folders and files

Latest commit

History

Repository files navigation

Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages