HumanoidRobotWalk

Implementation of Trust Region Policy Optimization and Proximal Policy Optimization algorithms on the objective of Robot Walk.

Programs & libraries needed in order to run this project

OpenAI Gym : A toolkit for developing and comparing reinforcement learning algorithms
PyBullet Gym : PyBullet Robotics Environments fully compatible with Gym toolkit (uses the Bullet physics engine)
PyTorch : Open source machine learning library based on the Torch library
NumPy : Fundamental package for scientific computing with Python
matplotlib : Plotting library for the Python programming language and its numerical mathematics extension NumPy

Algorithms pseudocodes

Trust Region Policy Optimization (TRPO) - implemented by Vasilije Pantić

Proximal Policy Optimization (PPO) - implemented by Nikola Zubić

How to run?

For TRPO: Run trpo_main.py at root/code/trpo/,
For PPO: Run ppo_main.py at root/code/ppo/,
and enter the absolute file path to the trained model.

Trained models are available at: root/code/trained_models/.

In motion

TRPO

PPO

Numerical results

Training time [h]	24	96
TRPO

Training time [h]	6.5	48
PPO

Name	Name	Last commit message	Last commit date
Latest commit Nikola Zubić Update README.md Mar 9, 2021 1569c5d · Mar 9, 2021 History 37 Commits
.idea	.idea	added final model	Mar 9, 2021
code	code	renamed ppo images	Mar 9, 2021
utils	utils	Added in motion gifs for trpo and ppo	Mar 9, 2021
.gitignore	.gitignore	Environment and rendering set-up	Feb 2, 2021
README.md	README.md	Update README.md	Mar 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HumanoidRobotWalk

Programs & libraries needed in order to run this project

Algorithms pseudocodes

Trust Region Policy Optimization (TRPO) - implemented by Vasilije Pantić

Proximal Policy Optimization (PPO) - implemented by Nikola Zubić

How to run?

In motion

TRPO

PPO

Numerical results

About

Languages

reinai/HumanoidRobotWalk

Folders and files

Latest commit

History

Repository files navigation

HumanoidRobotWalk

Programs & libraries needed in order to run this project

Algorithms pseudocodes

Trust Region Policy Optimization (TRPO) - implemented by Vasilije Pantić

Proximal Policy Optimization (PPO) - implemented by Nikola Zubić

How to run?

In motion

TRPO

PPO

Numerical results

About

Topics

Resources

Stars

Watchers

Forks

Languages