Skip to content

Pytorch Implementation of Policy Gradient algorithms for Unity(ML-Agent)

License

Notifications You must be signed in to change notification settings

dnddnjs/unity-pg

Repository files navigation

pytorch-trpo

PyTorch implementation of Vanilla Policy Gradient, Truncated Natural Policy Gradient, Trust Region Policy Optimization, Proximal Policy Optimization

Train

  • algorithm: PG, NPG, TRPO, PPO
  • env: Ant-v2, HalfCheetah-v2, Hopper-v2, Humanoid-v2, HumanoidStandup-v2, InvertedPendulum-v2, Reacher-v2, Swimmer-v2, Walker2d-v2
python train.py --algorithm "algorithm name" --env "environment name"

Reference

This code is modified version of codes

About

Pytorch Implementation of Policy Gradient algorithms for Unity(ML-Agent)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published