Skip to content

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

License

Notifications You must be signed in to change notification settings

Hanrui-Wang/pytorch-ddpg

 
 

Repository files navigation

Deep Deterministic Policy Gradient on PyTorch

Overview

The is the implementation of Deep Deterministic Policy Gradient (DDPG) using PyTorch. Part of the utilities functions such as replay buffer and random process are from keras-rl repo. Contributes are very welcome.

Dependencies

Run

  • Training : results of two environment and their training curves:

    • Pendulum-v0
    $ ./main.py --debug
    alternate text
    • MountainCarContinuous-v0
    $ ./main.py --env MountainCarContinuous-v0 --validate_episodes 100 --max_episode_length 2500 --ou_sigma 0.5 --debug
    alternate text
  • Testing :

$ ./main.py --mode test --debug

TODO

About

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%