Giulero/deep_deterministic_policy_gradient

Deep Deterministic Policy Gradient on HalfCheetah-v2

Dependencies

  • tensorflow
  • gym
  • mujoco-py

Run!

In a terminal, run python main.py --mode train to train the agent, or python main.py --mode test to test it.
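For readers who want to see how a --mode switch of this kind is usually wired up, here is a minimal sketch using argparse. It is not taken from the repository's main.py: the function names parse_args and main, the help text, and the placeholder return values are all assumptions made for illustration.

```python
import argparse


def parse_args(argv=None):
    """Parse the command line; only --mode comes from the README."""
    parser = argparse.ArgumentParser(
        description="DDPG on HalfCheetah-v2 (illustrative sketch)"
    )
    parser.add_argument(
        "--mode",
        choices=["train", "test"],  # the two values named in the README
        required=True,
        help="train the agent or test it",
    )
    return parser.parse_args(argv)


def main(argv=None):
    args = parse_args(argv)
    if args.mode == "train":
        # Hypothetical placeholder: the real script would start training here.
        return "training"
    # Hypothetical placeholder: the real script would run the policy here.
    return "testing"
```

Calling main(["--mode", "train"]) takes the training branch. Any value other than train or test is rejected by argparse before the program logic runs.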

Results

After about 18,000 episodes, the mean reward converges to 2700.

[Figures: cheetah_rew1, cheetah_rew2]
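The README does not say how the mean reward is computed. One common choice is a running mean over the most recent episodes, sketched below. The function name running_mean and the 100-episode window are assumptions for illustration, not details from this repository.

```python
from collections import deque


def running_mean(episode_rewards, window=100):
    """Return the mean of the last `window` rewards after each episode.

    The window size is an assumed value; early entries average over
    however many episodes are available so far.
    """
    recent = deque(maxlen=window)  # drops the oldest reward automatically
    means = []
    for reward in episode_rewards:
        recent.append(reward)
        means.append(sum(recent) / len(recent))
    return means
```

For example, running_mean([1, 2, 3, 4], window=2) averages each reward with the one before it. Plotting the returned list against the episode index produces a smoothed reward curve.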

About

DDPG implementation. Tested with cheetah in Mujoco.
