Twin Delayed DDPG (TD3) in PyTorch

A relatively minimal PyTorch TD3 implementation from scratch. Heavily based on my other repo, SAC-PyTorch.

Implementation Details

This code borrows the hyperparameters from Scott Fujimoto's implementation but with one difference, which is that the network architecture is the same as the SAC paper (barring the additional output units for log-variance). This means there's an extra layer of 256 hidden units.

Get Started

Simply run:

python train_agent.py

for default args. Changeable args are:

--env: String of environment name (Default: HalfCheetah-v2)
--seed: Int of seed (Default: 100)
--use_obs_filter: Boolean that is true when used (seems to degrade performance)
--update_every_n_steps: Int of how many env steps we take before optimizing the agent (Default: 1, ratio of steps v.s. backprop is tied to 1:1)
--n_random_actions: Int of how many random steps we take to 'seed' the replay pool (Default: 25000)
--n_collect_steps: Int of how steps we collect before training  (Default: 1000)
--n_evals: Int of how many episodes we run an evaluation for (Default: 1)
--save_model: Boolean that is true when used (saves model when GIFs are made, loading and running is left as an exercise for the reader (or until I get around to it))

Results

Gets 14,000 on HalfCheetah-v2 at 1.3 million samples. This is better than SAC!

Full graphs TBA; computer died.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
networks.py		networks.py
td3.py		td3.py
train_agent.py		train_agent.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

init.py

init.py

networks.py

networks.py

td3.py

td3.py

train_agent.py

train_agent.py

utils.py

utils.py

Repository files navigation

Twin Delayed DDPG (TD3) in PyTorch

Implementation Details

Get Started

Results

About

Releases

Packages

Languages

License

philipjball/TD3_PyTorch

Folders and files

Latest commit

History

Repository files navigation

Twin Delayed DDPG (TD3) in PyTorch

Implementation Details

Get Started

Results

About

Resources

License

Stars

Watchers

Forks

Languages