JAX Implementation of TD3

JAX implementation of Twin Delayed Deep Deterministic Policy Gradients (TD3) paper.

This code attempts to turn the PyTorch implementation from the original TD3 repository into JAX implementation while making minimal modifications. Training runs about two times as fast as the original PyTorch code on a i7-6700K+GTX-1080 machine.

Code is tested using jaxlib 0.1.61, flax 0.3.0 and Python 3.9.

Example usage:

python main.py --env HalfCheetah-v3

or

./run_experiments.sh

for full experiments.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
plots		plots
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TD3.py		TD3.py
main.py		main.py
plot.py		plot.py
requirements.txt		requirements.txt
run_experiments.sh		run_experiments.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JAX Implementation of TD3

Example Plots

About

Releases

Packages

Languages

License

yifan12wu/td3-jax

Folders and files

Latest commit

History

Repository files navigation

JAX Implementation of TD3

Example Plots

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages