lightATAC

This is a lightweight reimplementation of Adversarially Trained Actor Critic (ATAC), a model-free offline reinforcement learning algorithm with state-of-the-art performance on D4RL, proposed by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal (https://arxiv.org/abs/2202.02446).

To install, clone the repo and run `pip install -e .` from the repository root. You can then start training with, e.g.,

python main.py --log_dir ./tmp_results --env_name hopper-medium-expert-v2 --beta 1.0

More instructions can be found in main.py; see the original paper for hyperparameter choices (e.g., beta). The code was tested with Python 3.9.
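Since beta is a hyperparameter that may need tuning per dataset, it can be convenient to generate one training command per value. The following is a minimal sketch, not part of the repository: it uses only the three flags shown in the command above (`--log_dir`, `--env_name`, `--beta`), while the helper names `build_command` and `build_sweep`, the per-run subdirectory naming, and the listed beta values are illustrative assumptions rather than recommended settings.

```python
import os
import shlex
import sys


def build_command(env_name, beta, log_root="./tmp_results"):
    """Build the argv list for a single run of main.py.

    Only flags that appear in the README command are used. Giving each run
    its own subdirectory is an assumption of this sketch, made so that runs
    with different beta values cannot overwrite each other's logs.
    """
    run_dir = os.path.join(log_root, f"{env_name}_beta{beta}")
    return [
        sys.executable, "main.py",
        "--log_dir", run_dir,
        "--env_name", env_name,
        "--beta", str(beta),
    ]


def build_sweep(env_names, betas, log_root="./tmp_results"):
    """Return one command per (env_name, beta) pair."""
    return [build_command(env, beta, log_root)
            for env in env_names
            for beta in betas]


if __name__ == "__main__":
    # Illustrative values only; consult the paper for actual choices.
    envs = ["hopper-medium-expert-v2"]
    betas = [0.0, 1.0, 4.0]
    for cmd in build_sweep(envs, betas):
        # Print a shell-ready command line; to launch it directly instead,
        # one could call subprocess.run(cmd, check=True) from the repo root.
        print(shlex.join(cmd))
```

Printing the commands rather than executing them keeps the sketch free of side effects; the output can be pasted into a shell or a job scheduler script.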

The experimental results of lightATAC (over different $\beta$ values) on the D4RL MuJoCo datasets can be viewed at https://tensorboard.dev/experiment/6RwXhalaQeWNmQNHDGvaFA.

This reimplementation is based on gwthomas/IQL-PyTorch. It is minimalistic, so users can easily modify it for their needs. It mostly follows the logic of the original ATAC code, with some code optimizations that yield a 1.5x-2x speedup.
