Skip to content

chinganc/lightATAC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lightATAC

This is a lightweight reimplementation of Adversarially Trained Actor Critic (ATAC), a model-free offline reinforcement learning algorithm with SoTA performance on D4RL by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal (https://arxiv.org/abs/2202.02446).

To install, simply clone the repo and run pip install -e . . Then you can start the training by, e.g.,

python main.py --log_dir ./tmp_results --env_name hopper-medium-expert-v2 --beta 1.0

More instructions can be found in main.py, and please see the original paper for hyperparameters (e.g., beta). The code was tested with python 3.9.

The experimental results of lightATAC (over different $\beta$ values) on D4RL mujoco datasets can be viewed at https://tensorboard.dev/experiment/6RwXhalaQeWNmQNHDGvaFA.

This reimplementation is based on gwthomas/IQL-PyTorch. It is minimalistic, so users can easily modify it for their needs. It follows mostly the logic in the original ATAC code, but with some code optimization leading to 1.5X-2X speed up.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published