Offline Hierarchical Reinforcement Learning

This is a jax implementation of LPD on Datasets for Deep Data-Driven Reinforcement Learning (D4RL), the corresponding paper is Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery.

Quick Start

For experiments on D4RL, our code is implemented based on IQL:

First train the flow model:

$ python3 flow.py

Then, run the following code:

$ python3 train_offline.py --env_name=antmaze-large-play-v0 --config=configs/antmaze_config.py --eval_episodes=10 --eval_interval=5000

Citing

If you find this open source release useful, please reference in your paper (it is our honor):

@inproceedings{yang2023flow,
  title={Flow to control: Offline reinforcement learning with lossless primitive discovery},
  author={Yang, Yiqin and Hu, Hao and Li, Wenzhe and Li, Siyuan and Yang, Jun and Zhao, Qianchuan and Zhang, Chongjie},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={37},
  number={9},
  pages={10843--10851},
  year={2023}
}

Note

If you have any questions, please contact me: yangyiqi19@mails.tsinghua.edu.cn.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
__pycache__		__pycache__
configs		configs
tmp		tmp
wrappers		wrappers
LICENSE		LICENSE
README.md		README.md
actor.py		actor.py
common.py		common.py
critic.py		critic.py
dataset_utils.py		dataset_utils.py
evaluation.py		evaluation.py
flow.py		flow.py
framework.png		framework.png
learner.py		learner.py
policy.py		policy.py
requirements.txt		requirements.txt
train_finetune.py		train_finetune.py
train_offline.py		train_offline.py
utils.py		utils.py
value_net.py		value_net.py

License

YiqinYang/LPD

Folders and files

Latest commit

History

Repository files navigation

Offline Hierarchical Reinforcement Learning

Quick Start

Citing

Note

About

Resources

License

Stars

Watchers

Forks

Languages