Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning

PyTorch implementation of Learning Diverse Behaviors in Offline RL (DiveOff). If you use our code or data please cite the paper.

We built our datasets based on D4RL. Networks are trained using PyTorch 1.13 and Python 3.8.15.

Usage

To train the DiveOff agent on walker2dvel-diverse-expert-v1, run

python training.py --policy DiveOff --env walker2dvel-diverse-expert-v1

For training on the antvel tasks, we set --info_lr_rate 0.2, e.g.,

python train.py --env antvel-diverse-expert-medium-v1 --info_lr_rate 0.2

To visualize the behaviors learned by DiveOff on walker2dvel-diverse-expert-medium-v1, run

python visualize_diverse_policy.py --env walker2dvel-diverse-expert-medium-v1

Bibtex

When you use our codes for your work, please cite:

@InProceedings{osa2024,
  title={Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning},
  author={Takayuki Osa and Tatsuya Harada},
  booktitle = {Proceedings of the International Conference on Machine Learning (ICML)},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
adaptEnv		adaptEnv
dataset		dataset
models/DiveOff		models/DiveOff
velEnv		velEnv
DiveOff.py		DiveOff.py
README.md		README.md
models.py		models.py
train.py		train.py
utils.py		utils.py
visualize_diverse_policy.py		visualize_diverse_policy.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning

Usage

Bibtex

About

Releases

Packages

Languages

TakaOsa/DiveOff

Folders and files

Latest commit

History

Repository files navigation

Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning

Usage

Bibtex

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages