Behaviour-aware clustering for offline policy learning

Official implementation of the paper "Dataset Clustering for Improved Offline Policy Learning".

Download datasets

Our datasets can be downloaded from THIS LINK. This link provides multi-behavior datasets with labels, which serve as the ground truth for evaluating clustering results. Additionally, it includes the policies trained using stable-baselines3 for generating the multi-behavior datasets.

It should be noted that all datasets include observations, actions, rewards, terminals, and labels, making them suitable for training policies as well.
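Because every dataset carries terminal flags, a flat sequence of transitions can be cut back into trajectories before clustering or policy training. The sketch below is only an illustration under assumptions: it assumes a loaded dataset behaves like a dictionary with the keys `observations`, `actions`, `rewards`, `terminals`, and `labels`, and that every field (including `labels`) holds one entry per step. The on-disk format of the released files is not described here, and `split_into_trajectories` is a hypothetical helper, not part of this repository.

```python
# Hypothetical helper: the key names and per-step layout are assumptions,
# not the documented format of the released datasets.
FIELDS = ("observations", "actions", "rewards", "terminals", "labels")


def split_into_trajectories(dataset):
    """Split flat per-step fields into one dict per trajectory.

    A trajectory ends at every step whose terminal flag is truthy.
    Trailing steps without a terminal flag form a final, unterminated trajectory.
    """
    n_steps = len(dataset["terminals"])
    for key in FIELDS:
        if len(dataset[key]) != n_steps:
            raise ValueError(f"field {key!r} has a different length than 'terminals'")

    trajectories = []
    start = 0
    for step, done in enumerate(dataset["terminals"]):
        if done:
            # Include the terminal step itself in the trajectory.
            trajectories.append({k: list(dataset[k][start:step + 1]) for k in FIELDS})
            start = step + 1

    if start < n_steps:
        # Keep leftover steps that never reached a terminal flag.
        trajectories.append({k: list(dataset[k][start:]) for k in FIELDS})
    return trajectories
```

If the labels turn out to be stored per trajectory rather than per step, only the slicing of `labels` would need to change.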

The locomotion and robotic hand manipulation datasets were created by us, while the TriFinger datasets are based on an open-source project.

Using our code

Installation

Our project can be installed by cloning the repository as follows:

git clone https://github.com/wq13552463699/Behaviour-aware-clustering-for-offline-policy-learning.git

Alternatively, download the project from GitHub as a ZIP archive and extract it locally.

Then you can install the required libraries by running:

pip install -r requirements.txt

Run experiments

Run the experiments with:

python main.py --exp-name <name> --raw-dataset-path <local path of multi-behaviour dataset> --save-path <local path>

This command includes only a subset of the hyperparameters required to execute the experiments. You can find the remaining hyperparameters in the main.py file.
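When clustering several datasets in a row, it can be convenient to launch the command from a small script. The sketch below uses only the three flags shown above; `build_command` and `run_experiment` are hypothetical helpers, and it assumes the script is started from the repository root with `python` on the PATH. Any other hyperparameter would have to be looked up in main.py and added by hand.

```python
import subprocess


def build_command(exp_name, raw_dataset_path, save_path):
    """Return the main.py invocation as an argument list.

    Only the flags documented in this README are used.
    """
    return [
        "python", "main.py",
        "--exp-name", exp_name,
        "--raw-dataset-path", raw_dataset_path,
        "--save-path", save_path,
    ]


def run_experiment(exp_name, raw_dataset_path, save_path):
    """Run one clustering experiment and raise if main.py exits with an error."""
    cmd = build_command(exp_name, raw_dataset_path, save_path)
    subprocess.run(cmd, check=True)


# Example usage (paths are placeholders, not real files):
# run_experiment("demo", "path/to/multi_behaviour_dataset", "path/to/results")
```

Passing the command as a list avoids shell quoting problems when a path contains spaces.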

Once the clustering process converges and terminates, a file named estimated_traj_labels.pkl is created in the specified save path. This file contains the clustering results as discrete labels, which can then be compared with the ground-truth labels for evaluation.
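Cluster indices are arbitrary, so the estimated labels should be compared with the ground truth using a measure that ignores how clusters are numbered. The sketch below shows two such measures, purity and the Rand index, in plain Python. It is an illustration rather than the evaluation protocol of the paper, and it assumes that estimated_traj_labels.pkl unpickles to a flat sequence of integer labels, one per trajectory, aligned with the ground-truth labels. `load_labels`, `purity`, and `rand_index` are hypothetical helper names.

```python
import pickle
from collections import Counter
from itertools import combinations


def load_labels(path):
    """Load a pickled flat sequence of labels (assumed layout) as a list of ints."""
    with open(path, "rb") as f:
        labels = pickle.load(f)
    return [int(x) for x in labels]


def purity(true_labels, pred_labels):
    """Fraction of items that carry the majority ground-truth label of their cluster."""
    if len(true_labels) != len(pred_labels) or not true_labels:
        raise ValueError("label sequences must be non-empty and of equal length")
    per_cluster = {}
    for t, p in zip(true_labels, pred_labels):
        per_cluster.setdefault(p, Counter())[t] += 1
    majority_total = sum(c.most_common(1)[0][1] for c in per_cluster.values())
    return majority_total / len(true_labels)


def rand_index(true_labels, pred_labels):
    """Fraction of item pairs on which both labelings agree (same vs. different group)."""
    if len(true_labels) != len(pred_labels) or len(true_labels) < 2:
        raise ValueError("need two equal-length sequences with at least two items")
    agree = 0
    total = 0
    for i, j in combinations(range(len(true_labels)), 2):
        same_true = true_labels[i] == true_labels[j]
        same_pred = pred_labels[i] == pred_labels[j]
        agree += same_true == same_pred
        total += 1
    return agree / total


# Example usage (the ground-truth list is a placeholder):
# pred = load_labels("path/to/results/estimated_traj_labels.pkl")
# print(purity(ground_truth, pred), rand_index(ground_truth, pred))
```

Both scores equal 1.0 when the clustering matches the ground truth up to a renaming of the clusters. The pairwise loop is quadratic in the number of trajectories, which is acceptable for a few thousand labels but slow beyond that.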

Pretrained results

Our pretrained results can be accessed by following THIS LINK, which contains the tuned hyperparameters, the clustering results, and the trained neural network models.

Cite this work

@article{wang-clustering,
  title={Dataset Clustering for Improved Offline Policy Learning},
  author={Wang, Qiang and Deng, Yixin and Sanchez, Francisco Roldan and Wang, Keru and McGuinness, Kevin and O'Connor, Noel and Redmond, Stephen J},
  journal={arXiv preprint arXiv:2402.09550},
  year={2024}
}
