Skip to content

RedmondLabUCD/Behaviour-aware-clustering-for-offline-policy-learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Behaviour-aware clustering for offline policy learning

Official implementation of paper: Dataset Clustering for Improved Offline Policy Learning

Download datasets

Our datasets can be downloaded from THIS LINK. This link provides multi-behavior datasets with labels, which serve as the ground truth for evaluating clustering results. Additionally, it includes the policies trained using stable-baselines3 for generating the multi-behavior datasets.

It should be noted that all datasets include observations, actions, rewards, terminals, and labels, making them suitable for training policies as well.

The locomotion datasets and robotic hand manipulation datasets are created by us, while the trifinger datasets are created based on an open-source project.

Using our code

Installation

Our project can be installed by cloning the repository as follows:

https://github.com/wq13552463699/Behaviour-aware-clustering-for-offline-policy-learning.git

Or you can download this GitHub project and unzip it locally.

Then you can install the required libraries by running:

pip install -r requirements.txt

Run experiments

You can run the experiments by performing:

python main.py --exp-name <name> --raw-dataset-path <local path of multi-behaviour dataset> --save-path <local path>

This command includes only a subset of the hyperparameters required to execute the experiments. You can find the remaining hyperparameters in the main.py file.

After the clustering process terminates as convergence, a file estimated_traj_labels.pkl will be created in the specified save path. This file contains the clustering results as discrete labels, which can then be compared with the ground truth labels for evaluation.

Pretrained results

Our pretrained results can be accessed by following THIS LINK, which contains the tuned hyperparameters, the clustering results, and the trained neural network models.

Cite this work

@article{wang-clustering,
  title={Dataset Clustering for Improved Offline Policy Learning},
  author={Wang, Qiang and Deng, Yixin and Sanchez, Francisco Roldan and Wang, Keru and McGuinness, Kevin and O'Connor, Noel and Redmond, Stephen J},
  journal={arXiv preprint arXiv:2402.09550},
  year={2024}
}

About

Official implementation of paper: Dataset Clustering for Improved Offline Policy Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages