MBRCSL

Code for experiment results in paper "Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning".

Installation

Install dependencies by running

$ conda env create -f environment.yml
$ conda activate mbrcsl

Add this repository directory to your PYTHONPATH environment variable.

export PYTHONPATH="$PYTHONPATH:$(pwd)"

Run Experiments

Point Maze

First, you can download the pointmaze dataset from this Google drive link. We recommend to store it under directory ../rl_dataset.

Then, you can test our main algorithm (MBRCSL) as well as other baselines in the paper by running

bash scripts/pointmaze/run.sh

Note: If you store the dataset in a customized directory, you need to modify the data_dir variable in the shell script accordingly.

To compare different output policies (MBCQL, MBRCSL-Gaussian), you need to run MBRCSL algorithm first to get the rollout dataset rollout.dat. By default, it will appear under a directory of the form logs/pointmaze/mbrcsl/timestamp_[TIME]&[SEED]/rollout/checkpoint, where [TIME] is the timestamp of the experiment (e.g., 23-1021-054021), and [SEED] is an integer of the experiment seed. After that, you can test MBCQL and MBRCSL-Gaussian by running

bash scripts/pointmaze/ablation.sh

Remember to modify the command option --rollout_ckpt_path to the correct directory that contains the rollout dataset.

Simulated robotics

You need to first install roboverse, which implemented the environments.

We use the same datasets as in COG. Please download the datasets from their Google drive link. We recommend to store these datasets under directory ../rl_dataset.

To test our main algorithm (MBRCSL) as well as baselines (except for MBCQL) in the paper, you can run

bash scripts/[TASK]/run.sh

Here [TASK] can be pickplace, doubledraweropen or doubledrawercloseopen, corresponding to tasks PickPlace, ClosedDrawer and BlockedDrawer in the paper, respectively.

Note: If you store the datasets in a customized directory, you need to modify the data_dir variable in the shell script accordingly.

To test MBCQL algorithm, you need to run MBRCSL algorithm first to get the rollout dataset rollout.dat. By default, it will appear under a directory of the form logs/[TASK]/mbrcsl/timestamp_[TIME]&[SEED]/rollout/checkpoint, where [TASK] is the name of the task (pickplace, doubledraweropen or doubledrawercloseopen), [TIME] is the timestamp of the experiment (e.g., 23-1021-054021), and [SEED] is an integer of the experiment seed. After that, you can test MBCQL by running

bash scripts/[TASK]/ablation.sh

Remember to modify the command option --rollout_ckpt_path to the correct directory that contains the rollout dataset.

Acknowledgement

The framework of this repository, including part of the baseline algorithms (COMBO, MOPO, CQL) are built upon OfflineRL-Kit. The DT and %BC baselines are built upon Decision Transformer.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
envs		envs
examples		examples
offlinerlkit		offlinerlkit
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

envs

envs

examples

examples

offlinerlkit

offlinerlkit

scripts

scripts

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

environment.yml

environment.yml

Repository files navigation

MBRCSL

Installation

Run Experiments

Point Maze

Simulated robotics

Acknowledgement

About

Releases

Packages

Languages

License

zhaoyizhou1123/mbrcsl

Folders and files

Latest commit

History

Repository files navigation

MBRCSL

Installation

Run Experiments

Point Maze

Simulated robotics

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Languages