
Improving Long-Horizon Imitation Through Instruction Prediction

This is the official code repository for Improving Long-Horizon Imitation Through Instruction Prediction by Joey Hejna, Pieter Abbeel, and Lerrel Pinto.

This repository includes easy-to-use code to reproduce the main results of the paper, which can be found here.

If you use this repository, please cite our work:

@inproceedings{
hejna2023improving,
title={Improving Long-Horizon Imitation Through Instruction Prediction},
author={Hejna, Joey and Abbeel, Pieter and Pinto, Lerrel},
year={2023},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={37},
url={https://github.com/jhejna/instruction-prediction},
}

Installation

All commands assume the user is in the root of the repository; a condensed version of the full install sequence is shown after the numbered steps.

  1. Create the conda environment using the provided environment files. We recommend using the GPU environment: conda env create -f environment_gpu.yaml. Then activate the conda environment.
  2. Install the BabyAI package included in the repository: cd babyai; pip install -e .
  3. Install the mazebase package included in the repository: cd mazebase; pip install -e .
  4. Install the language_prediction package: pip install -e .
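
For reference, a condensed version of these steps might look like the following sketch. The conda environment name is defined inside environment_gpu.yaml, so the name used here is a placeholder:

conda env create -f environment_gpu.yaml
conda activate <env-name>            # replace with the environment name defined in environment_gpu.yaml
(cd babyai && pip install -e .)      # subshells keep the working directory at the repository root
(cd mazebase && pip install -e .)
pip install -e .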

Usage

Generating Datasets

Datasets for the experiments can be generated using scripts from this repository.

To create the BabyAI datasets for BossLevel used in the paper, run the following command:

python scripts/create_babyai_dataset.py --env BabyAI-BossLevel-v0 --dataset-type traj traj_contrastive --max-mission-len 36 --skip 3 --episodes 50000 --valid-episodes 2500 --path datasets/BabyAIBossLevel_l36_50k --jobs 10

This will create two datasets, one with 'next' images used for unsupervised objectives and one without. The one without can be used in cases where memory is limited. The command also launches 10 parallel jobs, each collecting 5000 demos. The number of jobs can easily be reduced. Datasets for the other BabyAI levels can be created by modifying this command.
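
For example, a dataset for a different level could be generated with a command like the one below. The level name (BabyAI-GoToSeq-v0), mission length, and episode counts are illustrative and should be adjusted to the level you want:

python scripts/create_babyai_dataset.py --env BabyAI-GoToSeq-v0 --dataset-type traj traj_contrastive --max-mission-len 36 --skip 3 --episodes 50000 --valid-episodes 2500 --path datasets/BabyAIGoToSeq_l36_50k --jobs 10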

To create the dataset for the crafting environment, you will first need to download the raw JSON dataset from this repository. Then run the following command:

python scripts/create_crafting_dataset.py --input-path path/to/json/file --output-path datasets/crafting

Training Models

To train a model, first update the corresponding configuration file in the configs folder with the path to the created dataset (the default should work if you ran the commands above) and the desired parameters. Then run the following command:

python scripts/train.py --config configs/path/to/config --save-path path/to/output

Here are some example experiments from the paper:

  1. BabyAI with Language Prediction and ATC:
python scripts/train.py --config configs/babyai/dt.yaml --save-path output/babyai/50k_lang0.7_unsup0.7
  2. The Crafting Environment with Language Prediction and ATC:
python scripts/train.py --config configs/ayh/vit.yaml --save-path output/ayh/dataset0.4_lang0.25_unsup0.25

Note that the first time you run the Ask Your Humans (crafting) experiments, the GloVe embeddings will be downloaded by torchtext to .vector_cache.

The configs are self-explanatory and can be modified to easily recreate the other experiments from the paper. If there is interest, I can also add the experiment sweeper that runs all of the experiments at once to this repo.
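
Until then, a simple sweep can be approximated with a shell loop over the config files. This is just a sketch that reuses the documented train.py flags and assumes the BabyAI configs live in configs/babyai/; the output naming follows the pattern above:

for cfg in configs/babyai/*.yaml; do
    name=$(basename "$cfg" .yaml)
    python scripts/train.py --config "$cfg" --save-path "output/babyai/$name"
done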

Monitoring Jobs

Jobs automatically write TensorBoard logs to the specified save path. You can view them with TensorBoard.
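
For example, point TensorBoard at the training output directory:

tensorboard --logdir path/to/output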

Testing Models

The final models are tested on held-out levels. Results are logged during training, but can also be computed at the end using the following commands. Note that it's important to match the options to those in the provided configuration files, or results may be inconsistent. The script will evaluate all models in a given folder. Here is the evaluation command for BabyAI:

python scripts/test.py --path path/to/model --best --num-ep 500 --eval-mode

Here is the evaluation command for Crafting on only the unseen three-step levels. A list of all environment configurations for crafting can be found in `language_prediction/envs/mazebase.py`.

python scripts/test.py --path path/to/model --best --num-ep 500 --override env_kwargs.config unseen_3only
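
Other crafting splits can be evaluated by swapping the config name passed to --override. The name below is a placeholder; the valid options are listed in language_prediction/envs/mazebase.py:

python scripts/test.py --path path/to/model --best --num-ep 500 --override env_kwargs.config <config-name>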
