Official code for "Active Policy Improvement from Multiple Black-box Oracles":
Xuefeng Liu*, Takuma Yoneda*, Chaoqi Wang*, Matthew R. Walter, and Yuxin Chen, "Active Policy Improvement from Multiple Black-box Oracles", in Proceedings of the International Conference on Machine Learning (ICML), 2023 (* denotes equal contribution) [arXiv]
You can either build a Docker image based on docker/Dockerfile, or pull one from Docker Hub: docker pull ripl/maps
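For example, a minimal sketch of building and running the image (the ripl/maps tag comes from the pull command above; the mount point and the --gpus flag are assumptions about your setup):

# Build the image locally from the repository root
$ docker build -t ripl/maps -f docker/Dockerfile .

# Or pull the prebuilt image and start an interactive container
# (mounting the repo at /workspace and --gpus all are assumptions)
$ docker pull ripl/maps
$ docker run -it --rm --gpus all -v "$(pwd)":/workspace ripl/maps bash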
The current scripts use Weights & Biases for logging. You may need to set the WANDB_API_KEY environment variable.
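For example (the key value is a placeholder; use your own key from the W&B site):

# Set the W&B API key before launching any training script
$ export WANDB_API_KEY=<your-api-key>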
For example, from the project root directory, you can run:
$ python3 -m maps.scripts.pretraining.train_expert sac model_dir --env-name dmc:Cheetah-run-v1
This trains a SAC policy on the Cheetah-run environment and periodically saves the network weights under model_dir.
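If you need experts for additional tasks, the same script can presumably be reused with a different --env-name; the environment IDs below follow the dmc:Cheetah-run-v1 pattern and are assumptions, not a tested list:

# Train experts for other DeepMind Control tasks (env names are assumptions)
$ python3 -m maps.scripts.pretraining.train_expert sac model_dir --env-name dmc:Walker-walk-v1
$ python3 -m maps.scripts.pretraining.train_expert sac model_dir --env-name dmc:Hopper-hop-v1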
The following command creates multiple run configurations over environment domains, sets of experts, and algorithms.
* Before running the following, you need to adjust the expert paths (L6 in maps/scripts/pretraining/experts.py) to point to the directory where you saved the expert models in the previous step.
$ python3 -m maps.scripts.sweep.sample_sweep
This generates maps/scripts/sweep/sample_sweep.jsonl, with one run configuration per line.
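To sanity-check the generated file, you can count and peek at the configurations (the path matches the training command below):

# Number of generated run configurations
$ wc -l maps/scripts/sweep/sample_sweep.jsonl
# Inspect the first configuration (the one launched with -l 0)
$ head -n 1 maps/scripts/sweep/sample_sweep.jsonl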
For example, to launch the configuration in the first line:
$ python3 -m maps.scripts.train maps/scripts/sweep/sample_sweep.jsonl -l 0
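To launch several configurations in sequence, you can loop over line indices (assuming -l is zero-based, as in the -l 0 example above):

# Run the first four configurations back to back (zero-based indexing assumed)
$ for i in 0 1 2 3; do
    python3 -m maps.scripts.train maps/scripts/sweep/sample_sweep.jsonl -l $i
  done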
- Remove the Dockerfile's dependency on Takuma's image
- Push the new docker image to dockerhub
- Make the pretrained experts available? (git lfs?)
If you find our work useful in your research, please consider citing the paper as follows:
@article{liu2023active,
  title={Active Policy Improvement from Multiple Black-box Oracles},
  author={Liu, Xuefeng and Yoneda, Takuma and Wang, Chaoqi and Walter, Matthew R and Chen, Yuxin},
  journal={arXiv preprint arXiv:2306.10259},
  year={2023}
}