
Modeling and Optimization Trade-off in Meta-learning

This repository contains the code used to obtain the experimental results in the paper Modeling and Optimization Trade-off in Meta-learning, Gao and Sener (NeurIPS 2020).

It is based on the full_code branch of the ProMP repository.

The code is written in Python 3. The linear regression experiment requires only NumPy, while the reinforcement learning experiments additionally require TensorFlow and the MuJoCo physics engine. Some of the reinforcement learning environments are included in this repository, and the rest come from MetaWorld.

Installation

Please follow the installation instructions provided by the ProMP repository and the MetaWorld repository. For the latter, use the api-rework branch for compatibility (it is already specified in requirements.txt).
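As a rough sketch, assuming a standard pip workflow (the repository URL is inferred from the repository name, and the MuJoCo setup itself should follow the ProMP instructions linked above), installation might look like:

git clone https://github.com/isl-org/MetaLearningTradeoffs.git
cd MetaLearningTradeoffs
# requirements.txt already points to the api-rework branch of MetaWorld
pip install -r requirements.txt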

Running the experiments

Linear regression

Execute

python3 linear_regression/run_experiment.py --p 1 --beta 2 --seed 1

The figures can then be found in the folder p-1_beta-2_seed-1/figures.
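To run several configurations, the same command can simply be repeated with different flags. A minimal sketch, assuming seeds 1 through 5 (the seed range here is illustrative, and each run presumably writes its figures to the matching p-1_beta-2_seed-<seed>/figures folder):

# Illustrative loop over seeds; adjust --p and --beta as needed
for seed in 1 2 3 4 5; do
    python3 linear_regression/run_experiment.py --p 1 --beta 2 --seed $seed
done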

Reinforcement learning

To generate all of the executable scripts for these experiments, execute

python3 experiments/benchmark/run.py

They will be found in the folder scripts. The training scripts are named algorithm_environment_mode_seed.sh, and the testing scripts are named test_algorithm_environment_mode_seed_checkpoint.sh, where the placeholders are filled in as follows (example invocations appear after the list).

  • algorithm is replaced by one of ppo (DRS+PPO), promp (ProMP), trpo (DRS+TRPO), or trpomaml (TRPO-MAML).
  • environment and mode are replaced by one of the following pairs:
    • walker and params-interpolate (Walker2DRandParams)
    • walker and goal-interpolate (Walker2DRandVel)
    • cheetah and goal-interpolate (HalfCheetahRandVel)
    • hopper and params-interpolate (HopperRandParams)
    • metaworld and ml1-push (ML1-Push)
    • metaworld and ml1-reach (ML1-Reach)
    • metaworld and ml10 (ML10)
    • metaworld and ml45 (ML45)
  • seed, the random seed, is replaced by an integer from 1 to 5.
  • checkpoint, which indexes the policies saved at various stages of training, is replaced by an integer from 0 to 20.
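As a concrete illustration of the naming convention (these file names are hypothetical; the actual scripts are generated by run.py):

# Hypothetical examples following the pattern above
bash scripts/promp_walker_params-interpolate_1.sh           # train ProMP on Walker2DRandParams, seed 1
bash scripts/test_promp_walker_params-interpolate_1_20.sh   # test the checkpoint-20 policy from that run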

After all runs are finished, the figures can be created by executing

python3 experiments/benchmark/summary.py

They will be found in the folder results.

Acknowledgements

We would like to thank Charles Packer for help during the creation of the code for the reinforcement learning experiments.

Citation

To cite this repository in your research, please reference the following paper:

Katelyn Gao and Ozan Sener. Modeling and Optimization Trade-off in Meta-Learning. arXiv preprint arXiv:2010.12916 (2020).

@misc{GaoSener2020,
  Author = {Katelyn Gao and Ozan Sener},
  Title = {Modeling and Optimization Trade-off in Meta-Learning},
  Year = {2020},
  Eprint = {arXiv:2010.12916},
}
