This code accompanies the paper "Imitation Learning via Kernel Mean Embedding".
The implementation is based on Jonathan Ho's GAIL (Generative Adversarial Imitation Learning) code.
It contains implementations of Trust Region Policy Optimization (Schulman et al., 2015) and Generative Adversarial Imitation Learning (Ho & Ermon, 2016).
Dependencies:
- Python 2.7
- OpenAI Gym >= 0.1.0, mujoco_py >= 0.4.0
- numpy >= 1.10.4, scipy >= 0.17.0, theano >= 0.8.2
- h5py, pytables, pandas, matplotlib
Provided files:
- expert_policies/* are the expert policies, trained by TRPO (scripts/run_rl_mj.py) on the true costs.
- scripts/im_pipeline.py is the main training and evaluation pipeline. This script is responsible for sampling trajectories from the expert policies to generate training data, running the training code (scripts/imitate_mj.py), and evaluating the resulting policies.
- pipelines/* are the experiment specifications provided to scripts/im_pipeline.py.
- results/* contain evaluation data for the learned policies.
Hyperparameters:
You can set hyperparameters by passing command-line arguments to the training script.
For example, to run GMMIL:
python scripts/imitate_mj.py --mode gmmil --reward_type mmd --data EXPERT_TRAJ_PATH --env_name ENV_NAME
See the example shell script train.sh.
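For intuition about the `--reward_type mmd` option, the following is a minimal, self-contained sketch of a squared maximum mean discrepancy (MMD) estimate with a Gaussian kernel, the quantity kernel mean embedding methods compare distributions with. It is illustrative only: the function names and the fixed-bandwidth choice here are assumptions, not the repository's actual implementation.

```python
import numpy as np

def gaussian_kernel(x, y, sigma):
    """Gaussian (RBF) kernel matrix between the rows of x and the rows of y."""
    # Pairwise squared Euclidean distances via the expansion ||a-b||^2 = ||a||^2 + ||b||^2 - 2 a.b
    d2 = (np.sum(x**2, axis=1)[:, None]
          + np.sum(y**2, axis=1)[None, :]
          - 2.0 * np.dot(x, y.T))
    return np.exp(-d2 / (2.0 * sigma**2))

def mmd2(x, y, sigma=1.0):
    """Biased (V-statistic) estimate of the squared MMD between samples x and y."""
    return (gaussian_kernel(x, x, sigma).mean()
            - 2.0 * gaussian_kernel(x, y, sigma).mean()
            + gaussian_kernel(y, y, sigma).mean())
```

In an imitation-learning setting, the two sample sets would be state-action pairs from the expert trajectories and from the current policy; a small MMD indicates the two occupancy distributions are close under the chosen kernel.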