Skip to content

KAIST-AILab/gmmil

Repository files navigation

Imitation Learning via Kernel Mean Embedding

Kee-Eung Kim and Hyun Soo Park

This code is used for the paper Imitation Learning via Kernel Mean Embedding.

The implementation is based on Jonathan Ho's GAIL (Generative Adversarial Imitation Learning) code.

Contains an implementation of Trust Region Policy Optimization (Schulman et al., 2015) and Generative Adversarial Imitation Learning (Jonathan et al., 2016).

Dependencies:

  • Python 2.7
  • OpenAI Gym >= 0.1.0, mujoco_py >= 0.4.0
  • numpy >= 1.10.4, scipy >= 0.17.0, theano >= 0.8.2
  • h5py, pytables, pandas, matplotlib

Provided files:

  • expert_policies/* are the expert policies, trained by TRPO (scripts/run_rl_mj.py) on the true costs
  • scripts/im_pipeline.py is the main training and evaluation pipeline. This script is responsible for sampling data from experts to generate training data, running the training code (scripts/imitate_mj.py), and evaluating the resulting policies.
  • pipelines/* are the experiment specifications provided to scripts/im_pipeline.py
  • results/* contain evaluation data for the learned policies

Hyperparameters:

You can set hyperparameters by passing arguments when you run this python script. For example, in order to run GMMIL, run scripts/imitate_mj.py python script as python scripts/imitate_mj.py --mode gmmil --reward_type mmd --data EXPERT_TRAJ_PATH --env_name ENV_NAME. Check the example shell file train.sh.

About

Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published