Skip to content

EvieQ01/OODIL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Out-of-Dynamics Imitation Learning from Multimodal Demonstrations

1. MuJoCo environments.

The implementation for MuJoCo environments is in mujoco/.

Acknowledgement

2. Driving environment.

The implementation for Driving environment is in carlo/.

Acknowledgement

3. Simulated Franka Panda Arm.

The implementation for Simulated Franka Panda Arm is in simulated_robot/.

Acknowledgement

You can resort to wandb to login your personal account via export your own wandb api key.

export WANDB_API_KEY=YOUR_WANDB_API_KEY

and run

wandb online

to turn on the online syncronization.

References

[1] Z. Cao, Y. Hao, M. Li, and D. Sadigh. Learning feasibility to imitate demonstrators with different dynamics. In CoRL, 2021.

[2] Schulman, John, et al. "Proximal policy optimization algorithms." arXiv preprint arXiv:1707.06347 (2017).

[3] Ho, Jonathan, and Stefano Ermon. "Generative adversarial imitation learning." Advances in neural information processing systems. 2016.

[4] Fu, Justin, Katie Luo, and Sergey Levine. "Learning robust rewards with adversarial inverse reinforcement learning." arXiv preprint arXiv:1710.11248 (2017).

About

Code implementation for paper: Out-of-Dynamics Imitation Learning from Multimodal Demonstrations

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published