Imagination-Arbitration

This repository contains Python implementation for the Intrinsically Motivated Meta-Controller (IM2C) algorithm and Integrated Imagination-Arbitration (I2A) learning framework [1]. The implementation considers the problem of learning vision-based robotic grasping on the NICO robot [2], but can be easily tuned towards other control problems with different sensory inputs.

Dependencies

python 2.7
numpy 1.14.3
keras 2.2.2
tensorflow 1.10.1
cv2 3.4.3

Instructions

Make sure you load the scene NICO-seated.ttt into the CoppeliaSim (previously V-REP) simulator [3].
Use im2c.py as the main script for running IM2C or i2a.py for running I2A. Simply run

python im2c.py

or

python i2a.py

File description

To enable the remote API functionality of CoppeliaSim from Python client, you will need the following 3 files: vrep.py, vrepConst.py, and remoteApi.so
NICO-seated.ttt is the simulation scene that includes the robot and its environment.
env.py contains all functions related to the simulation environment, such as connecting to the simulator, controlling the joints, receiving the visual input, and computing the reward signal.
node.py and itm.py implement the Instantaneous Topological Map (ITM) [4], which is the growing self-organizing network used in our work, and compute the proposed local learning progress.
networks.py contains the deep neural architectures for the actor, critic and local world models, as well as the implementation of the gradient-based model predictive control.
parameters.py contains hyperparameter values.
im2c.py is the main script for running the IM2C learning algorithm.
i2a.py is the main script for running the I2A learning framework.
goals.txt contains a list of random goal positions for the grasping object.

Note: The implemented actor-critic method in im2c.py and i2a.py is an off-policy variant of CACLA [5]. An implementation based on DDPG [6] can be found in the folder Imagination-Arbitration/ddpg_ia

References

[1] M. B. Hafez, C. Weber, M. Kerzel, and S. Wermter. Improving Robot Dual-System Motor Learning with Intrinsically Motivated Meta-Control and Latent-Space Experience Imagination. Robotics and Autonomous Systems, 133 (2020): 103630. [PDF]

[2] M. Kerzel, E. Strahl, S. Magg, N. Navarro-Guerrero, S. Heinrich, and S. Wermter. NICO– Neuro-Inspired COmpanion: A developmental humanoid robot platform for multimodal interaction. In 2017 26th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), pages 113–120, 2017. [PDF]

[3] E. Rohmer, S. P. Singh, and M. Freese. V-REP: A versatile and scalable robot simulation framework. In 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 1321–1326, 2013. [PDF]

[4] J. Jockusch and H. Ritter. An instantaneous topological mapping model for correlated stimuli. In International Joint Conference on Neural Networks (IJCNN), volume 1, pages 529–534, 1999. [PDF]

[5] H. Van Hasselt. Reinforcement learning in continuous state and action spaces. In Reinforcement Learning, pages 207–251. Springer, 2012. [PDF]

[6] T.P. Lillicrap, J.J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, Continuous control with deep reinforcement learning. In 4th International Conference on Learning Representations, 2016. [PDF]

Cite

If you use the code, please cite the following paper:

@article{hafez2020improving,
  title={Improving Robot Dual-System Motor Learning with Intrinsically Motivated Meta-Control and Latent-Space Experience Imagination},
  author={Hafez, Muhammad Burhan and Weber, Cornelius and Kerzel, Matthias and Wermter, Stefan},
  journal={Robotics and Autonomous Systems},
  volume={133},
  pages={103630},
  year={2020},
  publisher={Elsevier}
}

Contact

Burhan Hafez - mbhafez.com

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
ddpg_ia		ddpg_ia
LICENSE.md		LICENSE.md
NICO-seated.ttt		NICO-seated.ttt
README.md		README.md
env.py		env.py
goals		goals
i2a.py		i2a.py
im2c.py		im2c.py
itm.py		itm.py
networks.py		networks.py
node.py		node.py
parameters.py		parameters.py
remoteApi.so		remoteApi.so
vrep.py		vrep.py
vrepConst.py		vrepConst.py

License

mbhafez/Imagination-Arbitration

Folders and files

Latest commit

History

Repository files navigation

Imagination-Arbitration

Dependencies

Instructions

File description

References

Cite

Contact

About

Resources

License

Stars

Watchers

Forks

Languages