Training schema

./run_mj examples.vitac_world.launch

Training schema

Train auto encoders
Train passive dynamics model a = 0
Train active dynamics model a != 0
Find policy.
Explore the world randomly.
Learn about positive and negative rewards from tactile sensing --> hypotheses: get stuck on always getting tactile feedback
Test with tactile curiosity --> the robot contacts surfaces and stuff and gets rewarded for the novelty, but gets bored after a while
The curiosity from tactile feedback drives the robot to contact objects on the world, which in turn drives the visual curiosity
```
 --> the robot achieves a good exploration
```

Show that the learnt dynamics model is useful for some manipulation task

 --> move the rope a to a specific configuration.
 --> reverse time.
         1. start with the robot in a given position
         2. get the robot exploring and moving the rope in random positions
         3. reverse the reward in time.

 --> start with robot in some random wierd positions e.g.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
lib		lib
memories/v01		memories/v01
mujoco-python-viewer		mujoco-python-viewer
old_src		old_src
src		src
.gitignore		.gitignore
README.md		README.md
notes.txt		notes.txt
scene.xml		scene.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lib

lib

memories/v01

memories/v01

mujoco-python-viewer

mujoco-python-viewer

old_src

old_src

src

src

.gitignore

.gitignore

README.md

README.md

notes.txt

notes.txt

scene.xml

scene.xml

Repository files navigation

Training schema

About

Releases

Packages

Languages

danfergo/modovt

Folders and files

Latest commit

History

Repository files navigation

Training schema

About

Resources

Stars

Watchers

Forks

Languages