Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Read paper and watch video on World Model #5

Closed
jiwoncpark opened this issue Sep 19, 2018 · 0 comments
Closed

Read paper and watch video on World Model #5

jiwoncpark opened this issue Sep 19, 2018 · 0 comments

Comments

@jiwoncpark
Copy link
Owner

Youtube video on the world model by Schmidhuber and Ha. The takeaway point is that, when trained inside a "dream" environment, i.e. an environment modeled by the MDN-RNN , the agent could learn a policy that had a higher score than when trained on "real" scenarios. The tau "temperature" parameter determines the degree of uncertainty. It seems the uncertainty helped the agent learn well. Also, training inside the simulated latent-space dream world is efficient! The world models were trained incrementally to simulate reality that is useful for transferring policies back to the real world.
Will this be useful for simulating PLAsTiCC data?

@jiwoncpark jiwoncpark changed the title World Model Read paper + watch video on World Model Sep 19, 2018
@jiwoncpark jiwoncpark changed the title Read paper + watch video on World Model Read paper and watch video on World Model Sep 19, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant