YouTube video on the world model by Schmidhuber and Ha. The takeaway: when trained inside a "dream" environment, i.e. an environment modeled by the MDN-RNN, the agent could learn a policy that scored higher than when trained on "real" scenarios. The tau "temperature" parameter controls the degree of uncertainty in the dream, and this uncertainty seems to have helped the agent learn a robust policy. Training inside the simulated latent-space dream world is also efficient! The world models were trained incrementally to simulate reality well enough that policies learned in the dream transfer back to the real world. Will this be useful for simulating PLAsTiCC data?
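For reference, here is a minimal sketch of how a tau temperature parameter can enter sampling from an MDN-RNN's mixture output, for a single latent dimension. The function name is hypothetical; the convention of dividing the mixture logits by tau and scaling the Gaussian noise by sqrt(tau) follows the released World Models code, but treat the details here as an illustrative assumption, not the exact implementation.

```python
import numpy as np

def sample_mdn(log_pi, mu, log_sigma, tau=1.0, rng=None):
    """Sample one latent value from MDN mixture outputs (illustrative).

    log_pi, mu, log_sigma: shape-(K,) arrays for K mixture components.
    tau: temperature; larger tau makes both the component choice and
    the Gaussian noise more uncertain, i.e. a "hazier" dream.
    """
    rng = np.random.default_rng() if rng is None else rng
    # Temperature-scale the mixture logits, then softmax into weights.
    logits = log_pi / tau
    pi = np.exp(logits - logits.max())
    pi /= pi.sum()
    # Pick a mixture component according to the scaled weights.
    k = rng.choice(len(pi), p=pi)
    # Widen the component's std by sqrt(tau) before sampling.
    sigma = np.exp(log_sigma[k]) * np.sqrt(tau)
    return rng.normal(mu[k], sigma)
```

With a small tau the sampler collapses toward the most likely component's mean (a near-deterministic dream); with tau > 1 it explores unlikely components and wider noise, which is the uncertainty credited with making dream-trained policies transfer.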
jiwoncpark changed the title from "World Model" to "Read paper + watch video on World Model", then to "Read paper and watch video on World Model" (Sep 19, 2018).