Dreamer for Atari #33

michaelzhiluo · 2020-08-02T23:42:05Z

In short, here's the bug when I ran atari_breakout:

  File "dreamer.py", line 463, in <module>
    main(parser.parse_args())
  File "dreamer.py", line 443, in main
    functools.partial(agent, training=False), test_envs, episodes=1)
  File "/home/mluo/dreamer/tools.py", line 124, in simulate
    obs, _, done = zip(*[p()[:3] for p in promises])
  File "/home/mluo/dreamer/tools.py", line 124, in <listcomp>
    obs, _, done = zip(*[p()[:3] for p in promises])
  File "/home/mluo/dreamer/wrappers.py", line 350, in step
    obs, reward, done, info = self._env.step(action)
  File "/home/mluo/dreamer/wrappers.py", line 162, in step
    obs, reward, done, info = self._env.step(action)
  File "/home/mluo/dreamer/wrappers.py", line 211, in step
    obs, reward, done, info = self._env.step(action)
  File "/home/mluo/dreamer/wrappers.py", line 320, in step
    raise ValueError(f'Invalid one-hot action:\n{action}')
ValueError: Invalid one-hot action:
[ 0.999  -0.9995  0.9995  0.9995]

I was wondering what changes are needed to get atari to work in your much cleaner Dreamer codebase and what possible hyperparameter changes would be needed to match the results reported in the paper.

The text was updated successfully, but these errors were encountered:

IcarusWizard · 2020-08-03T05:05:07Z

Same problem as #29.

michaelzhiluo · 2020-08-03T08:59:03Z

Ty! Does Atari learn well (replicate results) with the hyperparameters in dreamer.py?

IcarusWizard · 2020-08-06T14:36:46Z

Sorry, I didn't fully run the atari experiment, since I don't have enough resource to run it 😟 (by calculation, it needs roughly 1T RAM and weeks of training on my environment).
If you have enough resource and want to replicate the results, I suggest you to try the parameters in Appendix A of the paper. My setting is --expl epsilon_greedy --horizon 10 --kl_scale 0.1 --action_dist onehot --expl_amount 0.4 --expl_min 0.1 --expl_decay 100000 --pcont 1 --time_limit 1000000. Here time_limit is set to be large enough to prevent early stop of rollout in atari environment.
You may also need to change the hidden size of the network as mentioned by Danijar in #7.
Good Luck!

xlnwel · 2021-01-19T08:55:37Z

DreamerV2 for Atari games is out. Check this repo: https://github.com/danijar/dreamerv2

danijar closed this as completed Jan 19, 2021

xiangyyyy mentioned this issue Apr 19, 2021

slow in atari tasks #48

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dreamer for Atari #33

Dreamer for Atari #33

michaelzhiluo commented Aug 2, 2020

IcarusWizard commented Aug 3, 2020

michaelzhiluo commented Aug 3, 2020

IcarusWizard commented Aug 6, 2020

xlnwel commented Jan 19, 2021

Dreamer for Atari #33

Dreamer for Atari #33

Comments

michaelzhiluo commented Aug 2, 2020

IcarusWizard commented Aug 3, 2020

michaelzhiluo commented Aug 3, 2020

IcarusWizard commented Aug 6, 2020

xlnwel commented Jan 19, 2021