Can't reproduce DQN performance #14

SunCherry · 2019-10-18T00:05:44Z

I noticed you changed the optimizer and some hyper-parameters in DQN compared to those in the "Nature" paper, well, from my side I can't reproduce results by taking any of the two settings, could you share a learning curve of "Breakout"? I have been struggling with the hyper-parameters optimization for two months. Thanks.

aslanides · 2019-10-18T14:29:25Z

Hi there.

This agent is intended to be a simple instantiation of the DQN algorithm (Q-learning + non-linear function approximation + experience replay), and isn't intended to reproduce the Nature Atari results. There are numerous subtleties related to interfacing with Atari (frame stacking, reward clipping, etc) that can be tricky to get right. For agents that are set up to run on Atari, see Dopamine (github.com/google/dopamine) or OpenAI baselines (github.com/openai/baselines)

SunCherry · 2019-10-31T15:53:08Z

Hi there.

This agent is intended to be a simple instantiation of the DQN algorithm (Q-learning + non-linear function approximation + experience replay), and isn't intended to reproduce the Nature Atari results. There are numerous subtleties related to interfacing with Atari (frame stacking, reward clipping, etc) that can be tricky to get right. For agents that are set up to run on Atari, see Dopamine (github.com/google/dopamine) or OpenAI baselines (github.com/openai/baselines)

Got it, thanks a lot.

aslanides closed this as completed Oct 18, 2019

pluebcke mentioned this issue Mar 25, 2020

DQN mnist & mountain car performance #20

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't reproduce DQN performance #14

Can't reproduce DQN performance #14

SunCherry commented Oct 18, 2019

aslanides commented Oct 18, 2019

SunCherry commented Oct 31, 2019

Can't reproduce DQN performance #14

Can't reproduce DQN performance #14

Comments

SunCherry commented Oct 18, 2019

aslanides commented Oct 18, 2019

SunCherry commented Oct 31, 2019