Breakout STDP having low rewards #345

jethrokuan · 2019-11-28T11:02:27Z

Hi!

I'm particularly interested in spiking neural networks in the reinforcement learning framework, so I thought I'd run the breakout example in breakout_stdp.py. Running the script, I get the results:

Episode 95 total reward:1.0
Episode 96 total reward:0.0
Episode 97 total reward:2.0
Episode 98 total reward:1.0
Episode 99 total reward:4.0

Is this to be expected? What's the mean/variance of the reward values I should be expecting? I'm looking to get something working so I can establish some baselines. Thanks!

The text was updated successfully, but these errors were encountered:

Hananel-Hazan · 2019-11-29T13:56:58Z

Hello, thank you for using BindsNET. The code breakout_stdp.py demonstrates how to use a spiking network to play an ATARI game.
The results that you reported are normal and can be expected from a random choice of the untrained spiking network.

Currently, it is not an easy task for training spiking neurons to perform well in the RL environment.
However, in this paper, we show a way to train regular neuronal network and convert it to the spiking network.

Hananel-Hazan closed this as completed Nov 29, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Breakout STDP having low rewards #345

Breakout STDP having low rewards #345

jethrokuan commented Nov 28, 2019

Hananel-Hazan commented Nov 29, 2019

Breakout STDP having low rewards #345

Breakout STDP having low rewards #345

Comments

jethrokuan commented Nov 28, 2019

Hananel-Hazan commented Nov 29, 2019