Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Breakout STDP having low rewards #345

Closed
jethrokuan opened this issue Nov 28, 2019 · 1 comment
Closed

Breakout STDP having low rewards #345

jethrokuan opened this issue Nov 28, 2019 · 1 comment

Comments

@jethrokuan
Copy link

Hi!

I'm particularly interested in spiking neural networks in the reinforcement learning framework, so I thought I'd run the breakout example in breakout_stdp.py. Running the script, I get the results:

Episode 95 total reward:1.0
Episode 96 total reward:0.0
Episode 97 total reward:2.0
Episode 98 total reward:1.0
Episode 99 total reward:4.0

Is this to be expected? What's the mean/variance of the reward values I should be expecting? I'm looking to get something working so I can establish some baselines. Thanks!

@Hananel-Hazan
Copy link
Collaborator

Hello, thank you for using BindsNET. The code breakout_stdp.py demonstrates how to use a spiking network to play an ATARI game.
The results that you reported are normal and can be expected from a random choice of the untrained spiking network.

Currently, it is not an easy task for training spiking neurons to perform well in the RL environment.
However, in this paper, we show a way to train regular neuronal network and convert it to the spiking network.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants