Reinforcement Learning basic sample not working #112

domswit · 2019-09-04T19:18:28Z

The basic example provided here does not seem to work because the output was always 0:
https://cs.stanford.edu/people/karpathy/convnetjs/docs.html

Proof:
I tried changing this line:
var reward = action === 0 ? 1.0 : 0.0;
into:
var reward = action === 1 ? 1.0 : 0.0;

*** and got the same result which is 0

Code Example:
/START CODE/
var brain = new deepqlearn.Brain(3, 2); // 3 inputs, 2 possible outputs (0,1)
var state = [Math.random(), Math.random(), Math.random()];
for(var k=0;k<10000;k++) {
var action = brain.forward(state); // returns index of chosen action
var reward = action === 0 ? 1.0 : 0.0;
brain.backward([reward]); // <-- learning magic happens here
state[Math.floor(Math.random()*3)] += Math.random()*2-0.5;
}
brain.epsilon_test_time = 0.0; // don't make any more random choices
brain.learning = false;
// get an optimal action from the learned policy
var action = brain.forward(state);
/END CODE/

samodostal · 2019-12-01T14:36:49Z

Exactly the same issue here.
The author of this library is not taking care of this repo tho. I think it doesn't work anymore...

TrevorBlythe · 2021-11-07T11:25:30Z

Exactly the same issue here. The author of this library is not taking care of this repo tho. I think it doesn't work anymore...

Things just dont start working unless they are outdated and this javascript still works fine. i got an example working but i had to link the brain js library. Its not included here but i found it in the source code of the example

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reinforcement Learning basic sample not working #112

Reinforcement Learning basic sample not working #112

domswit commented Sep 4, 2019

samodostal commented Dec 1, 2019

TrevorBlythe commented Nov 7, 2021

Reinforcement Learning basic sample not working #112

Reinforcement Learning basic sample not working #112

Comments

domswit commented Sep 4, 2019

samodostal commented Dec 1, 2019

TrevorBlythe commented Nov 7, 2021