Fixing the DDPG agent #206

raimannma · 2020-03-06T19:12:42Z

Here comes a fix for the DDPG agent

ToDo: performance improving

simplify code

bug fixes

This reverts commit c636b2a

This reverts commit 87bd0db

need to copy array

christianechevarria · 2020-03-09T01:26:13Z

Hey @raimannma thanks for updating the branch! Not sure if you got a chance to look at the TravisCI build but it seems like there's some long-lived process in the unit tests that is logging and the build was unable to finish. Would you be able to look into why that's happening?

raimannma · 2020-03-09T09:52:22Z

Oh, this seems to be just a timeout problem.

My PC is so much faster than the travis-ci server.
I will do some tests and change the timeout.

raimannma · 2020-03-18T18:12:15Z

if I test it locally, everything works just fine.

Can you retry the travis build?

christianechevarria · 2020-03-19T18:33:11Z

Hey @raimannma, just tried to re-run the Travis build but wasn't able to, I think Travis has a time limit to re-run builds. Any chance you could make a tiny chance and push to the branch? If possible removing the console logging could make the test run much faster

raimannma · 2020-03-19T19:00:01Z

Hey,
I'm sorry, I think there is still a issue.
9 of 10 tries it works perfect.
But sometimes the agent makes the same actions, although he knows that they are bad, from previous experience.

I am not sure where the bug is, probably it's the noise function.

christianechevarria · 2020-03-20T18:33:23Z

I am not sure where the bug is, probably it's the noise function.

Yeah, I've found that debugging ML code is often really tough. Even with unit tests isolating where things happen is usually a tall order. That's part of the reason I've been trying to work on a way to log events for NNs in a Redux-style so you can see whenever changes happen to the network structure / configuration and step forwards / backwards in time.

For now what about trying to just log everything and follow the execution of the code? Maybe that could help to find the bug

raimannma · 2020-03-28T10:06:32Z

There is an issue with the target networks. I will do this whoule reinforcement things in the new typescript version.
So, for now closing this pull request.

raimannma added 30 commits December 11, 2019 08:01

change in package-lock.json

a4be90c

removing deprecated notation

b265370

fixing a bug

f547d81

fixing JSDoc

96aad21

adding support for continuous environments in DDPG

00f8d81

remove DQN from network.js

fc60ae6

adding RL part to carrot.js

8974b99

removing fault test method

d10b96e

adding timout to test method

a911379

fixing a bug

6120085

npm update

4604515

adding module riteway

da99be3

DDPG: adding possibility to prohibit actions

b736f73

implemented the concept of shared weights.

84e276f

ToDo: performance improving

changing toJSON()

f64fb43

adding shared weights to FFW-Mutation

a020c40

performance improvement

39c56ca

simplify code

adding timeouts to test methods

fc8a9c0

Merge branch 'add-reinforcement-learning' into add-sharedweights

8e52b78

updating self.bias at activation

03a8d87

renaming mutation method

12be1a4

refactor replay-buffer.js

1b4cce8

fixing bug in replay-buffer.js

3dfda03

adding todo

46bc349

adding shared weights for connections

9037ebf

adding timeouts to test methods

dfdfa35

Merge branch 'add-reinforcement-learning' into add-sharedweights

a8cb0c2

refactor

c636b2a

bug fixes

Revert "refactor bug fixes"

04ae0c5

This reverts commit c636b2a

refactor bug fixes

be490ed

raimannma added 21 commits January 2, 2020 07:52

increasing max timeout time

fefdf03

bug fixing in DDPG

87bd0db

removing debugging nodes

a713335

add prohibited actions to DQN

18d9ba3

Revert

4bb7c9f

This reverts commit 87bd0db

fixing

95e4a46

fixing

8f2f409

replace {no_trace: true} with {trace: false}

bfe667c

only use epsilon greedy if training

7152603

refactor

b718702

bug fixing

46893a5

BUGFIX:

1a684f4

need to copy array

checking experience entry, before copying to replayBuffer

e2dfa06

refactor

2ff1f2d

fixing the DDPG agent

ae911d1

refactor

7d27341

refactor

580a970

build:src

fef8921

refactor

3bcec15

changing parameters for stability

231ad0a

remove unused npm packages

e2f9cfc

raimannma closed this Mar 28, 2020

raimannma deleted the add-reinforcement-learning branch April 10, 2020 08:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing the DDPG agent #206

Fixing the DDPG agent #206

raimannma commented Mar 6, 2020

christianechevarria commented Mar 9, 2020

raimannma commented Mar 9, 2020 •

edited

Loading

raimannma commented Mar 18, 2020

christianechevarria commented Mar 19, 2020

raimannma commented Mar 19, 2020

christianechevarria commented Mar 20, 2020

raimannma commented Mar 28, 2020

Fixing the DDPG agent #206

Fixing the DDPG agent #206

Conversation

raimannma commented Mar 6, 2020

christianechevarria commented Mar 9, 2020

raimannma commented Mar 9, 2020 • edited Loading

raimannma commented Mar 18, 2020

christianechevarria commented Mar 19, 2020

raimannma commented Mar 19, 2020

christianechevarria commented Mar 20, 2020

raimannma commented Mar 28, 2020

raimannma commented Mar 9, 2020 •

edited

Loading