Add reinforcement learning #175

raimannma · 2019-10-30T15:28:17Z

Fully implemented DQN

More test methods are needed!

adding a new util-class "Window", works similar as the Java ArrayDeque add training mode for the DQN much improvement in performance allow mulit-hidden layers

The sign for the epsilon action comparison was reversed. Epsilon should begin as a high number which will lead to high exploration at first. By exploring at a high rate the agent can discover new states instead of following its judgement (poor at first) about what the best state is and with a decay function over time epsilon is decreased which then leads the agent to trust its judgement more as it has more experiences.

To be as beginner friendly as possible, we change 'epsilon'-based property names to 'explore'. We do this because the people aware of epsilon's meaning should be able to know that explore is an equivalent term, but the reverse may not be true

raimannma and others added 15 commits October 28, 2019 19:55

updating gitignore for IDEA support

472cc48

implement basic DQN

174ab64

adding learningRateDecay and epsilonDecay

0615b91

adding a new util-class "Window", works similar as the Java ArrayDeque add training mode for the DQN much improvement in performance allow mulit-hidden layers

fixing test method

d2d6f23

changing test method

08bdef1

add comments to the DQN class

5507c2e

removing debug output

ed92323

adding required import

3c12e07

reducing complexity

fe326ca

reducing complexity#2

7b596c2

fixing bug in DQN.learn()

bf3d3c9

Change 'epsilon' properties to 'explore'

84b649a

To be as beginner friendly as possible, we change 'epsilon'-based property names to 'explore'. We do this because the people aware of epsilon's meaning should be able to know that explore is an equivalent term, but the reverse may not be true

Change 'epsilon' properties to 'explore'

d4cb7cb

To be as beginner friendly as possible, we change 'epsilon'-based property names to 'explore'. We do this because the people aware of epsilon's meaning should be able to know that explore is an equivalent term, but the reverse may not be true

Refactor 'DQN.act', Add description details

af0b3d2

christianechevarria approved these changes Nov 1, 2019

View reviewed changes

christianechevarria merged commit 1b27357 into liquidcarrot:add-reinforcement-learning Nov 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add reinforcement learning #175

Add reinforcement learning #175

raimannma commented Oct 30, 2019

Add reinforcement learning #175

Add reinforcement learning #175

Conversation

raimannma commented Oct 30, 2019