
Fixes for issues #3, #5 and #7. Agent learns better #8

Open · wants to merge 2 commits into master
Conversation

praveen-palanisamy

Below is a summary of the contributions made by this PR:

@SSARCandy

Thanks for your contribution! It works on PyTorch 0.2 👍

BTW, may I ask why you do gradient clipping? Does it matter for performance?
Thanks for your code again :)

@praveen-palanisamy
Author

Glad to hear that my contributions helped you.

Clipping the gradient ensures that the gradients don't "explode", which is a common problem when training neural networks with gradient-descent-based algorithms.
In this case with DQN, gradient clipping ensures that the optimizer only takes small (in magnitude) steps in the direction of the gradient. A larger descent step, and hence a big update to the Q-value function approximation, could throw the approximation off from (converging to) the optimal values.
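
As a rough sketch of what this looks like in PyTorch (not the exact code from this PR; `q_net`, the optimizer, and the dummy loss below are all placeholders), the gradients are clamped element-wise to [-1, 1] right after `backward()` and before the optimizer step:

```python
import torch
import torch.nn as nn
import torch.optim as optim

# Hypothetical stand-in for the repo's Q-network; any nn.Module works here.
q_net = nn.Sequential(nn.Linear(4, 32), nn.ReLU(), nn.Linear(32, 2))
optimizer = optim.Adam(q_net.parameters(), lr=1e-3)

def update_step(loss):
    optimizer.zero_grad()
    loss.backward()
    # Gradient clipping: clamp each gradient element to [-1, 1] so that a
    # single large TD error cannot trigger a huge parameter update.
    for param in q_net.parameters():
        param.grad.data.clamp_(-1, 1)
    optimizer.step()

# Usage with a dummy TD-error-style loss:
states = torch.randn(8, 4)
targets = torch.randn(8, 2)
loss = nn.functional.smooth_l1_loss(q_net(states), targets)
update_step(loss)
```

Clipping the gradient's global norm (e.g. with `torch.nn.utils.clip_grad_norm`) is a common alternative to the element-wise clamping shown above; both bound the size of each update.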

Hope the explanation helps.
