
Removed the unnecessary softplus in NTMHeadBase._address_memory #6

Merged — 1 commit merged on Mar 27, 2018
Conversation

@JulesGM (Contributor) commented Mar 26, 2018

Removed the softplus in the softmax:

```python
s = F.softmax(F.softplus(s), dim=1)
```

softmax already constrains the values to (0, 1), so the softplus doesn't achieve anything. PyTorch's softmax implementation is already numerically stable, so that isn't a concern either.
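For reference, a minimal sketch (not from the repository) of what the change amounts to, assuming only `F.softmax` and `F.softplus` from `torch.nn.functional` and a hypothetical `(batch, shift)` tensor of raw shift scores: both forms normalize to a valid probability distribution, the softplus merely reshapes the logits beforehand.

```python
import torch
import torch.nn.functional as F

s = torch.randn(4, 3)  # hypothetical batch of raw shift scores

old = F.softmax(F.softplus(s), dim=1)  # original: softplus applied before softmax
new = F.softmax(s, dim=1)              # after this PR: softmax alone

# Both are valid probability distributions over the shift dimension:
# every entry lies in (0, 1) and each row sums to 1.
print(old.sum(dim=1))  # tensor of ones
print(new.sum(dim=1))  # tensor of ones
```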

@loudinthecloud (Owner)
Makes sense, thanks for that. Can you please run the copy-task notebook and check that we're getting the same results?

@JulesGM (Contributor, Author) commented Mar 26, 2018

I trained a bunch of pretty long models and got good results in the notebooks.

@JulesGM (Contributor, Author) commented Mar 27, 2018

Like this one, which was trained for a while on sequences of up to length 120 and converges very sharply:

[copy-train-120 — training curve plot]

@loudinthecloud merged commit d7b3840 into loudinthecloud:master on Mar 27, 2018
@loudinthecloud (Owner)
Tested it as well; it seems to alter convergence a bit, but perhaps for the better.
