RNN gradients are wrong #518

eaplatanios · 2019-09-29T21:39:17Z

This line should be 𝛁cell += new𝛁cell instead. There is also an issue with the initial state initialization to a zeros tensor with batch size 1 that does not broadcast. I don't have time to open a PR right now because I'm traveling but will try to open one once I get a chance.

The text was updated successfully, but these errors were encountered:

rxwei · 2019-09-29T22:25:26Z

Nice catch!

Shashi456 · 2019-10-01T06:24:55Z

Should I take care of this @eaplatanios

eaplatanios · 2019-10-01T14:33:03Z

@Shashi456 thanks! Feel free to fix the += bug. The initial state issue is a less trivial API design question that I haven't thought about much yet. There are a couple of possible solutions that I used in swift-rl, so I'll try to look into it sometime soon.

sgugger · 2019-12-06T19:40:56Z

I think this has been solved by the two PRs merged. Feel free to reopen if there is still a problem.

Shashi456 · 2019-12-06T19:46:35Z

@sgugger the tests against these changes were remaining so I thought we'd keep this PR open. #555, #554 track the tests. But since we have separate issues for them maybe it's okay.

sgugger · 2019-12-06T19:50:33Z

Like you said there are already two issues tracking this. I wanted to do a bit of clean-up, but if anyone feels strongly about this issue remaining open, they can click the button :)

Shashi456 mentioned this issue Oct 1, 2019

Fix RNN gradient accumulation. #519

Merged

eaplatanios mentioned this issue Oct 2, 2019

Fixed a couple RNN bugs. #522

Merged

sgugger closed this as completed Dec 6, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RNN gradients are wrong #518

RNN gradients are wrong #518

eaplatanios commented Sep 29, 2019

rxwei commented Sep 29, 2019

Shashi456 commented Oct 1, 2019

eaplatanios commented Oct 1, 2019

sgugger commented Dec 6, 2019

Shashi456 commented Dec 6, 2019

sgugger commented Dec 6, 2019

RNN gradients are wrong #518

RNN gradients are wrong #518

Comments

eaplatanios commented Sep 29, 2019

rxwei commented Sep 29, 2019

Shashi456 commented Oct 1, 2019

eaplatanios commented Oct 1, 2019

sgugger commented Dec 6, 2019

Shashi456 commented Dec 6, 2019

sgugger commented Dec 6, 2019