
[WIP] fixing memory issue in RNN #112

Merged
merged 10 commits into from
Jan 2, 2017

Conversation

jermainewang
Member

@jermainewang jermainewang commented Dec 19, 2016

Main reason for the RNN memory issue:

  • We are not computing gradients in backward (reverse) order. This leaves many intermediate values held alive in the tape.

Changes:

  • tape.get_gradient now takes a tuple of arrays as input and returns a tuple of arrays as results.
    • This could be simplified further: core.grad is self-contained (all inputs and outputs are specified through its interface), so tape.get_gradient could take no arguments at all, since every argument required for the gradient must already be a root node of the tape.
  • tape.get_gradient now traverses the tape in reverse order, from the last node back to the root node. The forward pass can be skipped because it was already performed when generating grad_records. A minimal sketch of this reverse-order traversal is given after this list.
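For illustration only, here is a minimal sketch of the reverse-order traversal idea. The names TapeRecord, Tape.get_gradient, grad_func, and the string node ids are hypothetical and do not reflect minpy's actual classes or signatures; the point is that walking the records last-to-first lets each intermediate gradient be released as soon as it has been propagated, instead of keeping every intermediate value alive.

```python
class TapeRecord:
    """One recorded op: maps the output node's gradient to its inputs' gradients."""
    def __init__(self, inputs, output, grad_func):
        self.inputs = inputs        # ids of input nodes
        self.output = output        # id of the output node
        self.grad_func = grad_func  # callable: output grad -> tuple of input grads


class Tape:
    def __init__(self):
        self.records = []           # appended in forward (execution) order

    def get_gradient(self, root_ids, root_grads):
        # Seed the gradients of the root (loss) nodes.
        grads = dict(zip(root_ids, root_grads))
        # Walk the tape backwards: last recorded op first.
        for rec in reversed(self.records):
            out_grad = grads.pop(rec.output, None)   # drop it once consumed
            if out_grad is None:
                continue                             # node not on the gradient path
            for node_id, g in zip(rec.inputs, rec.grad_func(out_grad)):
                # Accumulate outside the tape so the += itself is not re-recorded.
                grads[node_id] = g if node_id not in grads else grads[node_id] + g
        return grads


# Example: record y = x * 2, z = y + 3, then ask for dz/dx.
tape = Tape()
tape.records.append(TapeRecord(inputs=('x',), output='y', grad_func=lambda g: (g * 2,)))
tape.records.append(TapeRecord(inputs=('y',), output='z', grad_func=lambda g: (g,)))
print(tape.get_gradient(('z',), (1.0,)))   # {'x': 2.0}
```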

@hotpxl

1. Tape autograd logic.
2. Use a generated id rather than the builtin id function for the tape (a sketch of the motivation follows this commit message).
3. Fix bug in tape: the gradient += accumulation was itself being recorded in the tape.
Conflicts:
	minpy/primitive.py
	minpy/tape.py
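As a hedged illustration of point 2 above (not minpy's actual code; TapeNode and _next_tape_id are made-up names): Python's builtin id() returns an object's address in CPython, and that address can be reused once the object is garbage-collected, so two distinct tape nodes could end up with the same key. A counter-based generated id never repeats within a process.

```python
import itertools

# Counter-based ids are unique for the lifetime of the process, unlike id(),
# whose values (object addresses in CPython) may be reused after an object
# is garbage-collected and could therefore collide between tape nodes.
_next_tape_id = itertools.count()

class TapeNode:
    def __init__(self, value):
        self.value = value
        self.tape_id = next(_next_tape_id)   # generated id, never reused

# Demonstration of id() reuse: the second temporary object may be placed at
# the same address (and thus get the same id) once the first one is freed.
first = id(object())
second = id(object())
print(first == second)   # often True on CPython, illustrating the hazard
```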
@jermainewang
Member Author

Everything on the minpy side should be fixed in this PR now. @lryta @ZihengJiang @hotpxl, please have a look. @HrWangChengdu, would you mind helping me rerun all the examples to check whether they still work?

@jermainewang jermainewang merged commit bfe1328 into master Jan 2, 2017
@jermainewang jermainewang deleted the rnn-memory branch January 2, 2017 15:42