Fused LSTM grad-grad #3256
Conversation
a6bd4fd to fee7d2d
jenkins, test this please.
```python
    return cuda.fusion.tanh(x * half) * half + half
```

```python
@cuda.fuse()
```
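The quoted line computes the logistic sigmoid through tanh, using the identity σ(x) = tanh(x/2)/2 + 1/2, so the fused kernel body only needs a tanh primitive. A quick NumPy check of the identity (NumPy stands in here so the sketch runs without a GPU):

```python
import numpy as np

def sigmoid_via_tanh(x, half=0.5):
    # Mirrors the fused kernel body: tanh(x * half) * half + half
    return np.tanh(x * half) * half + half

x = np.linspace(-5.0, 5.0, 11)
reference = 1.0 / (1.0 + np.exp(-x))  # standard logistic sigmoid
assert np.allclose(sigmoid_via_tanh(x), reference)
```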
How about using `input_num`? `lstm_grad_grad` will accept writable arguments.
```python
@cuda.fuse(input_num=13)
def lstm_grad_grad(
        c_prev, a, i, f, o, c, gc, gh, ggc_prev, gga, ggi, ggf, ggo,
        gc_prev, ga, gi, gf, go, gc_next, ggc, ggh):
```
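The idea behind `input_num=13` is that the first 13 parameters are read-only inputs and the trailing ones are preallocated output arrays the kernel writes in place. A minimal NumPy sketch of that calling convention (hypothetical `axpy_kernel`, with NumPy standing in for the fused CuPy kernel):

```python
import numpy as np

# Stand-in for a fused kernel: the first three arguments are read-only
# inputs, and `out` is a preallocated array written in place -- the same
# split that input_num=3 would declare for cupy.fuse.
def axpy_kernel(a, x, y, out):
    out[...] = a * x + y

a = np.float32(2.0)
x = np.arange(4, dtype=np.float32)
y = np.ones(4, dtype=np.float32)
out = np.empty(4, dtype=np.float32)
axpy_kernel(a, x, y, out)   # writes into `out`, no new allocation
assert np.allclose(out, 2 * x + y)
```

Reusing caller-provided output buffers this way is what avoids the extra array copies mentioned below.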
How should I fix it?
I wrote a sample. That kernel will reduce array copies.
Sorry, this is my mistake.
LGTM!
LSTM grad-grad calls too many kernels. I used `cupy.fuse` to combine them. Merge #3206 first.
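To illustrate what the fusion buys: each standalone elementwise operation on GPU arrays launches its own kernel and materializes a temporary, while `cupy.fuse` compiles a whole Python function into a single elementwise kernel. A hedged sketch with a made-up gradient-style expression, using NumPy so it runs without a GPU:

```python
import numpy as np

def unfused(f, gc, c_prev):
    t = f * gc         # on GPU: one kernel launch + a temporary array
    return t * c_prev  # on GPU: a second kernel launch

def fused(f, gc, c_prev):
    # Under @cupy.fuse() this whole body would compile into one kernel,
    # skipping the intermediate array entirely.
    return f * gc * c_prev

f = np.full(4, 0.5, dtype=np.float32)
gc = np.arange(4, dtype=np.float32)
c_prev = np.ones(4, dtype=np.float32)
assert np.allclose(unfused(f, gc, c_prev), fused(f, gc, c_prev))
```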