Add optimized LSTM cell. #648
Conversation
Codecov Report
@@ Coverage Diff @@
## master #648 +/- ##
==========================================
+ Coverage 80.60% 81.02% +0.41%
==========================================
Files 55 55
Lines 4254 4347 +93
==========================================
+ Hits 3429 3522 +93
Misses 825 825
Continue to review full report at Codecov.
Thanks @bastings!

Oh shoot.. call me old-fashioned. :-) Would it still be useful to have this in both

If you already have the code on

Sounds good. Will add the

Hi @bastings -- looking forward to merging once we have a Linen version!

(Or feel free to mark this as "pull requests welcome", as perhaps someone else can help with this if you're too busy.)

I'll add it, just ran out of time that day :)

@avital @marcvanzee done!
marcvanzee
left a comment
In general, it is better to create your own fork of Flax and make your changes in a branch there. Otherwise we end up with a huge number of branches in the Flax repository.
flax/linen/recurrent.py (Outdated)

    return init_fn(key1, mem_shape), init_fn(key2, mem_shape)

    class DummyDense(Module):
It seems we are using this special Dense layer because we want to do the lax.dot_general outside of it. I would rename it to something more descriptive, maybe DenseNoMatMul or DenseNoDotGeneral? One could even argue whether it is still a Dense, since it seems you just get a kernel and bias, so a name like KernelAndBias is also fitting, I think.
We could also consider making this class private to OptimizedLSTMCell.
I tried to make it private to OptimizedLSTMCell, but Flax doesn't like that (it raises an error), so I'll keep it outside.
I renamed it to DenseParams, because that is what it is. Does that work for you? I still like DummyDense too, though, since it follows the Dense API; it just doesn't apply the layer.
Co-authored-by: Marc van Zee <marcvanzee@google.com>
This adds an optimized LSTM cell that is compatible with the regular LSTMCell.
It is faster because multiple smaller matrix multiplications are combined into larger ones.
It is compatible because the parameter matrices are still saved individually and combined dynamically before the computation, so LSTMCell and OptimizedLSTMCell can be exchanged without changing the computation. A test verifies this.
Worked on this together with @adarob.
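The fusion described above can be sketched in plain JAX (a hedged illustration, not the PR's code): the per-gate kernels stay separate in the checkpoint, but are concatenated at call time so four matmuls become one larger one with identical results.

```python
import jax
import jax.numpy as jnp


def gates_naive(x, kernels):
    """One matmul per gate (i, f, g, o), as in the regular LSTMCell."""
    return [x @ k for k in kernels]


def gates_fused(x, kernels):
    """Concatenate the per-gate kernels and do one larger matmul.

    The parameters are still stored per gate, so checkpoints remain
    interchangeable with the unfused version; only the compute changes.
    """
    fused = jnp.concatenate(kernels, axis=-1)     # (features, 4 * hidden)
    out = x @ fused                               # single large matmul
    return jnp.split(out, len(kernels), axis=-1)  # back to per-gate blocks
```

On accelerators, one (features, 4*hidden) matmul typically utilizes the hardware better than four (features, hidden) matmuls, which is where the speedup comes from.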
Context: https://twitter.com/avitaloliver/status/1328965366173851649?s=20