
Add Keras LSTM support #1752

Merged — 1 commit merged into onnx:master on Oct 25, 2021

Conversation

q-ycong-p (Contributor)

This commit adds Keras LSTM support in response to this issue.
The changes include:

  1. Modified the existing LSTM pattern-matching and rewriters to handle Keras LSTM;
  2. Added unit tests verifying that ONNX models converted from Keras LSTM contain no loops (following the earlier commit that added Keras GRU support); a sketch of this kind of check is shown below.
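
For illustration, here is a minimal sketch of the kind of check the new tests perform, using the public tf2onnx.convert.from_keras API; the model shape, opset, and assertions are illustrative rather than the exact test code:

```python
# Minimal sketch (illustrative, not the exact test code): convert a small
# tf.keras LSTM model and check that the resulting ONNX graph contains a
# fused LSTM node and no control-flow Loop left over from an unmatched pattern.
import tensorflow as tf
import tf2onnx

model = tf.keras.Sequential([
    tf.keras.Input(shape=(5, 3)),
    tf.keras.layers.LSTM(8, return_sequences=True),
])

spec = (tf.TensorSpec((None, 5, 3), tf.float32, name="input"),)
onnx_model, _ = tf2onnx.convert.from_keras(model, input_signature=spec, opset=13)

op_types = [node.op_type for node in onnx_model.graph.node]
assert "LSTM" in op_types      # rewriter produced a single ONNX LSTM op
assert "Loop" not in op_types  # no looped subgraph remains after rewriting
```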

lgtm-com bot commented Oct 22, 2021

This pull request introduces 1 alert when merging ca4d655 into 42e800d - view on LGTM.com

new alerts:

  • 1 for Unused local variable

lgtm-com bot commented Oct 23, 2021

This pull request introduces 1 alert when merging 2583cd3 into 42e800d - view on LGTM.com

new alerts:

  • 1 for Unused local variable

lgtm-com bot commented Oct 23, 2021

This pull request introduces 1 alert when merging c227c7a into 42e800d - view on LGTM.com

new alerts:

  • 1 for Unused local variable

q-ycong-p force-pushed the keras_lstm_dev branch 2 times, most recently from 78460f4 to e4a26b8 on October 23, 2021 at 08:36
q-ycong-p (Contributor, Author) commented Oct 23, 2021

Hi @TomWildenhain-Microsoft and other contributors. Any suggestions on the issue below?

I've encountered a failure against TF 2.6 in tests/test_backend.py::test_rfft_ops (line 5497). The full error log is in the pipeline history. In short, it complains RuntimeError: Failed to run tfjs model: Error: Argument 'x' passed to 'cos' must be float32 tensor, but got complex64 tensor when executing tf.cos(tf.signal.rfft(x), name=_TFOUTPUT). I looked into it but cannot connect this error to my changes, which only modified the pattern-matching and rewriter for Keras LSTM. I traced it to def _rfft(...) but have no further clues.

I've tested in my local conda env (Linux): tensorflow==2.6.0; onnxruntime==1.9.0; python==3.9.0; onnx==1.10.0; numpy==1.21.3, which matches the failing pipeline configuration, but I couldn't reproduce the error locally; all tests pass for me. Any suggestions are appreciated!

TomWildenhain-Microsoft (Contributor)

@q-ycong-p The issue is a failing tfjs test. I don't think it is related to your change. Add an @skip_tfjs("Fails to run tfjs model") decorator to the test.
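
For reference, the change looks roughly like this (skip_tfjs is a tf2onnx test helper; the import path and surrounding class shown here are assumptions for illustration):

```python
# Rough sketch of the suggested fix in tests/test_backend.py; the helper
# import path and class names are assumptions, shown only for context.
from common import skip_tfjs  # tf2onnx test utility (path assumed)

class BackendTests(Tf2OnnxBackendTestBase):  # existing test class (name assumed)

    @skip_tfjs("Fails to run tfjs model")  # skip only when running the tfjs backend
    def test_rfft_ops(self):
        ...  # existing test body unchanged
```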

q-ycong-p (Contributor, Author)

@TomWildenhain-Microsoft I added the decorator as suggested, see here (though we might also want to figure out why it failed?). The pipeline is now green and the PR is ready for review/merge. Thanks!

TomWildenhain-Microsoft (Contributor)

Awesome, thanks!

> we might want to also figure out why it failed?

Yeah, we have to skip tfjs tests fairly regularly. It might be that the script for running the tfjs model isn't converting from float to complex, or that TensorFlow's tfjs converter isn't converting the model properly (not uncommon). In any case, the test fails before it even reaches our converter: it runs fine as a TF model, but fails to run after being passed through TensorFlow's tfjs model converter, so it doesn't tell us anything about whether our conversion is working.

Review thread on:

onnx_model = convert_keras(model, name=model.name)
if gru_class.__module__.split('.')[-1] == "recurrent_v2":
Contributor:
I'm not a huge fan of this check. Can you just do gru_class == recurrent_v2.GRU?

Contributor:
Or make GRU_CLASSES a tuple like you did with LSTM_CLASSES.

Contributor (Author):
gru_class == recurrent_v2.GRU alone doesn't seem to work, because in my current env the class evaluates to tensorflow.python.keras.layers.recurrent_v2.GRU. However, I was hesitant to match exactly that path; I assume it varies depending on which keras module the TF version imports, see lines 40 to 50 here.

I propose changing GRU_CLASSES to the same structure as LSTM_CLASSES, i.e. a list of tuples where each tuple carries the rnn_version to use, like here. A rough sketch of what I mean is below. Would that be good practice? Thanks!
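
A hypothetical sketch of that structure (module paths, class names, and version values below are illustrative; the actual rewriter code may differ):

```python
# Hypothetical sketch of the proposal: pair each Keras GRU class with the
# rnn_version the rewriter should use, mirroring LSTM_CLASSES, so the code
# no longer parses __module__ strings. Module paths vary across TF versions.
from tensorflow.python.keras.layers import recurrent, recurrent_v2

GRU_CLASSES = [
    (recurrent.GRU, 1),     # legacy keras.layers implementation
    (recurrent_v2.GRU, 2),  # TF2 recurrent_v2 implementation
]

def find_gru_version(gru_class):
    """Return the rnn_version for a matched GRU class, or None if no match."""
    for cls, rnn_version in GRU_CLASSES:
        if gru_class is cls:
            return rnn_version
    return None
```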

Contributor:
Ok, the tuple option is good then.

Contributor (Author):
Done! Pipeline is green.

TomWildenhain-Microsoft (Contributor) left a review comment:
Overall great, really appreciate it. Very thorough tests. LSTM patterns can be tricky.

Signed-off-by: congyc <congyc@amazon.com>
TomWildenhain-Microsoft merged commit f19a036 into onnx:master on Oct 25, 2021
q-ycong-p (Contributor, Author)

Thank you @TomWildenhain-Microsoft for reviewing and helping fix this issue!

q-ycong-p (Contributor, Author)

@TomWildenhain-Microsoft I saw the errors in the onnxruntime-nightly-unittest-matrix for the latest commit. This issue seems directly related. The suggested solutions are to change the Python and/or NumPy versions, or to change the TensorFlow source code as suggested there; neither seems doable here.

Should we add a @skip_tf_versions("2.1", "Bug in TF 2.1") decorator to the problematic test LSTMTests.test_keras_lstm? Or is there another cause, or a better way to fix it?

guschmue (Collaborator)

Thanks for debugging it! Yes, there's little we can do since it is a TF 2.1 issue.
@skip_tf_versions("2.1", "Bug in TF 2.1") seems to be the best option.
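
For reference, the fix mirrors the earlier skip_tfjs change (decorator as quoted above; the helper import path and test class shown are assumptions):

```python
# Sketch of the agreed fix: skip the Keras LSTM test on TF 2.1 only.
from common import skip_tf_versions  # tf2onnx test utility (path assumed)

class LSTMTests(Tf2OnnxBackendTestBase):  # existing test class (name assumed)

    @skip_tf_versions("2.1", "Bug in TF 2.1")
    def test_keras_lstm(self):
        ...  # existing test body unchanged
```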


Successfully merging this pull request may close: tf.keras.layers.LSTM not converted to ONNX LSTM layer