DL4J KerasSequentialImport problem #8701

tintinxue1 · 2020-02-12T14:27:49Z

Issue Description

Simple LSTM models trained using Keras give wrong predictions after imported in Java using Keras Sequential Import.
The model start with a Masking Layer, then LSTM layer, and Dense Layer.
When the model is small untrained, the java imported version would give the same predictions as the python version.
But whenever I provide a trained model or relatively larger model. I get a complete mismatch.

To make sure both python and java model takes the same input, I preprocessed sample input in python, saved them into the npy file after transposing the second and the third dimension. Then in Java, I used Nd4j.createFromNpyfile to create the same input. Then import the h5 file saved from python.

So both the python and java load model from same h5 file, same input npy file except the second and third dimension transposed. They disagree completely on the result.
Only when the model is extremely small or untrained, they do match.

I've put two model files in the github, model_latest.h5 is the one wouldn't match, model_toy.h5 is the one that would match. the test_matrix.npy is the sample input read in by both python and java side.
https://github.com/tintinxue1/dl4j_KerasImport/tree/master

Version Information

Deeplearning4j version ==> 1.0.0 beta-6
Platform information (OS, etc) ==> MacOs
CUDA version, if used ==> not applicable
NVIDIA driver version, if in use. ===> not applicable

treo · 2020-02-12T15:02:27Z

Originally started on https://community.konduit.ai/t/imported-keras-lstm-layer-mismatch/124.

Even though this looks like it might be a bug, it would be better if you had waited to get a confirmation of it instead of cross posting.

eraly · 2020-02-13T21:46:26Z

"Only when the model is extremely small or untrained, they do match"

Try giving the larger model a random input and you will find the outputs match for the larger model too. This is a bug in our keras import of the masking layer. With a random input where none of the time steps have all values in the input equal to the mask_value the mask layer does nothing in keras and that is why the outputs are equal.

eraly · 2020-02-13T23:56:17Z

Tagging @AlexDBlack as per Paul's instructions to assign someone (me?) to work on the fix for the bug.
Unit test in repo linked here

eraly · 2020-02-13T23:57:11Z

In the unit test note that the model passes for the random input and doesn't for the other due to the bug I described earlier.

tintinxue1 · 2020-02-14T14:49:29Z

Using the SequenceRecordReader with the Alignment made it work, although i have to write the inputs into txt file and read them back in. but It'll do for now, hope you can fix the bug soon. thanks

treo added Bug Bugs and problems DL4J Keras Issues related to Keras import labels Feb 13, 2020

treo assigned eraly Feb 28, 2020

agibsonccc assigned agibsonccc and unassigned eraly Jan 27, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DL4J KerasSequentialImport problem #8701

DL4J KerasSequentialImport problem #8701

tintinxue1 commented Feb 12, 2020

treo commented Feb 12, 2020

eraly commented Feb 13, 2020

eraly commented Feb 13, 2020

eraly commented Feb 13, 2020

tintinxue1 commented Feb 14, 2020

DL4J KerasSequentialImport problem #8701

DL4J KerasSequentialImport problem #8701

Comments

tintinxue1 commented Feb 12, 2020

Issue Description

Version Information

treo commented Feb 12, 2020

eraly commented Feb 13, 2020

eraly commented Feb 13, 2020

eraly commented Feb 13, 2020

tintinxue1 commented Feb 14, 2020