CRF layer continued #1363
@howl-anderson in TensorFlow 2.2, it's possible to override the `train_step` method. Here is an example with GANs: https://twitter.com/fchollet/status/1250622989541838848 Maybe we could keep the CRF layer as implemented here and let the user call the CRF loss themselves in a custom `train_step`. The implementation would then be the CRF layer implemented here + a tutorial showing how to override `train_step`.
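A minimal sketch of what that could look like, assuming the loss comes from `tfa.text.crf_log_likelihood`; `CrfModel` and its internals are illustrative, not this PR's API:

```python
# Hedged sketch of overriding train_step (TF >= 2.2); CrfModel is a
# hypothetical name, not part of this PR.
import tensorflow as tf
import tensorflow_addons as tfa

class CrfModel(tf.keras.Model):
    """Computes the CRF negative log-likelihood inside train_step."""
    def __init__(self, num_tags, **kwargs):
        super().__init__(**kwargs)
        self.dense = tf.keras.layers.Dense(num_tags)
        self.transitions = tf.Variable(
            tf.random.uniform((num_tags, num_tags)), name="transitions")

    def call(self, inputs):
        return self.dense(inputs)  # per-tag potentials, [batch, len, tags]

    def train_step(self, data):
        x, y = data  # y: gold tag indices, [batch, max_len], int32
        seq_lens = tf.fill([tf.shape(x)[0]], tf.shape(x)[1])
        with tf.GradientTape() as tape:
            potentials = self(x, training=True)
            log_likelihood, _ = tfa.text.crf_log_likelihood(
                potentials, y, seq_lens, self.transitions)
            loss = -tf.reduce_mean(log_likelihood)
        grads = tape.gradient(loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(grads, self.trainable_variables))
        return {"loss": loss}

# model = CrfModel(num_tags=5)
# model.compile(optimizer="adam")  # no loss needed, train_step computes it
# model.fit(x, y)
```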
Superseded by #1733
This is a new take on #377.
I started from the implementation of @howl-anderson and tried to make it as Keras-friendly as possible.
Disclaimer: I don't know anything about CRFs or the math behind them, so I won't be able to answer any questions about the math. This PR is just #377 with the code moved around.
Review process
I suggest that, if we agree on the general idea, we merge this and then add a docstring and tutorials. I didn't add the layer to the public API, so it's fine to merge even if there are some rough edges. We can merge this PR as-is and then let @howl-anderson polish the CRF in follow-up pull requests, since 90% of this pull request is his work.
With this new architecture, you need two models: one for training and one for inference. Both can be saved and loaded normally in the Keras format. There seems to be a minor bug when using the `tf` format; I believe it's a TensorFlow bug, I'll open an issue and link it here. A minimal example is sketched after the description below.
There are two models, one for training and one for inference, and they share some layers. One layer is in charge of computing the loss; the loss is returned as a tensor in the network. We then give fake targets (zeros) and ask `.compile()` to use the MAE so that the loss tensor is left untouched. I took the numerical accuracy test from @howl-anderson and the loss is exactly the same :)