Add Conditional Random Field (CRF) layer #377

howl-anderson · 2019-07-30T11:44:31Z

related to: #314 #22
depend on: #314

What's include

TensorFlow Keras layer for CRF
Loss function for CRF

How to use it

The document is coming soon.

Progress

[x] Layer class for CRF
[x] Test cases

Limitation

Currently, only support right padding mask (due to the underline CRF functions), CRF layer will detect and report to user if mask contains left padding.

Roadmap (next version)

Include constraints: an optional list of allowed transitions (from_tag_id, to_tag_id). Learned from AllenNLP

* Rename crf_ops* -> crf* * The RNN cells inherit `AbstractRNNCell` instead of `Layer` * Remove used `training` variable * Add docstring for RNN Cells

googlebot · 2019-07-30T11:44:40Z

All (the pull request submitter and all commit authors) CLAs are signed, but one or more commits were authored or co-authored by someone other than the pull request submitter.

We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that by leaving a comment that contains only @googlebot I consent. in this pull request.

Note to project maintainer: There may be cases where the author cannot leave a comment, or the comment is not properly detected as consent. In those cases, you can manually confirm consent of the commit author(s), and set the cla label to yes (if enabled on your project).

ℹ️ Googlers: Go here for more info.

googlebot · 2019-07-30T12:00:27Z

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

howl-anderson · 2020-03-05T04:46:24Z

@gabrieldemarmiesse I am working on using the add_loss way to implement the CRF layer. When it's done, I will update this PR.

gabrieldemarmiesse · 2020-03-05T08:00:31Z

Thanks a lot @howl-anderson I hope it helped.

albertnanda · 2020-03-19T12:15:22Z

Any update here?

howl-anderson · 2020-03-20T01:44:13Z

@albertnanda I encountered some problems during the implementation and are trying to solve them.

gabrieldemarmiesse · 2020-03-21T19:01:25Z

I moved some code around in this PR, and I made #1363 which has less hacks. @howl-anderson , @Squadrick, please take a look when you can.

@howl-anderson , I've put you as author of the first commit.

bodak · 2020-03-23T09:43:15Z

I moved some code around in this PR, and I made #1363 which has less hacks. @howl-anderson , @Squadrick, please take a look when you can.

@howl-anderson , I've put you as author of the first commit.

While _keras_history approach in this PR is unfortunate (but necessary to access the layer at loss), it has no impact on the end API user.

The add_loss() approach unless I missed something won't work due to not having access to y_true inside the layer.

@gabrieldemarmiesse Your approach seems to have introduced hacks that lead to multiple impacts on the API user: two models to track and fake targets to input. From an end user perspective, I would be hesitant to leave this to be handled externally.

gabrieldemarmiesse · 2020-03-23T11:49:38Z

@bodak we can't use private APIs in code used by users.

If at the next release, this attribute disapears, we get an attribute error and we have to redo all the work of this pull request to find another pattern that works for CRF.

Or worst case scenario, the behavior changes silently (for example tensorflow decide to enforce that we can't use tensors or variables not declared in the signature in the loss), and all users get wrong numerical results/horrible error messages and we spend a lot of time understanding what is going on. To in the end fall back to #1363 because it's the only way that is garanteed to last since we don't use private APIs.

I agree with you that it leads to more confusing code for end users, but they'll have more flexibility and stability. For example, in the implementation proposed in #1363, the user is not forced to have the CRF layer be the last layer of the model.

If users don't like the pattern of having two models or passing the labels as inputs, they're free to use only one model and do a custom training loop. Making a custom training loop is getting easier and easier nowadays.

mhartig · 2020-03-23T17:34:50Z

Hi, huge thanks to you for making this effort and building a CRF layer for tensorflow 2. For my application I require the probabilities or confidences for the class predictions in order to conduct a majority vote. As far as I understand, if I use the variable self.potentials that is used in the function get_viterbi_decoding as additional output, this will yield the probabilities that come from the dense layer, so they are not necessarily the "true" probabilities for the CRF class prediction. Is there a simple way to get the predicted class together with the probability?
If I change the code from return decoded_sequence to return decoded_sequence, self.potentials I get a problem with the ranks of the expected outputs. Before my "solution" was to load another Model like so:
dense_layer = self.model.get_layer('crf')._dense_layer proba_predictor = tf.keras.Model(self.model.inputs, dense_layer.output) y_proba = proba_predictor.predict(x) y_pred_probas = softmax(y_proba).numpy().max(axis=-1)

but as expected this doubled the prediction time.
I would be very thankful for a solution.

howl-anderson · 2020-03-24T05:30:19Z

Hi @mhartig, output the probabilities or confidences for the prediction is a good feature. After the CRF layer is merged, I can implement it.

mhartig · 2020-03-24T15:02:54Z

I have tried to modify the output of the CRF layer myself, unfortunately I'm not very familiar with the underlying structure of tf.keras and the context in which the class methods are called. The output shape seems to be restricted in more than one place, but it is very hard to see exactly where the expected output shape needs to be changed in order to return for example a tensor of shape (batch_size, sequence length, number_of_classes + 1) and dtype float32 instead of a tensor of shape (batch_size, sequence_length) and dtype int32.
If it doesn't take too much time for you, it would be awesome if you could share a list of places where the output settings need to be modified. I could do the manipulations within the layer by myself.

…ature/crf_layers

…into feature/crf_layers

howl-anderson · 2020-03-25T06:14:49Z

@mhartig Since this CRF layer implement is not merged, so I am not planning to implement this feature now. But if I get enough free time, I willing to help you with this.

HuggingLLM · 2020-04-12T14:34:17Z

Any update？

howl-anderson · 2020-04-12T14:47:45Z

@NLP-ZY waiting for tensorflow/tensorflow#37818

soumayan · 2020-04-22T06:44:41Z

any example of how to implement this crf layer?

howl-anderson · 2020-04-22T07:01:47Z

@soumayan Currently, it's waiting for other PR. Doc will be added to the repo when it's ready.

soumayan · 2020-04-22T15:09:11Z

@howl-anderson when I'm installing tensorflow_addons package from your github repository using

!pip install git+https://github.com/howl-anderson/addons.git

I'm getting error--
ERROR: Failed building wheel for tensorflow-addons
Failed to build tensorflow-addons
ERROR: Could not build wheels for tensorflow-addons which use PEP 517 and cannot be installed directly

howl-anderson · 2020-04-22T15:18:31Z

@soumayan You should download it and built it to a wheel package. https://github.com/tensorflow/addons has docs for how to build from source.

zhoushaoxiang · 2020-04-26T03:23:14Z

excuse me，how to get accuray while training？I can‘t find crf_accuracy.py @howl-anderson

howl-anderson · 2020-04-26T03:47:27Z

@zhoushaoxiang I removed it for more focus on CRF itself. I am planning to add metric functions (just like the ACC function ) after the CRF implement merged.

gabrieldemarmiesse · 2020-04-26T19:38:56Z

@howl-anderson , I'll close this pull request in favor of #1733 which has an API as clean as this one, is flexible and can be serialized, the whole package without using any private api.

Squadrick and others added 12 commits June 13, 2019 18:22

Port CRF from tf.contrib to tfa.text

3bcc068

Port CRF from tf.contrib to tfa.text

b24ee9c

Port CRF from tf.contrib to tfa.text

2b9f3f6

Merge branch 'crf' of https://github.com/Squadrick/addons into crf

e900aed

Format using make code-format

707ed99

Add tf.function to all the CRF functions

829ac65

Merge branch 'master' of https://github.com/tensorflow/addons into crf

36ea1ca

RNN call masks computation based on seq len

9140ce1

Add @tf.function to all crf functions

3a8b0be

Rename files and minor fixes

a90b478

* Rename crf_ops* -> crf* * The RNN cells inherit `AbstractRNNCell` instead of `Layer` * Remove used `training` variable * Add docstring for RNN Cells

code format

d25b747

save work progress

34811e6

howl-anderson requested review from facaiy, seanpmorgan, Squadrick and WindQAQ as code owners July 30, 2019 11:44

googlebot added the cla: no label Jul 30, 2019

howl-anderson added 3 commits July 30, 2019 19:53

save work progress

23460eb

remove useless file

a0b6b7d

Update & bugfix

d8d98a8

howl-anderson force-pushed the feature/crf_layers branch from cc639b3 to d8d98a8 Compare July 30, 2019 12:00

googlebot added cla: yes and removed cla: no labels Jul 30, 2019

howl-anderson mentioned this pull request Jul 31, 2019

Implement Conditional Random Field #22

Closed

Squadrick added layers wip Work in-progress labels Jul 31, 2019

seanpmorgan added the kokoro:force-run label Aug 1, 2019

kokoro-team removed the kokoro:force-run label Aug 1, 2019

gabrieldemarmiesse mentioned this pull request Mar 21, 2020

CRF layer continued #1363

Closed

howl-anderson mentioned this pull request Mar 23, 2020

[Feature Request][Keras] Allow loss function using multiple tensors as input tensorflow/tensorflow#37818

Closed

howl-anderson added 3 commits March 25, 2020 13:43

Merge branch 'master' of https://github.com/tensorflow/addons into fe…

50ff813

…ature/crf_layers

Update CODEOWNERS

307678d

Merge branch 'feature/crf_layers' of github.com:howl-anderson/addons …

e0b5e0f

…into feature/crf_layers

boring-cyborg bot added the github label Mar 25, 2020

ivyleavedtoadflax mentioned this pull request Apr 5, 2020

Future of CRF layer wellcometrust/deep_reference_parser#28

Open

gabrieldemarmiesse closed this Apr 26, 2020

jaspersjsun mentioned this pull request Jul 16, 2020

CRF layer v3.0 continued #1999

Merged

howl-anderson mentioned this pull request Jul 30, 2020

How Exactly Do I Use CRF with this Library? #1769

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Conditional Random Field (CRF) layer #377

Add Conditional Random Field (CRF) layer #377

howl-anderson commented Jul 30, 2019 •

edited

Loading

googlebot commented Jul 30, 2019

googlebot commented Jul 30, 2019

howl-anderson commented Mar 5, 2020

gabrieldemarmiesse commented Mar 5, 2020

albertnanda commented Mar 19, 2020

howl-anderson commented Mar 20, 2020

gabrieldemarmiesse commented Mar 21, 2020

bodak commented Mar 23, 2020

gabrieldemarmiesse commented Mar 23, 2020

mhartig commented Mar 23, 2020 •

edited

Loading

howl-anderson commented Mar 24, 2020

mhartig commented Mar 24, 2020

howl-anderson commented Mar 25, 2020

HuggingLLM commented Apr 12, 2020

howl-anderson commented Apr 12, 2020

soumayan commented Apr 22, 2020

howl-anderson commented Apr 22, 2020

soumayan commented Apr 22, 2020

howl-anderson commented Apr 22, 2020

zhoushaoxiang commented Apr 26, 2020 •

edited

Loading

howl-anderson commented Apr 26, 2020

gabrieldemarmiesse commented Apr 26, 2020

Add Conditional Random Field (CRF) layer #377

Add Conditional Random Field (CRF) layer #377

Conversation

howl-anderson commented Jul 30, 2019 • edited Loading

What's include

How to use it

Progress

Limitation

Roadmap (next version)

googlebot commented Jul 30, 2019

googlebot commented Jul 30, 2019

howl-anderson commented Mar 5, 2020

gabrieldemarmiesse commented Mar 5, 2020

albertnanda commented Mar 19, 2020

howl-anderson commented Mar 20, 2020

gabrieldemarmiesse commented Mar 21, 2020

bodak commented Mar 23, 2020

gabrieldemarmiesse commented Mar 23, 2020

mhartig commented Mar 23, 2020 • edited Loading

howl-anderson commented Mar 24, 2020

mhartig commented Mar 24, 2020

howl-anderson commented Mar 25, 2020

HuggingLLM commented Apr 12, 2020

howl-anderson commented Apr 12, 2020

soumayan commented Apr 22, 2020

howl-anderson commented Apr 22, 2020

soumayan commented Apr 22, 2020

howl-anderson commented Apr 22, 2020

zhoushaoxiang commented Apr 26, 2020 • edited Loading

howl-anderson commented Apr 26, 2020

gabrieldemarmiesse commented Apr 26, 2020

howl-anderson commented Jul 30, 2019 •

edited

Loading

mhartig commented Mar 23, 2020 •

edited

Loading

zhoushaoxiang commented Apr 26, 2020 •

edited

Loading