GH-3070: Fix inconsistency between best path and scores in ViterbiDecoder #3189

mauryaland · 2023-04-06T12:17:08Z

Related to #3070. It is the same fix used back then #949 that has been removed with the refacto. It is definitely not a good interpretation of probabilities from the Viterbi decoding but at least avoid confusion.

Some improvements could be achieved, see for example this solution from Manning stanfordnlp/stanza#744 (comment) and this CRF implementation https://github.com/RasaHQ/rasa/blob/main/rasa/utils/tensorflow/crf.py.

I would be happy to discuss more about what can be done.

mauryaland · 2023-06-22T13:29:36Z

Hi @alanakbik @whoisjones, any thought on this topic? I think it should be discussed regarding the consequences of the CRF scoring implementation.

mauryaland · 2023-10-03T11:17:17Z

@alanakbik @whoisjones @helpmefindaname it could be great to discuss this topic before the release of flair 0.13. Thanks in advance!

helpmefindaname · 2023-10-09T14:25:35Z

Hi @mauryaland ,
I agree this should be part of the realse, I am currently fixing the merge conflicts etc. and then talk to @alanakbik about it.

alanakbik · 2023-10-23T13:46:20Z

Thanks @mauryaland for improving this!

Code to test:

from flair.data import Sentence
from flair.models import SequenceTagger

# example sentence
sentence = Sentence("I work for the Humboldt Universität of Berlin")

# print NER tags
tagger = SequenceTagger.load("ner-fast")
tagger.predict(sentence, return_probabilities_for_all_classes=True)
print(sentence)

# one token has a higher probability for another class pre-viterbi
difficult_token = sentence[6]

# print the problem token "of"
print(difficult_token)

# print token prediction probabilities
sorted_predictions = sorted(difficult_token.get_tags_proba_dist("ner"), key=lambda x: x.score, reverse=True)
print(sorted_predictions[:2])

Before the PR, the token "of" will have the following individual probabilities:

['Token[6]: "of"'/'O' (0.8465999960899353), 'Token[6]: "of"'/'I-ORG' (0.08910000324249268)]

After the PR, the token "of" will have the following individual probabilities:

['Token[6]: "of"'/'I-ORG' (0.8465999960899353), 'Token[6]: "of"'/'O' (0.08910000324249268)]

Since it is labeled as I-ORG after Viterbi, the score post-PR is more appropriate.

Fix inconsistency between best path and scores

7dc4584

Merge branch 'master' into crf_score

27a952a

Benedikt Fuchs and others added 4 commits October 12, 2023 14:07

Merge branch 'master' into mauryaland-crf_score

b554291

fix E721 Do not compare types, use isinstance()

c3ee3ab

fix typing errors & simplify logic

ae5ca66

fix ruff tenary error

04856ef

helpmefindaname added the release-0.13 label Oct 23, 2023

alanakbik merged commit c5c1bf8 into flairNLP:master Oct 23, 2023
1 check passed

mauryaland deleted the crf_score branch October 24, 2023 16:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-3070: Fix inconsistency between best path and scores in ViterbiDecoder #3189

GH-3070: Fix inconsistency between best path and scores in ViterbiDecoder #3189

mauryaland commented Apr 6, 2023

mauryaland commented Jun 22, 2023 •

edited

mauryaland commented Oct 3, 2023

helpmefindaname commented Oct 9, 2023

alanakbik commented Oct 23, 2023

GH-3070: Fix inconsistency between best path and scores in ViterbiDecoder #3189

GH-3070: Fix inconsistency between best path and scores in ViterbiDecoder #3189

Conversation

mauryaland commented Apr 6, 2023

mauryaland commented Jun 22, 2023 • edited

mauryaland commented Oct 3, 2023

helpmefindaname commented Oct 9, 2023

alanakbik commented Oct 23, 2023

mauryaland commented Jun 22, 2023 •

edited