GH-642: Added return probability distribution over all classes #782
Conversation
Hi @Philipduerholt, thanks for implementing this really useful feature. I tried out your implementation and it works as expected! See my comments below :)
@@ -587,11 +601,21 @@ def _viterbi_decode(self, feats):
    _, idx = torch.max(backscore, 0)
    prediction = idx.item()
    best_scores.append(softmax[prediction].item())
    scores.append([elem.item() for elem in softmax.flatten()])
    # This has been taken from https://github.com/zalandoresearch/flair/pull/642
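To make the diff above concrete: what gets appended to `scores` is a softmax over the Viterbi backscores, i.e. a probability distribution over all tags for one token. A minimal stand-alone illustration (made-up scores, plain Python instead of torch):

```python
import math

# Made-up backscores for 3 tags at one token position.
backscore = [2.0, 1.0, 0.1]

# Softmax turns the raw scores into a probability distribution.
exps = [math.exp(s) for s in backscore]
softmax = [e / sum(exps) for e in exps]

# Index of the most probable tag, analogous to torch.max(backscore, 0).
prediction = max(range(len(softmax)), key=softmax.__getitem__)

print(prediction)              # 0 -- the tag with the highest backscore
print(round(sum(softmax), 6))  # 1.0 -- a valid probability distribution
```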
It is not entirely clear to me what this code actually does. I surrounded it with a simple assertion and it seems that scores remains unchanged.
_scores_before = scores.copy()
swap_best_path, swap_max_score = (
best_path[0],
scores[-1].index(max(scores[-1])),
)
scores[-1][swap_best_path], scores[-1][swap_max_score] = (
scores[-1][swap_max_score],
scores[-1][swap_best_path],
)
assert _scores_before == scores
@mauryaland Since you added this code in your original PR (#642), can you perhaps shed some light on this? :)
Sorry for the late reply! This is done when using a CRF because sometimes the best path is not related to the max score. However, I'm not sure that my implementation is optimal or always correct.
Glad to see this feature added, and thanks @Philipduerholt for enhancing and finishing the work I started.
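The point that the Viterbi-best path can disagree with the per-token max score is easy to see in a toy example. The numbers below are invented for illustration and have nothing to do with flair's actual scores: a strong transition score can pull the best path through a tag that does not have the highest emission score at that position.

```python
import itertools

# Toy CRF in log-space: 2 positions, tags A and B. All numbers are made up.
tags = ["A", "B"]
emissions = [{"A": 0.6, "B": 0.4},   # position 0: per-token argmax is A
             {"A": 0.9, "B": 0.1}]   # position 1: per-token argmax is A
transitions = {("A", "A"): 0.1, ("A", "B"): 0.1,
               ("B", "A"): 0.9, ("B", "B"): 0.1}  # B -> A is very likely

def path_score(path):
    # Score of a tag sequence = emission scores + transition scores.
    score = sum(emissions[i][t] for i, t in enumerate(path))
    score += sum(transitions[(a, b)] for a, b in zip(path, path[1:]))
    return score

# Exhaustive search stands in for Viterbi on this tiny example.
best_path = max(itertools.product(tags, repeat=2), key=path_score)
greedy = [max(tags, key=lambda t: emissions[i][t]) for i in range(2)]

print(best_path)  # ('B', 'A') -- the strong B->A transition wins
print(greedy)     # ['A', 'A'] -- per-position argmax disagrees at position 0
```

So the tag chosen by the best path at position 0 (`B`) is not the tag with the maximum per-token score (`A`), which is the situation the swap in the code above tries to account for.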
@@ -514,26 +515,28 @@ def _calculate_loss_old(self, features, lengths, tags) -> float:
        score /= len(features)
        return score

-    def _obtain_labels(self, feature, sentences) -> List[List[Label]]:
+    def _obtain_labels(self, feature, sentences) -> (List[List[Label]], List[List[List[Label]]]):
I think it's not very transparent what the return values correspond to. Maybe we can add some explanation along the lines of:
"""
Returns a tuple of two lists:
- The first list corresponds to the most likely `Label` per token in each sentence.
- The second list contains a probability distribution over all `Labels` for each token in a sentence for all sentences.
"""
Hi @jantrienes, thank you for your timely feedback. I'm glad everything worked as expected. If there are any other suggestions, please let me know.
@Philipduerholt thanks for adding this! A lot of people will find this useful.
👍
@Philipduerholt and @mauryaland: thank you for adding this!
Hi,
this PR is about issue #745. It:

- adds `tags_proba_dist` to `Token`, which contains a list of `Label`. The list of labels represents the probability distribution over all possible labels for this token.
- changes `SequenceTagger._obtain_labels` to not only return the most probable label, but in addition a complete list of all possible labels with their respective scores.

The required changes in `SequenceTagger._viterbi_decode` are taken from #642.

Disclaimer: This is my first PR to the flair project. I'm happy for any feedback or ideas on how to improve this.
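The per-token data layout the PR describes could be sketched as follows. Note that `Label` and `Token` below are illustrative stubs written for this sketch, not flair's actual classes, and the scores are invented; only the attribute name `tags_proba_dist` comes from the PR text.

```python
# Illustrative stubs -- not flair's real Label/Token implementations.
class Label:
    def __init__(self, value, score):
        self.value, self.score = value, score

class Token:
    def __init__(self, text):
        self.text = text
        self.tags = {}             # best label per tag type
        self.tags_proba_dist = {}  # full label distribution per tag type

# A token that is most likely a location, with the full distribution kept.
token = Token("Berlin")
token.tags["ner"] = Label("S-LOC", 0.92)
token.tags_proba_dist["ner"] = [Label("S-LOC", 0.92),
                                Label("O", 0.05),
                                Label("S-PER", 0.03)]

print(token.tags["ner"].value)  # "S-LOC" -- the single most probable label
total = sum(l.score for l in token.tags_proba_dist["ner"])
print(round(total, 2))          # 1.0 -- the distribution covers all labels
```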