n-best tagging results #45

usptact · 2015-05-18T22:37:33Z

Is possible to return n the most likely predictions using CRF? If so, which place should be modified in the source code to get this behavior since I could not find any parameter that gives this.

Thank you.

kmike · 2015-05-18T22:49:38Z

Hey @usptact,

CRFsuite doesn't currently support n-best tagging.

It seems the relevant code is

crfsuite/lib/crf/src/crf1d_context.c

Line 466 in 5566039

floatval_t crf1dc_viterbi(crf1d_context_t* ctx, int *labels)

usptact · 2015-05-19T00:09:44Z

kmike,

Thank you a lot for a pointer! I read elsewhere that for Vitterbi based algorithms one needs to increase the beam size. I am not sure what it means.

kmike · 2015-05-19T00:29:39Z

Currently the function computes a single max_score, stores a single backward link at each j, and finds a single best label sequence using these backwads links.

If I'm not mistaken, for n-best parsing you need to keep top-n max_score values, n best backward links at each position j and use them to compute n best label sequences.

There are also more efficient algorithms for n-best decoding, see e.g. http://www.keerthis.com/P12-1064.pdf for an overview.

kmike · 2015-05-19T00:36:19Z

As a side not, Wapiti CRF toolkit supports n-best decoding.
Implementation is not optimal though (see Jekub/Wapiti#2).

usptact · 2015-05-19T05:31:39Z

Thank you very much, kmike! I am playing with Wapiti right now and trying to assess the top n-best results. Up to this moment I was always relying on the top-1 result which was not the best in all the cases. I am curios whether good tagging is in the n-best results list.

usptact closed this as completed May 20, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

n-best tagging results #45

n-best tagging results #45

usptact commented May 18, 2015

kmike commented May 18, 2015

usptact commented May 19, 2015

kmike commented May 19, 2015

kmike commented May 19, 2015

usptact commented May 19, 2015

n-best tagging results #45

n-best tagging results #45

Comments

usptact commented May 18, 2015

kmike commented May 18, 2015

usptact commented May 19, 2015

kmike commented May 19, 2015

kmike commented May 19, 2015

usptact commented May 19, 2015