
Question about the "word coreference" part #2

Closed
yhcc opened this issue Nov 7, 2021 · 2 comments

@yhcc

yhcc commented Nov 7, 2021

First of all, thank you for releasing the code. Really nice work; it's great to see progress without enumerating span combinations. But I have one question about the "word coreference" part. Since one word may need to be assigned to different coreference clusters (for example, in "a b c d e", if "c" is the head word of "a b c", then the mentions "a b c" and "c" could belong to different clusters), how does the "word coreference" stage handle this?

@vdobrovolskii
Owner

Hi, thanks for the interest!

I am not sure I completely understand the question. I'll try to answer the way I understand it, please clarify if needed.

First of all, coreference resolution with word representations works the same way as with span representations, i.e. each mention can only be assigned to one cluster.

Then, the assumption is that there is a one-to-one correspondence between all the valid spans in the text and their head words. That is, in "A big black cat sat on a mat" there will be three valid spans for Ontonotes: "a big black cat" -> "cat", "sat" -> "sat" and "a mat" -> "mat".

So when coreference links are found between individual words, each word is assigned to only one cluster and is then converted to only one span, so the result is a set of unique, non-overlapping coreferent clusters of spans.
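To make the conversion concrete, here is a minimal sketch (with hypothetical data, not the repository's actual code) of how word-level clusters map back to span-level clusters once each head word corresponds to exactly one span:

```python
# Hypothetical head-word -> span mapping for "A big black cat sat on a mat".
# Spans are (start, end) token indices, end exclusive.
head_to_span = {
    3: (0, 4),  # "cat" -> "A big black cat"
    4: (4, 5),  # "sat" -> "sat"
    7: (6, 8),  # "mat" -> "a mat"
}

# Word-level coreference clusters: clusters of head-word indices.
# Pretend "cat" and "mat" corefer, purely for illustration.
word_clusters = [[3, 7]]

# Each word belongs to one cluster and maps to one span, so the
# conversion to span clusters is a direct lookup.
span_clusters = [[head_to_span[w] for w in cluster] for cluster in word_clusters]
print(span_clusters)  # [[(0, 4), (6, 8)]]
```

Because the head-to-span mapping is one-to-one, no span can end up in two clusters and no two clusters can overlap.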

Now there are (very few) cases where two spans in the original Ontonotes dataset share the same head word. Almost all of them involve conjunction: there, "A" and "A & B" are different spans with the same head word, "A". In our implementation such cases were simply discarded from the training set, because they were few and we were still able to perform well, even though we could not predict any of them during inference.

To adapt this system for requirements where such cases must be handled, I would try assigning an artificial "head word" to such spans, for instance the conjunction itself.
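The filtering described above can be sketched as follows. This is an illustrative reconstruction, not the repository's code: gold spans whose head word is shared with another span are dropped from training.

```python
from collections import Counter

def filter_shared_heads(span_to_head):
    """Keep only spans whose head word is not shared with another span.

    span_to_head: dict mapping span (start, end) -> head-word index.
    Returns the surviving span -> head mapping.
    """
    counts = Counter(span_to_head.values())
    return {span: head for span, head in span_to_head.items()
            if counts[head] == 1}

# Hypothetical conjunction case: "A" (span (0, 1)) and "A & B"
# (span (0, 3)) share head word index 0, so both are discarded;
# the unrelated span (4, 5) survives.
spans = {(0, 1): 0, (0, 3): 0, (4, 5): 4}
print(filter_shared_heads(spans))  # {(4, 5): 4}
```

Under the suggestion in the comment above, one could instead keep both spans by reassigning the head of "A & B" to the conjunction token rather than discarding the pair.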

@yhcc
Author

yhcc commented Nov 13, 2021

Thanks for your answer. Your understanding is correct; that resolves my question.
