Maxsum returns wrong similarities #92

kunihik0 · 2022-02-23T09:54:29Z

Hi, thank you for the great KeyBERT library!
I think the way _maxsum.py looks up the distance by index is wrong. The similarity between the sentence and each candidate is read with a different index than the one that actually belongs to that candidate, so the wrong similarity is returned.
I think the following line needs to be changed:

return [(words_vals[idx], round(float(distances[0][idx]), 4)) for idx in candidate]
to
return [(words_vals[idx], round(float(distances[0][words_idx[idx]]), 4)) for idx in candidate]

https://github.com/MaartenGr/KeyBERT/blob/master/keybert/_maxsum.py#:~:text=return%20%5B(words_vals%5Bidx%5D%2C%20round(float(distances%5B0%5D%5Bidx%5D)%2C%204))%20for%20idx%20in%20candidate%5D
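
To illustrate the mismatch, here is a minimal sketch with made-up values. Only the two return expressions come from the lines quoted above; the word list, the similarity values, the nr_candidates setting, and the chosen candidate tuple are assumptions for illustration, and the way words_idx and words_vals are built is my reading of how _maxsum.py selects its candidates, not a copy of it.

```python
import numpy as np

# Hypothetical data: one row of document-to-word similarities.
words = ["alpha", "beta", "gamma", "delta", "epsilon"]
distances = np.array([[0.10, 0.90, 0.30, 0.80, 0.50]])

# Assumed candidate selection: keep the nr_candidates most similar words.
# words_idx holds positions in the FULL word list.
nr_candidates = 3
words_idx = list(distances.argsort()[0][-nr_candidates:])
words_vals = [words[i] for i in words_idx]

# Assume the max-sum search picked these positions WITHIN the candidate subset.
candidate = (0, 2)

# Original expression: idx is a subset position, but it indexes the full row.
buggy = [(words_vals[idx], round(float(distances[0][idx]), 4)) for idx in candidate]

# Proposed expression: map the subset position back through words_idx first.
fixed = [(words_vals[idx], round(float(distances[0][words_idx[idx]]), 4)) for idx in candidate]

print(buggy)  # [('epsilon', 0.1), ('beta', 0.3)]
print(fixed)  # [('epsilon', 0.5), ('beta', 0.9)]
```

In this toy case the original expression pairs "epsilon" and "beta" with the similarities of "alpha" and "gamma", while the proposed one returns each word's own similarity.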


MaartenGr · 2022-02-27T06:25:44Z

Sorry for the late response and thank you for tracking this down! I'll make sure it gets fixed in the next release.

kunihik0 · 2022-03-02T09:38:47Z

Thank you for your reply.
I had never heard of forking before, but I have now learned a bit about it.
If possible, please use this pull request once you have checked it.

MaartenGr · 2022-03-25T09:05:19Z

Thank you for the pull request! It was merged and will be included in the next PyPI release.

kunihik0 · 2022-03-25T09:21:02Z

Thanks for checking and merging!

kunihik0 mentioned this issue Mar 2, 2022

fix(maxsum): Fix the way to specify the distance by index. #94

Merged

MaartenGr closed this as completed Mar 25, 2022
