Matching % seems incorrect #91

oom- · 2021-02-21T20:29:30Z

I just tried the example:

stringSimilarity.compareTwoStrings("healed", "sealed");
//0.8

=> 80% for a 1 letter change.

stringSimilarity.compareTwoStrings("healed", "ehaled");
//0.6

=> 60% for swapping 2 letters

OK, I get it, but then I tried another word that is one letter shorter (5 characters vs. 6):

stringSimilarity.compareTwoStrings("fuira", "fuia");
//0.57

=> 57% for a 1 letter change (23 points lower than the 80% above)

stringSimilarity.compareTwoStrings("furia", "fuira");
//0.25

=> 25% for swapping 2 letters (35 points lower than the 60% above)

It seems to me that the shorter the string, the more severely a small change is penalized.
Is there a way to make the score "average" out, independent of the string length?

aceakash · 2021-03-11T18:43:00Z

@oom- Different algorithms have different trade-offs. This library implements the Sørensen–Dice coefficient as the similarity score. I would encourage you to try out other string comparison algorithms to see which one best fits your needs. This might be a good starting point.
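To illustrate why the scores above drop faster for shorter strings, here is a minimal sketch of a Sørensen–Dice score over character bigrams, next to a length-normalized Levenshtein similarity as one possible alternative to try. Both functions (`diceSimilarity`, `editSimilarity`) are hypothetical helpers written for this thread, not the library's actual source, and the Levenshtein normalization (dividing by the longer string's length) is just one common convention.

```javascript
// Sketch only: NOT the string-similarity library's source code.

// Count each adjacent character pair (bigram) in a string.
function bigrams(s) {
  const counts = new Map();
  for (let i = 0; i < s.length - 1; i++) {
    const pair = s.slice(i, i + 2);
    counts.set(pair, (counts.get(pair) || 0) + 1);
  }
  return counts;
}

// Dice score: 2 * shared bigrams / (bigrams in a + bigrams in b).
function diceSimilarity(a, b) {
  if (a === b) return 1;
  if (a.length < 2 || b.length < 2) return 0;
  const left = bigrams(a);
  let shared = 0;
  for (let i = 0; i < b.length - 1; i++) {
    const pair = b.slice(i, i + 2);
    const remaining = left.get(pair) || 0;
    if (remaining > 0) {
      left.set(pair, remaining - 1); // each bigram can be matched only once
      shared++;
    }
  }
  return (2 * shared) / (a.length - 1 + (b.length - 1));
}

// Classic Levenshtein distance (insert, delete, substitute), two-row version.
function levenshtein(a, b) {
  let prev = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr[j] = Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost);
    }
    prev = curr;
  }
  return prev[b.length];
}

// Assumed normalization: 1 - distance / length of the longer string.
function editSimilarity(a, b) {
  const longest = Math.max(a.length, b.length);
  return longest === 0 ? 1 : 1 - levenshtein(a, b) / longest;
}

console.log(diceSimilarity("healed", "sealed")); // 4 of 5 bigrams shared: 8/10 = 0.8
console.log(diceSimilarity("furia", "fuira"));   // 1 of 4 bigrams shared: 2/8 = 0.25
console.log(editSimilarity("furia", "fuira"));   // 2 edits over 5 characters: 0.6
```

A swap of two adjacent letters touches up to three bigrams. "healed" has five bigrams, so two survive the swap, while "furia" has only four, so just one survives. That is why the same kind of edit costs more on the shorter word. The edit-distance variant still depends on length, but each edit costs a fixed 1/length rather than several bigrams at once.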

aceakash closed this as completed Mar 11, 2021
