'LC-40LE830U' and 'LC-60LE830U' returns 90% match #9

Closed
johnlyn763 opened this Issue Apr 24, 2012 · 1 comment

These two strings have a 90% match, regardless of the algorithm -- I tried WRatio, ratio, partial_ratio, etc.

Is there some way to detect that single-digit difference (the "4" and the "6") in the two strings? My thinking is that the mismatched numbers should lower the score.

Thanks

Owner

acslater00 commented Aug 9, 2012

Unfortunately, they score a 90% match because they are 11-character strings that share 10 characters. There's basically no way to disambiguate those strings using fuzzy string matching alone, since fuzzy string matching is premised on the idea that a small number of mismatched characters is acceptable in a match. Closing.
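
For anyone landing here with the same problem, here is a minimal sketch of where the score comes from and one possible workaround. It uses only the standard library's `difflib.SequenceMatcher`, on the assumption that this library's `ratio` is built on a comparable measure. The helper names `plain_ratio` and `digit_aware_ratio` and the `penalty` parameter are hypothetical, made up for this illustration, and are not part of the library.

```python
import re
from difflib import SequenceMatcher


def plain_ratio(a, b):
    """Similarity in [0, 1], computed as 2 * matched_chars / total_chars."""
    return SequenceMatcher(None, a, b).ratio()


def digit_aware_ratio(a, b, penalty=0.5):
    """Hypothetical helper: scale the score down when the digit runs differ.

    The penalty value is an arbitrary assumption; tune it for your data.
    """
    score = plain_ratio(a, b)
    # Compare the sequences of digit runs, e.g. ['40', '830'] vs ['60', '830'].
    if re.findall(r"\d+", a) != re.findall(r"\d+", b):
        score *= penalty
    return score


a, b = "LC-40LE830U", "LC-60LE830U"

# 10 of the 11 characters line up, so the plain score is 2 * 10 / 22.
print(round(plain_ratio(a, b), 3))

# The differing model numbers ("40" vs "60") pull the adjusted score down.
print(round(digit_aware_ratio(a, b), 3))
```

The design choice is to leave the fuzzy score alone and layer a domain rule on top of it: for model numbers, a differing digit run is treated as a meaningful mismatch rather than a typo. Whether that rule fits depends on your data.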

acslater00 closed this Aug 9, 2012
