How to decrease False positive matches? (process.extract / WRatio) #328

Pranav082001 · 2022-08-02T06:53:31Z

I am using the process.extract method, and I know it uses WRatio under the hood to calculate the score. In the following case I get a very high score of 90 even though the strings are hardly equal. Is there any way to fix this in WRatio?

inp_name = "america"
name_list = ["american Futures and Options Exchange"]

process.extractOne(inp_name, name_list)

Output--> ('american Futures and Options Exchange', 90.0, 0)

PS: I know about alternatives like fuzz.ratio, partial_ratio, and token_sort_ratio, but WRatio works pretty well for my use case, so any workaround would be appreciated. Thanks!


maxbachmann · 2022-08-02T06:59:17Z

Maybe write your own version of WRatio, which does not fall back to the partial version of the algorithms.

Pranav082001 · 2022-08-02T07:23:05Z

Could you please help me? Do I need to set try_partial to False in def WRatio?

fuzzywuzzy/fuzzywuzzy/fuzz.py

Line 272 in af443f9

try_partial = True

maxbachmann · 2022-08-03T14:23:56Z

Yes, that's what I would try.
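For readers landing here, a rough sketch of that suggestion follows. In the linked source, try_partial is a local variable assigned inside WRatio rather than an argument, so turning it off means writing a separate scorer. The 90 most likely comes from the partial branch: "america" is an exact substring of the longer name, so the partial score is 100, and WRatio scales partial scores by 0.9. The sketch below is an approximation under stated assumptions, not the library's code: it uses difflib from the standard library as a stand-in for fuzz.ratio, keeps only the plain ratio and a token-sort ratio (token_set_ratio is left out for brevity), and the names wratio_no_partial, _ratio and _token_sort_ratio are made up for illustration.

```python
from difflib import SequenceMatcher

# Weight applied to token-based scores, mirroring the 0.95 scale used in WRatio.
UNBASE_SCALE = 0.95


def _ratio(a: str, b: str) -> float:
    """Plain similarity on a 0-100 scale (difflib stand-in for fuzz.ratio)."""
    return 100.0 * SequenceMatcher(None, a, b).ratio()


def _token_sort_ratio(a: str, b: str) -> float:
    """Compare the strings after sorting their whitespace-separated tokens."""
    sorted_a = " ".join(sorted(a.split()))
    sorted_b = " ".join(sorted(b.split()))
    return _ratio(sorted_a, sorted_b)


def wratio_no_partial(s1: str, s2: str) -> int:
    """Hypothetical WRatio-like scorer that never uses partial matching."""
    # Simplified preprocessing (an assumption): lowercase and strip only.
    p1, p2 = s1.lower().strip(), s2.lower().strip()
    if not p1 or not p2:
        return 0
    base = _ratio(p1, p2)
    token_sorted = _token_sort_ratio(p1, p2) * UNBASE_SCALE
    return int(round(max(base, token_sorted)))


if __name__ == "__main__":
    name = "american Futures and Options Exchange"
    # Without the partial fallback, the short query no longer scores near 90.
    print(wratio_no_partial("america", name))
    print(wratio_no_partial("american futures and options exchange", name))
```

With fuzzywuzzy, a function like this can be passed as the scorer, for example process.extractOne(inp_name, name_list, scorer=wratio_no_partial). For scores closer to the original, the two helpers could be swapped for fuzz.ratio and fuzz.token_sort_ratio, with fuzz.token_set_ratio added back in.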
