-
Notifications
You must be signed in to change notification settings - Fork 91
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
division by 0 in fuzzy_join when there are only perfect matches #764
Comments
Good catch, this is an important special case. Ideally, when there are only exact matches, we shouldn't even go down that code path and only use standard matching from the backend (pandas, polars...), as it will be much faster. |
Yes, indeed, you are right. This is IMO equivalent to #730 if we agree that we should rely on the backend for perfect matches.. |
this is not the same as when the user requires perfect matches by setting |
fixed by #802 |
Describe the bug
if we join 2 tables on columns for which matches do exist, the rescaling of nearest neighbor distances does a division by 0; we get a warning and the matching scores are NaN
Steps/Code to Reproduce
Expected Results
no division by 0, scores are numbers (prob. equal to 1)
Actual Results
division by 0, scores are NaN
Versions
The text was updated successfully, but these errors were encountered: