[patch:lib] Fix major kNN label scoring bug #187

eonu · 2022-06-07T21:21:51Z

Fixes #186 and generally improves the code in the KNNClassifier.

Split _find_nearest into two functions:
- _find_k_nearest: Finds the labels and weightings/scores of the k nearest neighbors using numpy.argpartition.
- _find_max_labels: Finds the labels with the highest total score among the k nearest neighbors.

A major bug in finding the labels with the highest total score was spotted by @manisci (thanks!). This has now been fixed, and splitting the function into the two described above made it easier to write unit tests for kNN.
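
The two-step split described above can be illustrated with a minimal sketch. This is not the library's actual implementation: the function signatures, the `weighting` parameter, and the use of `numpy.unique` with `numpy.bincount` to total the scores are assumptions made for illustration. Only the use of `numpy.argpartition` for selecting the k nearest neighbors comes from the description.

```python
import numpy as np


def find_k_nearest(distances, labels, k, weighting=lambda d: np.ones_like(d)):
    """Return labels and scores of the k nearest neighbours (in no particular order).

    `weighting` is a hypothetical parameter mapping distances to scores;
    the default gives every neighbour a score of 1 (uniform voting).
    """
    distances = np.asarray(distances, dtype=float)
    labels = np.asarray(labels)
    # argpartition moves the k smallest distances into the first k positions
    # without fully sorting the array. Requires 1 <= k <= len(distances).
    idx = np.argpartition(distances, k - 1)[:k]
    return labels[idx], weighting(distances[idx])


def find_max_labels(nearest_labels, nearest_scores):
    """Return every label whose summed score over the k neighbours is maximal."""
    nearest_labels = np.asarray(nearest_labels).ravel()
    # Map each neighbour to the index of its label among the unique labels,
    # then sum the scores per label.
    unique_labels, inverse = np.unique(nearest_labels, return_inverse=True)
    totals = np.bincount(inverse.ravel(), weights=nearest_scores)
    # Keep all labels tied for the highest total, not just the first one.
    # Note: exact comparison, so ties between non-trivial float weights
    # may need a tolerance in practice.
    return unique_labels[totals == totals.max()]


distances = [0.9, 0.1, 0.4, 0.3, 0.8]
labels = [0, 1, 0, 1, 2]
nearest_labels, nearest_scores = find_k_nearest(distances, labels, k=3)
print(find_max_labels(nearest_labels, nearest_scores))
```

Keeping the selection step separate from the scoring step means each can be tested with small hand-written arrays, which matches the motivation given for the split.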

Release notes for older versions will also be updated to include a warning about this bug.
Disabled progress bars for multi-processed kNN predictions due to rendering issues.
Added a _predict function for the logic of making a single prediction, and updated _chunk_predict to call _predict for each example.
Renamed _argmax to _multi_argmax for clarity.

eonu added 8 commits June 4, 2022 11:38

Fix KNN labels

13a7815

Fix kNN label issue

ecab8ec

Fix kNN tests

caacefc

Update notebooks

c0ddb39

Update contributors

9872efe

Update contributors

8959f86

Add #186 disclaimer

9a08367

Update contributors

e68afbb

eonu merged commit 7ad060a into dev Jun 7, 2022

eonu deleted the patch/knn-labels branch June 7, 2022 21:47

This was referenced Jun 9, 2022

Disable tqdm when making multi-processed predictions (and sync with KNNClassifier) #197

Closed

KNN classifier labels #186

Closed

eonu mentioned this pull request Jun 26, 2022

[release] 0.13.0 🎉 #220

Merged
