-
Notifications
You must be signed in to change notification settings - Fork 514
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] DBSCAN results incorrect #80
Comments
branch-0.5 should be compared against 0.4 release. |
what is [1]? |
Updated original comment |
I ran @daxionshu's notebook against branches 0.5, 0.4, 0.3. This means this has been broken since before the refactor. I believe the sklearn toy datasets should be tested even on the C++ side. That way when results don't match it's very clear to see which layer bugs were introduced. |
@cjnolet which of these issues against dbscan needs to be prioritized? 54, 63 or 80? |
Also, is there a standalone python script that could repro this mismatch? (Sorry, if you have had it somewhere already!) |
I’m going to go ahead and close this for now since we have discussed how the subtle differences in eps affect the results. |
@daxiongshu ran our DBSCAN & k-means implementations against [1] and found that our results do not match, even for datasets as small as size 2^10.
[1] https://scikit-learn.org/stable/auto_examples/cluster/plot_cluster_comparison.html
The text was updated successfully, but these errors were encountered: