-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
A bug in the cross_validation
function
#995
Comments
The implementation for the cross-validation algorithm has been up for debate for quite some time now. Refer to this issue and this issue for details. It needs to be re-implemented but we haven't reached a conclusion yet. This will probably be updated in the newer versions of AIMA. |
I see a |
The debate here is that the conventional meaning of |
@pyaf sir I made a PR related 995 issue now what else we can do here |
Hi! I would like to know whether this issue is resolved. |
@pyaf can you review my PR |
@hackerashish25 you should ask aima-python mentors for that :) |
Hi,
Currently,
cross_validation
function shuffles data before every step of k fold cross-validation [here]. Which is wrong because of this the test set at each step is not unique (when compared to all the other steps) which gives us the wrong k fold cross validation perfomance score.Instead, we should shuffle the dataset before performing k fold cross validation (rather than doing the same at each step of k fold cross-validation).
Let me know your thoughts.
The text was updated successfully, but these errors were encountered: