New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fetch_openml('zoo') raises IndexError in sklearn.datasets.openml._convert_arff_data #14340
Comments
Thanks for the report, looks like a bug indeed. |
we should be ignoring the ignored features (in this case 'animal') and change the indices accordingly. see https://github.com/openml/openml-python/blob/347c4a6c2a7b072de574d2bd2f5e0952f6375a84/openml/datasets/dataset.py#L537 for a reference implementation. |
@amueller Hi if this issue is not hard to solve, can I take a look at it? |
@corona10 Not entirely sure how hard it is, feel free to have a look. |
@corona10 Hi, are you still working on this issue? Otherwise, I can take it on. |
Go ahead please |
The shape extraction from data_qualities was using NumberOfFeatures, which excluded the ignored features. This exclusion caused a bug in the data conversion, since we tried to reshape the whole dataset with a lower number of features. This commit returns all features in the shape extraction. Fixes scikit-learn#14340
Hi, |
The shape extraction from data_qualities was using NumberOfFeatures, which excluded the ignored features. This exclusion caused a bug in the data conversion, since we tried to reshape the whole dataset with a lower number of features. This fix uses data_features to include ignored features in the shape extraction Fixes scikit-learn#14340
The shape extraction from data_qualities was using NumberOfFeatures, which excluded the ignored features. This exclusion caused a bug in the data conversion, since we tried to reshape the whole dataset with a lower number of features. This fix uses data_features to include ignored features in the shape extraction Fixes scikit-learn#14340
The shape extraction from data_qualities was using NumberOfFeatures, which excluded the ignored features. This exclusion caused a bug in the data conversion, since we tried to reshape the whole dataset with a lower number of features. This fix uses data_features to include ignored features in the shape extraction Fixes scikit-learn#14340
The shape extraction from data_qualities was using NumberOfFeatures, which excluded the ignored features. This exclusion caused a bug in the data conversion, since we tried to reshape the whole dataset with a lower number of features. This fix uses data_features to include ignored features in the shape extraction Fixes scikit-learn#14340
The shape extraction from data_qualities was using NumberOfFeatures, which excluded the ignored features. This exclusion caused a bug in the data conversion, since we tried to reshape the whole dataset with a lower number of features. This fix uses data_features to include ignored features in the shape extraction Fixes scikit-learn#14340
The shape extraction from data_qualities was using NumberOfFeatures, which excluded the ignored features. This exclusion caused a bug in the data conversion, since we tried to reshape the whole dataset with a lower number of features. This fix uses data_features to include ignored features in the shape extraction Fixes scikit-learn#14340
OpenML 'zoo' dataset fails to load.
First reported as openml/OpenML#989
Versions
The text was updated successfully, but these errors were encountered: