Issue with Loading Japanese Vowel Dataset #113

ashe-v00 · 2023-06-26T17:11:31Z

Hello!

I am having some issues accessing the Japanese Vowels dataset through ReservoirPy. I believe the URL that points towards the database is out of date as the link (https://archive.ics.uci.edu/ml/machine-learning-databases/JapaneseVowels-mld/) points to a page that subsequently says "NOT FOUND." I found a potentially updated link to the database (https://archive.ics.uci.edu/dataset/128/japanese+vowels or http://doi.org/10.24432/C5NS47)? However, when I downloaded it to my local computer, I am unable to get ReservoirPy to load the files. Do you have any suggestions on how to get it to load from a local directory?

Thank you in advance for your help!

import reservoirpy.datasets as rpyds
xtra, ytra, xtest, ytest = rpyds.japanese_vowels()

OR

xtra, ytra, xtest, ytest = rpyds.japanese_vowels(data_folder="/directory/to/japanesevowels")

Error:
KeyError Traceback (most recent call last)
Cell In[24], line 2
1 import reservoirpy.datasets as rpyds
----> 2 xtr, ytr, xtes, ytes = rpyds.japanese_vowels(data_folder="/directory/to/japanesevowels")

File ~/Path/to/lib/python3.10/site-packages/reservoirpy/datasets/_japanese_vowels.py:158, in japanese_vowels(one_hot_encode, repeat_targets, data_folder, reload)
156 data_files[file_role] = _download(data_folder, file_name, file_role)
157 else:
--> 158 data_files[file_role] = joblib.load(data_folder / file_path)
160 X_train, Y_train = _format_data(
161 data_files["train"], data_files["train_sizes"], one_hot_encode
162 )
164 X_test, Y_test = _format_data(
165 data_files["test"], data_files["test_sizes"], one_hot_encode
166 )

File ~/Path/to/lib/python3.10/site-packages/joblib/numpy_pickle.py:658, in load(filename, mmap_mode)
652 if isinstance(fobj, str):
653 # if the returned file object is a string, this means we
654 # try to load a pickle file generated with an version of
655 # Joblib so we load it with joblib compatibility function.
656 return load_compatibility(fobj)
--> 658 obj = _unpickle(fobj, filename, mmap_mode)
...
-> 1213 dispatch[key[0]] (self)
1214 except _Stop as stopinst:
1215 return stopinst.value

KeyError: 60

Version Info: Python 3.10.9, MacOS Ventura (13.4.1 (22F82))

Package Version

joblib 1.2.0
numpy 1.22.4
pickleshare 0.7.5
reservoirpy 0.3.8
scipy 1.7.3

Downloaded Data From https://archive.ics.uci.edu/dataset/128/japanese+vowels

japanese+vowels.zip

The text was updated successfully, but these errors were encountered:

nTrouvain added a commit that referenced this issue Jul 17, 2023

Fix: wrong link and data format in Japanese vowels dataset (#113)

8d48ebf

nTrouvain closed this as completed Jul 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with Loading Japanese Vowel Dataset #113

Issue with Loading Japanese Vowel Dataset #113

ashe-v00 commented Jun 26, 2023 •

edited

Issue with Loading Japanese Vowel Dataset #113

Issue with Loading Japanese Vowel Dataset #113

Comments

ashe-v00 commented Jun 26, 2023 • edited

ashe-v00 commented Jun 26, 2023 •

edited