Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with Loading Japanese Vowel Dataset #113

Closed
ashe-v00 opened this issue Jun 26, 2023 · 0 comments
Closed

Issue with Loading Japanese Vowel Dataset #113

ashe-v00 opened this issue Jun 26, 2023 · 0 comments

Comments

@ashe-v00
Copy link

ashe-v00 commented Jun 26, 2023

Hello!

I am having some issues accessing the Japanese Vowels dataset through ReservoirPy. I believe the URL that points towards the database is out of date as the link (https://archive.ics.uci.edu/ml/machine-learning-databases/JapaneseVowels-mld/) points to a page that subsequently says "NOT FOUND." I found a potentially updated link to the database (https://archive.ics.uci.edu/dataset/128/japanese+vowels or http://doi.org/10.24432/C5NS47)? However, when I downloaded it to my local computer, I am unable to get ReservoirPy to load the files. Do you have any suggestions on how to get it to load from a local directory?

Thank you in advance for your help!

import reservoirpy.datasets as rpyds
xtra, ytra, xtest, ytest = rpyds.japanese_vowels()

OR

xtra, ytra, xtest, ytest = rpyds.japanese_vowels(data_folder="/directory/to/japanesevowels")

Error:
KeyError Traceback (most recent call last)
Cell In[24], line 2
1 import reservoirpy.datasets as rpyds
----> 2 xtr, ytr, xtes, ytes = rpyds.japanese_vowels(data_folder="/directory/to/japanesevowels")

File ~/Path/to/lib/python3.10/site-packages/reservoirpy/datasets/_japanese_vowels.py:158, in japanese_vowels(one_hot_encode, repeat_targets, data_folder, reload)
156 data_files[file_role] = _download(data_folder, file_name, file_role)
157 else:
--> 158 data_files[file_role] = joblib.load(data_folder / file_path)
160 X_train, Y_train = _format_data(
161 data_files["train"], data_files["train_sizes"], one_hot_encode
162 )
164 X_test, Y_test = _format_data(
165 data_files["test"], data_files["test_sizes"], one_hot_encode
166 )

File ~/Path/to/lib/python3.10/site-packages/joblib/numpy_pickle.py:658, in load(filename, mmap_mode)
652 if isinstance(fobj, str):
653 # if the returned file object is a string, this means we
654 # try to load a pickle file generated with an version of
655 # Joblib so we load it with joblib compatibility function.
656 return load_compatibility(fobj)
--> 658 obj = _unpickle(fobj, filename, mmap_mode)
...
-> 1213 dispatch[key[0]] (self)
1214 except _Stop as stopinst:
1215 return stopinst.value

KeyError: 60


Version Info: Python 3.10.9, MacOS Ventura (13.4.1 (22F82))

Package Version


joblib 1.2.0
numpy 1.22.4
pickleshare 0.7.5
reservoirpy 0.3.8
scipy 1.7.3

Downloaded Data From https://archive.ics.uci.edu/dataset/128/japanese+vowels

japanese+vowels.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants