Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data nihreports not available anymore #72

Closed
AlessandroVol23 opened this issue Sep 23, 2019 · 4 comments
Closed

Data nihreports not available anymore #72

AlessandroVol23 opened this issue Sep 23, 2019 · 4 comments

Comments

@AlessandroVol23
Copy link
Contributor

Some datasets are not available anymore.

For example the following:
nihtraindata = shorttext.data.nihreports(sample_size=None)

Error message:

Downloading...
Source:  http://storage.googleapis.com/pyshorttext/nih_grant_public/nih_full.csv.zip
Failure to download file!
(<class 'urllib.error.HTTPError'>, <HTTPError 404: 'Not Found'>, <traceback object at 0x7f09063ed788>)

Python error:

HTTPError: HTTP Error 404: Not Found

During handling of the above exception, another exception occurred:

When opening the link the same error appears:

image

@stephenhky
Copy link
Owner

Thanks for submitting the issue. This is happening for all datasets because the old bucket is no longer working.

I am finding a new place for the datasets and I will publish a new release.

@AlessandroVol23
Copy link
Contributor Author

Perfect thanks! Do you have a link to the mentioned dataset? Because I'd need it for a tutorial at the moment . A small sample would be more then enough. Unfortunately I can't find it online at the moment.

Thanks! :-)

@stephenhky
Copy link
Owner

At the moment, download it from here first: https://github.com/stephenhky/PyShortTextCategorization/tree/aws_s3/data

@stephenhky
Copy link
Owner

Release 1.1.5 has been published for the fix.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants