NLTK: downloading corpus using nltk.download() gives 403 error #1791

Closed
datNurd opened this issue Jul 27, 2017 · 5 comments

Comments

@datNurd

datNurd commented Jul 27, 2017

I've tried downloading the punkt corpus using nltk.download() as well as manually from http://www.nltk.org/nltk_data/. Both give me a 403 Forbidden error from the Varnish cache server.

@usakey

usakey commented Jul 27, 2017

A workaround can be found in #1787.

@datNurd
Author

datNurd commented Jul 27, 2017

Even the workaround in #1787 gives a 403 response, so that did not work for me either.

@usakey

usakey commented Jul 27, 2017

@pnikhilvarma try the zip file solution below rather than the Downloader one:

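export PATH_TO_NLTK_DATA=/home/username/nltk_data/
# Create the target directory first so the mv below moves the unpacked folder into it
mkdir -p "$PATH_TO_NLTK_DATA"
# Download and unpack a snapshot of the nltk_data repository
wget https://github.com/nltk/nltk_data/archive/gh-pages.zip
unzip gh-pages.zip
mv nltk_data-gh-pages "$PATH_TO_NLTK_DATA"
# Move the packages up so they sit directly under the nltk_data directory
mv "$PATH_TO_NLTK_DATA"/nltk_data-gh-pages/packages/* "$PATH_TO_NLTK_DATA"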
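
For anyone without wget/unzip at hand, the unpack-and-move part of the steps above can be sketched in Python using only the standard library. This is a rough sketch, not an official NLTK tool: the function name `install_packages` and its parameters are made up for illustration, and it assumes gh-pages.zip has already been downloaded and unpacks to a top-level `nltk_data-gh-pages/packages/` folder as in the shell commands.

```python
import shutil
import tempfile
import zipfile
from pathlib import Path


def install_packages(archive_path, target_dir, root_name="nltk_data-gh-pages"):
    """Unpack the archive and move <root_name>/packages/* into target_dir.

    Hypothetical helper: returns the sorted names of the entries it moved.
    """
    target = Path(target_dir).expanduser()
    target.mkdir(parents=True, exist_ok=True)
    moved = []
    with tempfile.TemporaryDirectory() as scratch:
        # Extract into a scratch directory so nothing else lands in target_dir.
        with zipfile.ZipFile(archive_path) as archive:
            archive.extractall(scratch)
        packages = Path(scratch) / root_name / "packages"
        for entry in sorted(packages.iterdir()):
            destination = target / entry.name
            if destination.exists():
                # Leave existing data untouched rather than overwrite it.
                continue
            shutil.move(str(entry), str(destination))
            moved.append(entry.name)
    return moved
```

Calling `install_packages("gh-pages.zip", "~/nltk_data")` should leave folders such as tokenizers/ and corpora/ directly under the target. NLTK should then find the data as long as the target is on its search path (for example ~/nltk_data, or a directory listed in the NLTK_DATA environment variable); some packages may still need to be unzipped in place.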

@kirankotari

A workaround can be found in #1792.

@alvations
Contributor

Please see #1787.
Let's consolidate this problem into #1787 so that we don't end up tracking the same bug across different issues.
