NLTK downloading corpus using nltk.download() giving 403 error #1791

datNurd · 2017-07-27T04:45:49Z

I've tried downloading the punkt corpus using nltk.download() as well as manually from http://www.nltk.org/nltk_data/. Both give me a 403 Forbidden error from the Varnish cache server.

usakey · 2017-07-27T05:01:29Z

A workaround can be found in #1787.

datNurd · 2017-07-27T06:54:34Z

Even the workaround in #1787 gives a 403 response, so it did not work for me.

usakey · 2017-07-27T07:52:07Z

@pnikhilvarma try the zip file solution below rather than the Downloader one:

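# Target directory for the NLTK data (replace "username" with your own)
export PATH_TO_NLTK_DATA=/home/username/nltk_data/
# Make sure the target exists, so the first mv below moves the folder into it
mkdir -p "$PATH_TO_NLTK_DATA"
wget https://github.com/nltk/nltk_data/archive/gh-pages.zip
unzip gh-pages.zip
mv nltk_data-gh-pages "$PATH_TO_NLTK_DATA"
# Move the packages up one level, directly under the nltk_data directory
mv "$PATH_TO_NLTK_DATA"/nltk_data-gh-pages/packages/* "$PATH_TO_NLTK_DATA"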
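
For anyone without wget/unzip at hand, the unzip-and-move part of the steps above could be sketched in Python roughly like this. It is only a sketch using the standard library: the function name `install_nltk_packages` and its parameters are made up for illustration, it assumes gh-pages.zip has already been downloaded, and it assumes the archive unpacks to a `nltk_data-gh-pages/packages` folder as in the shell commands.

```python
import shutil
import tempfile
import zipfile
from pathlib import Path


def install_nltk_packages(zip_path, target_dir, top_folder="nltk_data-gh-pages"):
    """Extract the archive and move everything under packages/ into target_dir.

    Hypothetical helper: `top_folder` is the folder name the archive is
    assumed to unpack to (taken from the shell commands above).
    Returns the names of the entries that were moved.
    """
    target = Path(target_dir).expanduser()
    # Create the target first so the moves below always land inside it.
    target.mkdir(parents=True, exist_ok=True)
    moved = []
    with tempfile.TemporaryDirectory() as tmp:
        with zipfile.ZipFile(zip_path) as archive:
            archive.extractall(tmp)
        packages = Path(tmp) / top_folder / "packages"
        if not packages.is_dir():
            raise FileNotFoundError(f"no packages folder found in {zip_path}")
        for entry in sorted(packages.iterdir()):
            destination = target / entry.name
            if destination.exists():
                # Leave data that is already installed untouched.
                continue
            shutil.move(str(entry), str(destination))
            moved.append(entry.name)
    return moved
```

It would be called as something like `install_nltk_packages("gh-pages.zip", "~/nltk_data")`. If the target is not one of the locations NLTK searches by default, appending it to `nltk.data.path` should make the data visible.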

kirankotari · 2017-07-27T08:25:17Z

A workaround can be found in #1792.

alvations · 2017-07-27T08:27:06Z

Please see #1787.
Let's consolidate this problem into #1787 so that we don't end up tracking the same bug across different issues.

alvations closed this as completed Jul 27, 2017
