Calling nlp() on an article causes 'tokenizers/punkt/english.pickle' Not Found Error #1

Closed
codelucas opened this issue Dec 21, 2013 · 1 comment

Comments

@codelucas
Owner

I know the fix for this but will implement it tomorrow, since it's late. I'll have setup.py install the required nltk tokenizers.

@codelucas
Owner Author

Closing. Users who want the nlp() features can run one extra line to download the corpus files specified in the README.
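For readers landing here with the same error, a minimal sketch of that extra step follows. It assumes the missing resource is the nltk `punkt` tokenizer, inferred from the path in the issue title; the README's exact command is not reproduced here. `ensure_resource` and `ensure_punkt` are hypothetical helper names, not part of newspaper or nltk. The only nltk calls used are `nltk.data.find`, which raises `LookupError` when a resource is absent, and `nltk.download`.

```python
def ensure_resource(resource_path, package, find, download):
    """Download `package` only if `resource_path` cannot be located.

    `find` must raise LookupError when the resource is missing
    (as nltk.data.find does). Returns True if a download was
    triggered, False if the resource was already present.
    """
    try:
        find(resource_path)
    except LookupError:
        download(package)
        return True
    return False


def ensure_punkt():
    """Hypothetical convenience wrapper; assumes nltk is installed."""
    import nltk  # imported lazily so the helper above stays dependency-free

    # "tokenizers/punkt" and "punkt" are assumptions based on the error
    # message in the issue title.
    return ensure_resource("tokenizers/punkt", "punkt",
                           nltk.data.find, nltk.download)
```

Calling `ensure_punkt()` once before `article.nlp()` would fetch the tokenizer data only when it is not already on disk, so repeated runs skip the download.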

codelucas pushed a commit that referenced this issue Dec 17, 2014
Require BeautifulSoup4 so that pip3 install works.
codelucas added a commit that referenced this issue Jan 15, 2015
codelucas added a commit that referenced this issue Jan 22, 2015
post_cleanup more lenient, `<li>` => newlines, less strict output formatting, remove trailing media after article
hartym added a commit to hartym/newspaper that referenced this issue Jan 3, 2017
updating with exceptions for top_node = None