Remove Pattern lib dependency in News Classification tutorial #1118

Merged
merged 2 commits into from
Jan 29, 2017


luizcavalcanti
Contributor

Even though NLTK is old, Pattern does not currently work on Python 3. Changing this tutorial to use NLTK for lemmatization lets it run on both Python 2.5+ and 3.x. We also make sure the user's NLTK packages are downloaded and up to date.
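For readers who want a feel for the approach, here is a minimal sketch of it, not the tutorial's actual code. The helper names (`ensure_nltk_data`, `nltk_tools`, `preprocess`) are hypothetical, and the use of `WordNetLemmatizer` with the `wordnet` package is an assumption; the PR itself only names the `stopwords` package.

```python
import re


def ensure_nltk_data(packages=("stopwords", "wordnet")):
    """Fetch the listed NLTK packages from Python instead of the shell.

    Assumption: 'wordnet' is the data needed for lemmatization.
    Returns a dict mapping each package name to nltk.download's result.
    """
    import nltk  # imported lazily so preprocess() works without NLTK

    return {name: nltk.download(name, quiet=True) for name in packages}


def nltk_tools():
    """Return (lemmatize, stop_words) backed by NLTK.

    Requires the data fetched by ensure_nltk_data().
    """
    from nltk.corpus import stopwords
    from nltk.stem import WordNetLemmatizer

    lemmatizer = WordNetLemmatizer()
    return lemmatizer.lemmatize, set(stopwords.words("english"))


def preprocess(text, lemmatize, stop_words):
    """Lowercase, tokenize on letters, drop stop words, lemmatize the rest."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [lemmatize(token) for token in tokens if token not in stop_words]
```

With NLTK installed, calling `ensure_nltk_data()` once and then `preprocess(text, *nltk_tools())` should behave the same way on Python 2 and Python 3. Passing the lemmatizer and stop-word set in as arguments keeps `preprocess` free of any Pattern or NLTK import.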

Makes sure the user has downloaded the 'stopwords' package from NLTK without
having to do it from shell.

Signed-off-by: Luiz Carlos Cavalcanti <cavalcanti.luiz@gmail.com>
Since Pattern lib is not yet supported on Python 3, NLTK is now also used for
lemmatization, removing the dependency on Pattern. It should now be possible
to run this tutorial on both Python 2.5+ and 3.x.

Signed-off-by: Luiz Carlos Cavalcanti <cavalcanti.luiz@gmail.com>
Contributor

tmylk commented Jan 29, 2017

Thanks for supporting Python 3.
CC @dsquareindia

tmylk merged commit 9cb4910 into piskvorky:develop on Jan 29, 2017