New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hotfix/fix encoding cess (with test) #1141

merged 2 commits into from Sep 23, 2015


None yet
2 participants
Copy link

zeehio commented Sep 23, 2015

This fix amends the encoding of teh cess_cat and cess_esp corpora.

It is split in two commits:

The first commit enhances the TestCess to assert for some words different in ISO-8859-2 and ISO-8859-15 in both corpus. As a result of the test enhancement, after this first commit the: nosetests nltk.test.unit.test_corpora.TestCess command fails.

The second commit changes the encoding of both corpora and then the enhanced test passes.


This comment has been minimized.

Copy link

stevenbird commented Sep 23, 2015

Thanks @zeehio

stevenbird added a commit that referenced this pull request Sep 23, 2015

Merge pull request #1141 from zeehio/hotfix/fix_encoding_cess
Hotfix/fix encoding cess (with test)

@stevenbird stevenbird merged commit febbc9c into nltk:develop Sep 23, 2015

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment