Hotfix/fix encoding cess (with test) #1141

stevenbird merged 2 commits into nltk:develop on Sep 23, 2015


@zeehio zeehio commented Sep 23, 2015

This fix amends the encoding of the cess_cat and cess_esp corpora.

It is split in two commits:

The first commit enhances TestCess to assert on some words, in both corpora, that decode differently under ISO-8859-2 and ISO-8859-15. As a result of the test enhancement, after this first commit the command nosetests nltk.test.unit.test_corpora.TestCess fails.

The second commit changes the encoding of both corpora, after which the enhanced test passes.
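
To illustrate why a test on individual words can catch this kind of encoding mix-up, here is a minimal sketch in plain Python. It does not use NLTK or the actual TestCess assertions; the sample word "año" and the helper name `decodes_differently` are assumptions chosen for illustration only. It shows that the same bytes yield different text depending on which of the two encodings is used to decode them.

```python
# Illustrative only: the sample words are hypothetical and are not taken
# from the cess_cat / cess_esp test in this pull request.

def decodes_differently(raw: bytes) -> bool:
    """Return True if the bytes decode to different text in the two encodings."""
    return raw.decode("iso-8859-2") != raw.decode("iso-8859-15")

# Byte 0xF1 is "ñ" in ISO-8859-15 but "ń" in ISO-8859-2.
raw = "año".encode("iso-8859-15")

as_latin9 = raw.decode("iso-8859-15")  # 'año'
as_latin2 = raw.decode("iso-8859-2")   # 'ańo'

print(as_latin9, as_latin2, decodes_differently(raw))

# Words made only of bytes that both encodings share (for example "é", 0xE9)
# cannot reveal the problem, so a test needs words like the one above.
print(decodes_differently("café".encode("iso-8859-15")))
```

A word containing only characters that both encodings map to the same byte would pass under either setting, which is why the test has to pick words that differ between the two.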

Thanks @zeehio

stevenbird added a commit that referenced this pull request Sep 23, 2015
Hotfix/fix encoding cess (with test)
@stevenbird stevenbird merged commit febbc9c into nltk:develop Sep 23, 2015