New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hotfix/fix encoding cess (with test) #1141

Merged
merged 2 commits into from Sep 23, 2015

Conversation

Projects
None yet
2 participants
@zeehio
Copy link
Contributor

zeehio commented Sep 23, 2015

This fix amends the encoding of teh cess_cat and cess_esp corpora.

It is split in two commits:

The first commit enhances the TestCess to assert for some words different in ISO-8859-2 and ISO-8859-15 in both corpus. As a result of the test enhancement, after this first commit the: nosetests nltk.test.unit.test_corpora.TestCess command fails.

The second commit changes the encoding of both corpora and then the enhanced test passes.

@stevenbird

This comment has been minimized.

Copy link
Member

stevenbird commented Sep 23, 2015

Thanks @zeehio

stevenbird added a commit that referenced this pull request Sep 23, 2015

Merge pull request #1141 from zeehio/hotfix/fix_encoding_cess
Hotfix/fix encoding cess (with test)

@stevenbird stevenbird merged commit febbc9c into nltk:develop Sep 23, 2015

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment