Hotfix/fix encoding cess (with test) #1141

Merged
stevenbird merged 2 commits into nltk:develop on Sep 23, 2015

Conversation

@zeehio
Contributor

@zeehio zeehio commented Sep 23, 2015

This fix amends the encoding of the cess_cat and cess_esp corpora.

It is split in two commits:

The first commit enhances TestCess to assert on some words that decode differently under ISO-8859-2 and ISO-8859-15 in both corpora. As a result of the test enhancement, the nosetests nltk.test.unit.test_corpora.TestCess command fails after this first commit.

The second commit changes the encoding of both corpora and then the enhanced test passes.
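
To illustrate why the choice between the two encodings matters, here is a minimal Python sketch. It is not part of the commits, and the sample words are assumptions chosen for illustration, not necessarily the ones asserted in TestCess. The same bytes decode to different characters under ISO-8859-2 and ISO-8859-15:

```python
# Illustrative only: sample words are hypothetical, not taken from the test suite.
samples = ["años", "què"]  # one Spanish word, one Catalan word

results = {}
for word in samples:
    # Encode as ISO-8859-15 (Latin-9), then decode the same bytes both ways.
    raw = word.encode("iso-8859-15")
    as_latin9 = raw.decode("iso-8859-15")
    as_latin2 = raw.decode("iso-8859-2")
    results[word] = (as_latin9, as_latin2)
    print(repr(raw), as_latin9, as_latin2)
```

A test that compares a decoded corpus word against its expected spelling will therefore fail whenever the reader is configured with the wrong one of these two encodings.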

@stevenbird
Member

@stevenbird stevenbird commented Sep 23, 2015

Thanks @zeehio

stevenbird added a commit that referenced this pull request Sep 23, 2015
Hotfix/fix encoding cess (with test)
@stevenbird stevenbird merged commit febbc9c into nltk:develop Sep 23, 2015