Hotfix/fix encoding cess (with test) #1141
This fix amends the encoding of the cess_cat and cess_esp corpora.
It is split into two commits:
The first commit enhances TestCess to assert on some words that decode differently under ISO-8859-2 and ISO-8859-15, in both corpora. As a result of the test enhancement, the test fails after this first commit.
The second commit changes the encoding of both corpora and then the enhanced test passes.
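
For reviewers unfamiliar with the two encodings, here is a minimal sketch of why such words make a useful test. The sample word is hypothetical and is not taken from the actual TestCess assertions; it only shows that the same bytes decode to different text under ISO-8859-2 and ISO-8859-15.

```python
# Hypothetical sample word (an assumption for illustration only,
# not one of the words asserted in TestCess).
word = "años"

# Encode once, then decode the same bytes with each encoding.
raw = word.encode("iso-8859-15")
as_latin9 = raw.decode("iso-8859-15")  # round-trips to the original word
as_latin2 = raw.decode("iso-8859-2")   # byte 0xF1 maps to a different letter

print(repr(raw))
print(as_latin9)
print(as_latin2)
print(as_latin9 == as_latin2)
```

Words made only of characters that share a byte value in both encodings would not reveal a wrong encoding, which is why the test targets words that differ.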