Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

switch to charset_normalizer for encoding to support py 3.10+ #76

Merged
merged 1 commit into from Apr 4, 2023

Conversation

brockhaywood
Copy link
Contributor

@brockhaywood brockhaywood commented Aug 8, 2022

Switches partridge to use charset_normalizer instead of cchardet.

Due to this issue: #73

Potential alternative fix: #75

@@ -55,7 +55,7 @@ def test_empty_df():
[
(b"abcde", "utf-8"), # straight up ascii is a subset of unicode
(b"Eyjafjallaj\xc3\xb6kull", "utf-8"), # actual unicode
(b"\xC4pple", "WINDOWS-1252"), # non-unicode, ISO characterset
(b"\xC4pple", "cp037"), # non-unicode, ISO characterset
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems there is some disagreement between the latest versions of the libs on py3.10 as to what this should be.

cchardet: WINDOWS-1252
chardet: ISO-8859-1
charset_normalizer: cp037

@brockhaywood brockhaywood marked this pull request as ready for review August 8, 2022 17:13
@brockhaywood
Copy link
Contributor Author

brockhaywood commented Aug 8, 2022

Looks like travis no longer exists.. and tox has been removed a6741e0 so im not sure what the compatibility of this is like.

With that said, tests pass on 3.7.4.

@dget dget merged commit bb55870 into remix:master Apr 4, 2023
@harto harto mentioned this pull request Nov 22, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants