WIP: blindly change all encoding to UTF-8 #224
Closed
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
My public key doesn't properly decode when loaded with PGPy:
In [38]: pgpy.PGPKey.from_file('anarcat.gpg')[0].userids[0].name
Out[38]: 'Antoine Beaupré'
By switching to UTF-8 everywhere, it gets properly parsed. Using
Latin-1 is not a right default: it is europe-centric and will
necessarily break whenever another language than english or some
european countries is used (e.g. any asian language will fail). As you
can see, even french accents fail, if they are (properly) encoded in
UTF-8.
This is a stopgap measure: I am not sure how to decode user
identifiers properly, nor if this touches on too much stuff to fix the
actual problem. But I would assert that using UTF-8 is a better
default than latin-1.