Audit: multi-byte character awareness & handling #759

schlessera · 2022-07-04T08:44:23Z

Some parts of the Requests library rely heavily on string manipulation. Parts of the code calculate string lengths, extract substrings, and so on.

We need to do a full audit of the string handling in Requests to ensure it handles multi-byte characters gracefully and appropriately. In some instances, this means properly discarding multi-byte characters upfront, because the relevant RFC, standard, or protocol disallows their use.

In other instances, we need them to pass through the string handling intact, without producing invalid characters or random mismatches caused by truncating a multi-byte character partway through.
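To illustrate the truncation problem, here is a minimal sketch in Python (not PHP, and not code from Requests itself). The function names `truncate_bytes_naive` and `truncate_bytes_safe` are hypothetical and exist only for this example. It shows how cutting a UTF-8 string at a byte offset can split a character, and one possible way to avoid that.

```python
def truncate_bytes_naive(text: str, max_bytes: int) -> bytes:
    """Cut the UTF-8 encoding at a byte offset, ignoring character boundaries."""
    return text.encode("utf-8")[:max_bytes]


def truncate_bytes_safe(text: str, max_bytes: int) -> str:
    """Cut at a byte budget, then drop any incomplete trailing sequence."""
    cut = text.encode("utf-8")[:max_bytes]
    # errors="ignore" discards the dangling partial sequence left by the cut.
    return cut.decode("utf-8", errors="ignore")


sample = "naïve café"

# Character count and byte count differ once multi-byte characters appear.
char_count = len(sample)
byte_count = len(sample.encode("utf-8"))

# The naive cut ends in the middle of "ï", so the result is not valid UTF-8.
broken = truncate_bytes_naive(sample, 3)
try:
    broken.decode("utf-8")
    naive_is_valid = True
except UnicodeDecodeError:
    naive_is_valid = False

safe = truncate_bytes_safe(sample, 3)
```

In PHP the same distinction is roughly the one between the byte-oriented `strlen()`/`substr()` and the encoding-aware `mb_strlen()`/`mb_substr()`; which of the two is appropriate at each call site is what the audit would need to determine.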

And for subsystems like the domain matching, we need to ensure that internationalization works properly across the combination of IDNA encoding and string manipulations.
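As a rough illustration of the domain-matching concern, the following Python sketch normalizes hosts to their ASCII (Punycode) form before comparing them. The helpers `to_ascii_host` and `host_matches` are hypothetical names, and Python's built-in `idna` codec implements IDNA 2003, which is an assumption of this example and not necessarily the behavior Requests uses.

```python
def to_ascii_host(host: str) -> str:
    """Convert a host name to its ASCII-compatible (Punycode) form."""
    return host.encode("idna").decode("ascii")


def host_matches(host: str, domain: str) -> bool:
    """Return True if host equals domain or is a subdomain of it.

    Both sides are normalized first, so the Unicode and Punycode
    spellings of the same name compare as equal.
    """
    h = to_ascii_host(host).lower()
    d = to_ascii_host(domain).lower()
    return h == d or h.endswith("." + d)


ascii_form = to_ascii_host("bücher.example")  # "xn--bcher-kva.example"
same_name = host_matches("xn--bcher-kva.example", "bücher.example")
subdomain = host_matches("www.bücher.example", "bücher.example")
unrelated = host_matches("notbücher.example", "bücher.example")
```

Comparing only after both sides are in the same form avoids mismatches where one string is in Unicode and the other in Punycode, and requiring the leading dot keeps a suffix check from matching an unrelated label.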

schlessera added the Type: testing/chores/QA label Jul 4, 2022

schlessera closed this as completed Jul 4, 2022

jrfnl mentioned this issue Jul 4, 2022

[Task] Audit whether I18n domains are handled correctly everywhere #758

Open
