Test whether non-UTF-8 encoded paths in URLs are left intact #30

Open
tfmorris wants to merge 2 commits into internetarchive:master from tfmorris:19-non-utf-8-url

Conversation

@tfmorris

Closes #19

This PR adds the test that @sebastian-nagel wrote back in 2017, which confirms that the current code works correctly, along with a couple of related tests from the corresponding issue in the Java implementation, commoncrawl/ia-web-commons#6.
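
For readers unfamiliar with the underlying problem, here is a minimal sketch of what "left intact" means for a non-UTF-8 percent-encoded path. It uses only the standard library's `urllib.parse` and is not this project's code or the test added in this PR; the helper names `lossy_roundtrip` and `bytes_roundtrip` and the sample path are hypothetical, chosen for illustration.

```python
from urllib.parse import quote, unquote, unquote_to_bytes


def lossy_roundtrip(path: str) -> str:
    """Decode to str (UTF-8, errors='replace' by default), then re-encode.

    Bytes that are not valid UTF-8 become U+FFFD, so the original
    percent-escape cannot be recovered.
    """
    return quote(unquote(path))


def bytes_roundtrip(path: str) -> str:
    """Decode to raw bytes, then re-encode.

    No character decoding happens, so non-UTF-8 escapes survive unchanged.
    """
    return quote(unquote_to_bytes(path))


# Hypothetical sample: %E9 is 'é' in ISO-8859-1, which is not valid UTF-8.
latin1_path = "/caf%E9"

print(lossy_roundtrip(latin1_path))  # the escape is replaced by the encoding of U+FFFD
print(bytes_roundtrip(latin1_path))  # the escape is preserved
```

The point of the sketch is only that decoding such a path as text is lossy while treating it as bytes is not, which is the behaviour the tests here are meant to pin down.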

Development

Successfully merging this pull request may close these issues.

Keep non-UTF-8 encoded URLs (Python 3)

2 participants