Incorrect normalzation behaviour on character sequence '%e2%80%b3' #160

gh2k · 2014-05-13T11:05:55Z

Specifically, this produces an incorrect result:

1.9.3-p392 :019 > u = Addressable::URI.parse('http://example.org/%e2%80%b3')
 => #<Addressable::URI:0xd005e8 URI:http://example.org/%e2%80%b3> 
1.9.3-p392 :020 > u.normalize!
 => #<Addressable::URI:0xd005e8 URI:http://example.org/%E2%80%B2%E2%80%B2>

Note that the normalized URL no longer matches.

I think this is related to Addressable::IDNA.unicode_normalize_kc

Specifiaclly:

1.9.3-p392 :013 > s = Addressable::URI.unencode('%e2%80%b3')
 => "″" 
1.9.3-p392 :014 > Addressable::IDNA.unicode_normalize_kc(s)
 => "′′"

The output is now two UTF-8 characters, when previously it was one.

The text was updated successfully, but these errors were encountered:

sporkmonger · 2015-02-05T11:26:24Z

This is not a bug. URIs, and particularly IRIs, use Unicode normalization form KC to eliminate visual ambiguities which may result in phishing attacks. NFKC splits that codepoint up to the characters that Addressable is giving you. If this behavior is undesirable for your use-case, you can normalize instead on a component-by-component basis.

gh2k mentioned this issue May 13, 2014

Should be possible to use Ruby's URI implementation instead of Addressable::URI stewartmckee/cobweb#27

Open

jaimeiniesta mentioned this issue Dec 23, 2014

Incorrect normalization of character sequence "%EF%BD%9E" #182

Closed

sporkmonger closed this as completed Feb 5, 2015

sporkmonger added the Rejected label Feb 5, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorrect normalzation behaviour on character sequence '%e2%80%b3' #160

Incorrect normalzation behaviour on character sequence '%e2%80%b3' #160

gh2k commented May 13, 2014

sporkmonger commented Feb 5, 2015

Incorrect normalzation behaviour on character sequence '%e2%80%b3' #160

Incorrect normalzation behaviour on character sequence '%e2%80%b3' #160

Comments

gh2k commented May 13, 2014

sporkmonger commented Feb 5, 2015