Non capitalized HTTP headers #524

ninoseki · 2019-02-02T05:49:05Z

Sorry for a dumb question.

I have to work with an API server which deals HTTP headers in case-sensitive manner.
(Yep I know that is a RFC violation)

Is there a good way to send non-capitalized HTTP headers?

tarcieri · 2019-02-02T17:25:33Z

I believe this is a dupe of #337, but I'll let @ixti confirm that.

This gem canonicalizes both request and response headers.

We've discussed disabling the canonicalization for requests, which I think is the most straightforward solution. We could also make it configurable on a request and response basis, e.g.:

HTTP.canonicalize_headers(false)
HTTP.canonicalize_headers(:request)
HTTP.canonicalize_headers(:response)
HTTP.canonicailze_headers(:all)

...or thereabouts.

ixti · 2019-02-03T01:23:34Z

Yeah it's a dupe. And I'm going to work on this soon. Finally have an idea in my head how to provide predictable API without intorducing public API changes. Will layout the idea as code soon so that discussion will be started :D But in few words, my idea is to change Headers class to keep header names as-is but using normalized names for headers lookup:

# imagine repsonse was sent with headers:
# foo_bar: 123
response.headers["Foo-Bar"] # => ["123"]
response.headers["foo_bar"] # => ["123"]
response.headers.keys # => ["foo_bar"]

When passing headers, that's pretty much the only public API change that will be needed:

Fail if given header name is neither String nor Symbol
Pre-normalize header name if it's given as Symbol

tarcieri · 2019-02-03T16:52:37Z

@ixti neat! SGTM 👍

brasic · 2019-03-11T19:32:47Z

Hi! Just came across this issue while checking to see if anyone had run into the problem we are currently dealing with. An API we need to communicate with requires a particular header be specified with an underscore in the header name. It's otherwise case-insensitive. HTTP.rb makes it impossible to pass a header in an underscore because of this line, which transforms it to -:

http/lib/http/headers.rb

Line 206 in e31d0a5

normalized = name.split(/[\-_]/).each(&:capitalize!).join("-")

I would argue that the behavior of HTTP::Headers#normalize_header violates RFC 7230, which allows the _ character to be part of a header name. We can monkeypatch this for now on our side but are there any thoughts on making the canonicalization algorithm spec-compliant? I don't know of any case-insensitivity scheme that considers _ and - to be equivalent. I'd be happy to open a PR to remove _ from the split regex but that would likely require a major version bump (looking at the specs, lots of code depends on _ canonicalizing as -).

ixti · 2019-03-11T21:24:32Z

@brasic yes - I'm working on refactoring HTTP::Headers completely so that it will llow to pass any RFC compliant header (with or without normalization).

Hendrione-Moka · 2019-08-27T00:21:50Z

any update for this? I still got the issue.

ixti · 2019-08-27T16:15:24Z

I had no time to work on this yet.

joshuaflanagan · 2019-12-13T01:13:16Z

Just ran into this again, dealing with an API that has case-sensitive headers. That may be "wrong" on their side, but it is what it is, and switching http libraries is much easier than getting an external API to change their behavior.
Do you have an in-process branch? I'd be happy to take a stab at fleshing it out. Would rather do that than switch libraries.

tarcieri · 2019-12-13T01:17:27Z

@joshuaflanagan I think if you implemented the change @ixti suggested here it'd be accepted, and shouldn't be too difficult: #524 (comment)

The original behavior was to normalize all header names so that they were broken up into words, delimited by `-` or `_`, capitalize each word, and then join the words together with a `_`. This made it impossible to make a request with an underscore in the header name, or with a different casing (ex: all caps). However, the normalized name made it possible to access (or delete) headers, without having to know the exact casing. The new behavior is based on the following rules (as specified in httprb#524 (comment)) 1) Fail if a header name is not specified as a String or Symbol 2) If the header name is specified as a Symbol, normalize it when writing it in a request. If the header name is specified as a String, preserve it as-is when writing it in a request. 3) Allow lookup of any header using the normalized form of the name I implemented this behavior by storing three elements for each header value: 1) normalized header name 2) header name as it will be written in a request 3) header value Element 2 is the new addition. I considered just storing the header value as it would be written, and only doing normalization during lookup, but it seemed wasteful to potentially normalize the same value over and over when searching through the list for various lookups. This way we only normalize each name once, and can continue to use that value for lookups. However, whenever asked for the contents (ex: via `each` or `keys`) we return the new, non-normalized name. Fixes: httprb#524

The original behavior was to normalize all header names so that they were broken up into words, delimited by `-` or `_`, capitalize each word, and then join the words together with a `-`. This made it impossible to make a request with an underscore in the header name, or with a different casing (ex: all caps). However, the normalized name made it possible to access (or delete) headers, without having to know the exact casing. The new behavior is based on the following rules (as specified in httprb#524 (comment)) 1) Fail if a header name is not specified as a String or Symbol 2) If the header name is specified as a Symbol, normalize it when writing it in a request. If the header name is specified as a String, preserve it as-is when writing it in a request. 3) Allow lookup of any header using the normalized form of the name I implemented this behavior by storing three elements for each header value: 1) normalized header name 2) header name as it will be written in a request 3) header value Element 2 is the new addition. I considered just storing the header value as it would be written, and only doing normalization during lookup, but it seemed wasteful to potentially normalize the same value over and over when searching through the list for various lookups. This way we only normalize each name once, and can continue to use that value for lookups. However, whenever asked for the contents (ex: via `each` or `keys`) we return the new, non-normalized name. Fixes: httprb#524

The original behavior was to normalize all header names so that they were broken up into words, delimited by `-` or `_`, capitalize each word, and then join the words together with a `-`. This made it impossible to make a request with an underscore in the header name, or with a different casing (ex: all caps). However, the normalized name made it possible to access (or delete) headers, without having to know the exact casing. The new behavior is based on the following rules (as specified in #524 (comment)) 1) Fail if a header name is not specified as a String or Symbol 2) If the header name is specified as a Symbol, normalize it when writing it in a request. If the header name is specified as a String, preserve it as-is when writing it in a request. 3) Allow lookup of any header using the normalized form of the name I implemented this behavior by storing three elements for each header value: 1) normalized header name 2) header name as it will be written in a request 3) header value Element 2 is the new addition. I considered just storing the header value as it would be written, and only doing normalization during lookup, but it seemed wasteful to potentially normalize the same value over and over when searching through the list for various lookups. This way we only normalize each name once, and can continue to use that value for lookups. However, whenever asked for the contents (ex: via `each` or `keys`) we return the new, non-normalized name. Fixes: #524

joshuaflanagan mentioned this issue Dec 15, 2019

Preserve header casing as specified #576

Merged

ixti closed this as completed in #576 Dec 19, 2019

tarcieri mentioned this issue May 13, 2021

v5.0.0 #660

Merged

cben mentioned this issue Oct 31, 2022

Update HTTP gem requirement to allow version 5 ManageIQ/kubeclient#571

Merged

jeffgran-dox mentioned this issue Dec 14, 2022

Allow http dependency v5+ doximity/oauth2c#15

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non capitalized HTTP headers #524

Non capitalized HTTP headers #524

ninoseki commented Feb 2, 2019

tarcieri commented Feb 2, 2019 •

edited

Loading

ixti commented Feb 3, 2019 •

edited

Loading

tarcieri commented Feb 3, 2019

brasic commented Mar 11, 2019 •

edited

Loading

ixti commented Mar 11, 2019

Hendrione-Moka commented Aug 27, 2019

ixti commented Aug 27, 2019

joshuaflanagan commented Dec 13, 2019

tarcieri commented Dec 13, 2019

Non capitalized HTTP headers #524

Non capitalized HTTP headers #524

Comments

ninoseki commented Feb 2, 2019

tarcieri commented Feb 2, 2019 • edited Loading

ixti commented Feb 3, 2019 • edited Loading

tarcieri commented Feb 3, 2019

brasic commented Mar 11, 2019 • edited Loading

ixti commented Mar 11, 2019

Hendrione-Moka commented Aug 27, 2019

ixti commented Aug 27, 2019

joshuaflanagan commented Dec 13, 2019

tarcieri commented Dec 13, 2019

tarcieri commented Feb 2, 2019 •

edited

Loading

ixti commented Feb 3, 2019 •

edited

Loading

brasic commented Mar 11, 2019 •

edited

Loading