Security considerations: `_` in field names #843

mnot · 2021-05-19T05:02:25Z

Many implementations normalise - to _ when passing field names to applications, for historical reasons. This is a vector for smuggling attacks; see here.

Suggestions:

Consistently recommend that _ not be used in field names.
When an implementation normalises - to _, it MUST remove fields whose names contain _; it MAY make them available through an alternative method.

The text was updated successfully, but these errors were encountered:

reschke · 2021-05-19T06:52:28Z

AFAIR, we have discussed this in the past, and decided not to do this. Has anything changed since then?

mnot · 2021-05-19T06:54:41Z

These attacks have become pretty prominent; I don't think wishing them away is helping. Do you have a ref to the previous discussion?

reschke · 2021-05-19T07:20:48Z

There is #30 - I also think this was discussed during the last AUTH48, but I may be wrong.

(and yes, I realize that recommended char set and security considerations are not the same thing)

reschke · 2021-05-19T09:48:49Z

Consistently recommend that _ not be used in field names.

We do that already for new field names; for existing names, there's little one can do, right?

When an implementation normalises - to _, it MUST remove fields whose names contain _; it MAY make them available through an alternative method.

IIUC, a common attack is to override messaging related fields such as Content-Length, Content-Encoding and Transfer-Encoding. For fields like these, wouldn't it be better to recommend dropping the underscore variant?

royfielding · 2021-05-19T18:46:27Z

Suggestions:

1. Consistently recommend that `_` not be used in field names.

That would have no effect on such attacks. Recommending that neither hyphen nor underscore be used in the names of sensitive DIY header field passing would work, as would recommending that applications choosing to do sensitive DIY field passing using header fields ought to be prepared for normalization of both (and a dozen other weird characters, unicode, CTLs, etc.). But, to be clear, that is not about HTTP.

2. When an implementation normalises `-` to `_`, it MUST remove fields whose names contain `_`; it MAY make them available through an alternative method.

These are not implementations of HTTP. When we are talking about gateways to other protocols (like CGI), then yes. There is no historical normalization that occurs from HTTP-to-HTTP. There is a spec for HTTP-to-CGI that does normalization because of restrictions in env variables, and that's what needs updating. The problem comes when an implementation that used to be behind a CGI interface is moved to be behind an HTTP interface, and the damn kids think that should work without any changes in the application code. Well, app platforms often get this wrong. It is not an HTTP issue.

At most, we can warn application platforms that HTTP != CGI and that trying to "handle" an HTTP message as if it were received by CGI requires that the platforms implement CGI normalization properly.

An HTTP gateway sending a message to an HTTP application platform has no way of knowing that the receiving implementation isn't expecting HTTP, and there's no way in hell that we are going to castrate HTTP just in case it happens to be received by a poor app implementation. HTTP is not CGI.

mnot · 2021-05-20T04:43:17Z

We do that already for new field names; for existing names, there's little one can do, right?

We don't explicitly mention this issue in Considerations for New Field Names

IIUC, a common attack is to override messaging related fields such as Content-Length, Content-Encoding and Transfer-Encoding. For fields like these, wouldn't it be better to recommend dropping the underscore variant?

Yes.

That would have no effect on such attacks.

On its own, no - but these measures are designed to work together.

These are not implementations of HTTP [...] It is not an HTTP issue.

Given how widespread these practices are, I don't think it's responsible for us to take that stance.

At most, we can warn application platforms that HTTP != CGI and that trying to "handle" an HTTP message as if it were received by CGI requires that the platforms implement CGI normalization properly.

I'm fine with avoiding normative language here, but I do think we need to go into the details. I'll do a PR.

Fixes #843

mnot added the semantics label May 19, 2021

mnot added a commit that referenced this issue May 20, 2021

Application handling of field names

ede4b40

Fixes #843

mnot mentioned this issue May 20, 2021

Application handling of field names #844

Merged

mnot added the has-proposal label May 20, 2021

reschke closed this as completed in #844 May 27, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Security considerations: `_` in field names #843

Security considerations: `_` in field names #843

mnot commented May 19, 2021

reschke commented May 19, 2021

mnot commented May 19, 2021

reschke commented May 19, 2021 •

edited

Loading

reschke commented May 19, 2021

royfielding commented May 19, 2021

mnot commented May 20, 2021

Security considerations: _ in field names #843

Security considerations: _ in field names #843

Comments

mnot commented May 19, 2021

reschke commented May 19, 2021

mnot commented May 19, 2021

reschke commented May 19, 2021 • edited Loading

reschke commented May 19, 2021

royfielding commented May 19, 2021

mnot commented May 20, 2021

Security considerations: `_` in field names #843

Security considerations: `_` in field names #843

reschke commented May 19, 2021 •

edited

Loading