conflicting order of operations when applying Content-Encoding and Content-Type headers #2868

karenetheridge · 2022-01-30T23:15:50Z

The specification describes how to handle requests/responses with both a Content-Type and Content-Encoding header:

" JSON Schema offers a contentEncoding keyword, which may be used to specify the Content-Encoding for the schema. ... The encoding specified by the contentEncoding keyword is independent of an encoding specified by the Content-Type header in the request or response or metadata of a multipart body – when both are present, the encoding specified in the contentEncoding is applied first and then the encoding specified in the Content-Type header."
--- https://spec.openapis.org/oas/v3.1.0#considerations-for-file-uploads

(emphasis mine)

However, this ordering is the reverse of how an OpenAPI validator would operate: the contentEncoding keyword is hidden inside the schema, at a lower level than the media-type property.

The way around this would be for the validator to peek inside the schema for a top-level contentEncoding keyword, and use it to decode the content before applying the media-type decoder. That's pretty ugly IMO; schemas ought to be opaque. It would be better to bring this data up into the media-type object itself, or simply to decode the content automatically according to the Content-Encoding header before applying the media type in the Content-Type header. (Is it really a desired feature that the OpenAPI document must specify the Content-Encoding that is used? What if multiple different encodings are supported, e.g. gzip compression and brotli compression: how would that be indicated?)
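A minimal sketch of the second option (decoding by header before the media type is applied), in Python. The `DECODERS` table and the `decode_body` and `parse_json_body` names are hypothetical, not part of any OpenAPI tooling. It assumes the HTTP rule that Content-Encoding lists codings in the order they were applied, so they are undone in reverse. Brotli is left out because it is not in the standard library.

```python
from __future__ import annotations

import gzip
import json
import zlib

# Hypothetical registry of decoders, keyed by HTTP content-coding name.
# HTTP "deflate" is the zlib format, which zlib.decompress handles.
DECODERS = {
    "gzip": gzip.decompress,
    "x-gzip": gzip.decompress,
    "deflate": zlib.decompress,
    "identity": lambda data: data,
}


def decode_body(raw: bytes, content_encoding: str | None) -> bytes:
    """Undo every coding named in a Content-Encoding header value."""
    if not content_encoding:
        return raw
    codings = [c.strip().lower() for c in content_encoding.split(",") if c.strip()]
    # The header lists codings in the order applied, so decode last-first.
    for coding in reversed(codings):
        decoder = DECODERS.get(coding)
        if decoder is None:
            raise ValueError(f"unsupported content coding: {coding}")
        raw = decoder(raw)
    return raw


def parse_json_body(raw: bytes, headers: dict[str, str]) -> object:
    """Decode by Content-Encoding first, then parse as application/json."""
    body = decode_body(raw, headers.get("Content-Encoding"))
    return json.loads(body)
```

With this shape the schema stays opaque: the decoder is chosen from the message header alone, and supporting another coding means adding one entry to the table.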

In practice, how is a desired content encoding indicated in a document in order to enable proper validation of the request/response body content? I didn't find any hints on the Swagger site.

handrews · 2022-09-26T15:51:08Z

@karenetheridge There are some confusing typos in this area. (Really, I need to go back over the whole thing; it was changed close to the end of the 3.1 process, and we went through several revisions trying to get it right.) See #2476 and #2477 for more info that might help.

Note that:

- JSON Schema contentEncoding is about string-encoding binary data (e.g. quoted-printable, base64, base64url, hex, etc.)
- HTTP Content-Encoding is about managing the size of the payload (e.g. gzip, compress, etc.)
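To make the two layers concrete, here is a hedged Python sketch. The `read_upload` function and the `file` property name are hypothetical. It assumes a request sent with Content-Encoding: gzip and Content-Type: application/json, whose schema marks the `file` string with contentEncoding: base64.

```python
import base64
import gzip
import json


def read_upload(wire_bytes: bytes) -> bytes:
    """Peel the layers in the order a receiver meets them."""
    # Layer 1, HTTP Content-Encoding: gzip shrinks the whole payload.
    json_bytes = gzip.decompress(wire_bytes)
    # Layer 2, Content-Type: application/json gives the body its structure.
    document = json.loads(json_bytes)
    # Layer 3, JSON Schema contentEncoding: base64 applies to one string
    # value inside the parsed document (the hypothetical "file" property).
    return base64.b64decode(document["file"])


# Building a matching request body applies the same layers in reverse.
binary = b"\x00\x01binary"
document = {"file": base64.b64encode(binary).decode("ascii")}
wire = gzip.compress(json.dumps(document).encode("utf-8"))
restored = read_upload(wire)
```

The HTTP coding wraps the entire body, while contentEncoding only applies once the body has been parsed and a particular string has been reached, so the two cannot be swapped.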

Somewhere in the text, there's a stray Content-Type that should be one of the encodings, but I don't recall which one.

handrews · 2023-03-01T01:54:05Z

We should probably clarify this in a patch release. And also find that stray Content-Type error I mentioned above.

karenetheridge · 2023-10-19T22:23:15Z

I've just run into this again, as I'm trying to describe and validate requests with Content-Encoding: gzip and Content-Type: application/json. Obviously we need to gunzip the body first before we can start to access its JSON-encoded content.

handrews added 3.1.patch and removed 3.0.4 labels Mar 1, 2023

handrews added this to the v3.1.1 milestone Jan 27, 2024

handrews added the "clarification" (requests to clarify, but not change, part of the spec) and "media and encoding" (issues regarding media type support and how to encode data, outside of query/path params) labels and removed the 3.1.patch label Jan 27, 2024

handrews self-assigned this Apr 20, 2024

handrews linked a pull request Apr 20, 2024 that will close this issue: Clarify how to model binary data in 3.1 #3727 (open)
