
compress/gzip: Reader unable to parse headers with long comments #14639

Open
dsnet opened this issue Mar 4, 2016 · 7 comments
Labels
NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made.

Comments

@dsnet
Member

dsnet commented Mar 4, 2016

Using go1.6

RFC 1952 for gzip does not specify a length for the comment and name fields.

If FCOMMENT is set, a zero-terminated file comment is
present. This comment is not interpreted; it is only
intended for human consumption. The comment must consist of
ISO 8859-1 (LATIN-1) characters. Line breaks should be
denoted by a single line feed character (10 decimal).

The current implementation is inconsistent:

  • gzip.Writer permits the writing of any length comment/name string in gzip.Header.
  • gzip.Reader fails to read any comment/name string in gzip.Header longer than 511 bytes.

Playground example: https://play.golang.org/p/Zvjf8Q7jXe
Change 512 to 511 in the example, and it works again.
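The asymmetry can be reproduced with a self-contained variant of the playground snippet (the 511/512 boundary comes from the reader's internal fixed-size header buffer; tryComment is a name chosen here, not stdlib API):

```go
package main

import (
	"bytes"
	"compress/gzip"
	"fmt"
	"strings"
)

// tryComment writes a gzip stream whose header Comment field is n bytes
// long, then attempts to open it with gzip.NewReader, returning any error.
func tryComment(n int) error {
	var buf bytes.Buffer
	zw := gzip.NewWriter(&buf)
	zw.Comment = strings.Repeat("a", n) // the Writer imposes no length limit
	if _, err := zw.Write([]byte("hello")); err != nil {
		return err
	}
	if err := zw.Close(); err != nil {
		return err
	}
	_, err := gzip.NewReader(&buf)
	return err
}

func main() {
	fmt.Println(tryComment(511)) // <nil>
	fmt.Println(tryComment(512)) // gzip: invalid header
}
```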

This causes issues when reading gzip files produced by GZinga, which emits syntactically valid gzip files but abuses the comment field to store metadata.

Update: I have no intention of fixing this unless someone needs this functionality. Any fix will need to be careful not to introduce a potential vector for DoS attacks, since gzip is a common Transfer-Encoding in HTTP. We don't want an arbitrarily long comment to cause a server to allocate an unbounded amount of memory.

@dsnet
Member Author

dsnet commented Apr 22, 2017

Bumping up milestone since #20083 provides good evidence that this is a real issue.

@dsnet dsnet modified the milestones: Go1.10, Unplanned Apr 22, 2017
@gopherbot

Change https://golang.org/cl/53637 mentions this issue: compress/gzip: permit parsing of GZIP files with long header fields

@dsnet
Member Author

dsnet commented Nov 8, 2017

Pushing to Go1.11 milestone... but I think a more general API for resource limitation might be the better approach. See #20169 for some discussion.

@dsnet dsnet modified the milestones: Go1.10, Go1.11 Nov 8, 2017
@dsnet dsnet modified the milestones: Go1.11, Unplanned Feb 16, 2018
@philippfrank

@dsnet This is still an issue. Did it fall through the cracks?

@dsnet
Member Author

dsnet commented Sep 5, 2020

Running the snippet on Go1.15 clearly shows it's still an issue.

Huh, apparently I had a fix for this that I never submitted: https://golang.org/cl/53637

@philippfrank

@dsnet Okay, any chance you can push that forward? I am using gzip.NewWriter() to compress arbitrary text files (up to 1GB in size) but am unable to decompress them with the same gzip.NewReader().

For anyone else hitting this: a pragmatic solution might be to copy gunzip.go and raise the maximum buffer size manually. Several other gzip packages out there have the exact same issue (because they copied the stdlib code).

@dsnet
Member Author

dsnet commented Sep 7, 2020

Having reviewed my CL, the reason I didn't push it through is that this feature would be a trivial avenue for denial-of-service attacks, where a malicious GZIP file could cause a server to allocate arbitrary amounts of memory.

We have several options:

  1. We could immediately permit parsing of small headers (I proposed up to 32KiB to match the window size of the DEFLATE algorithm itself). However, this won't help use-cases that abuse the comment field to store machine-parsable information where it can easily exceed 32KiB.
  2. Add a new Reader.MaxFieldSize method so that the user can opt-in to parsing larger headers. Use of this option would still be a bit odd, since you would have to do something like:
var zr gzip.Reader
zr.MaxFieldSize(math.MaxInt32)
if err := zr.Reset(r); err != nil {
    ... // handle err
}
... // make use of zr.Header.Comment

Since this would probably require an API change, I'm adding the NeedsDecision label.
/cc @FiloSottile since he's probably interested in any possible DoS-related issues.

@dsnet dsnet added the NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made. label Sep 7, 2020
@rsc rsc unassigned dsnet Jun 23, 2022
4 participants