Fix issue 20239 - chameleon zip-file should be rejected by std.zip #7198

ghost · 2019-09-25T19:46:15Z

The lines around 640 where wrongly indented. I corrected them, although this is not directly related to the bug fix. Hope, this doesn't disturb.

dlang-bot · 2019-09-25T19:46:17Z

Thanks for your pull request and interest in making D better, @crocopaw! We are looking forward to reviewing it, and you should be hearing from a maintainer soon.
Please verify that your PR follows this checklist:

My PR is fully covered with tests (you can see the coverage diff by visiting the details link of the codecov check)
My PR is as minimal as possible (smaller, focused PRs are easier to review than big ones)
I have provided a detailed rationale explaining my changes
New or modified functions have Ddoc comments (with Params: and Returns:)

Please see CONTRIBUTING.md for more information.

If you have addressed all reviews or aren't sure how to proceed, don't hesitate to ping us with a simple comment.

Bugzilla references

Auto-close	Bugzilla	Severity	Description
✓	20239	normal	chameleon zip-file should be rejected by std.zip

Testing this PR locally

If you don't have a local development environment setup, you can use Digger to test this PR:

dub fetch digger
dub run digger -- build "master + phobos#7198"

CyberShadow

Finding and reading a directory from the end seems like an optimization attempt for slow disk devices. But, std.zip already has the data in a void[] buffer. In this case, wouldn't it be simpler and more correct to simply walk all archive members from the beginning to the end?

std/zip.d

ghost · 2019-09-26T09:09:58Z

Finding and reading a directory from the end seems like an optimization attempt for slow disk devices. But, std.zip already has the data in a void[] buffer. In this case, wouldn't it be simpler and more correct to simply walk all archive members from the beginning to the end?

That was my first idea too. But when I dived more deeply into the data format of zip files it was soon clear, that this is impossible. The chunks inside a zip file do not need to be alligned (nor sorted). There can be other stuff before (think of self extracting zip files) and between (e.g. removed files from the archive, where only the entry in the end record has been deleted). Even overlapping is not prohibitted by the specs. This gives a lot of possibilities for malware. Avoiding being trapped by malware makes it necessary to start at the end of central directory record.

Unfortunately zip allowes for a comment (of apriori unknown size) after that record, which again gives leeway for malware (see bug report). I think, best one can do is to scan these 16k of bytes. While the old approach stopped when it found one correct entry, the new version looks if there is another correct entry and rejects the whole zip file in this case. Having more than one correct entry is a strong indicator of having to do with malware, because in normal zip files, there should never be comments containing x05/x06 bytes. And the chances, that the data before the intended record form a valid second entry is almost zero too, but in rare circumstances possible.

And one more reason to start at the end: Although the whole file is kept in memory at the moment, I think it might be reasonable to improve this later by replacing the 'void[] buffer' with a random-access-range. This will speed things up in some circumstances (think of a large zip-archive, where only few small files need to be retrieved and therefore only part of the archive needs to be read from file).

CyberShadow

Thanks for the detailed explanation.

Please address the constness nit. LGTM otherwise.

ghost · 2019-09-26T09:38:16Z

I moved the problematic enum back into the code. I'll cope with those magic constants in a separate PR.

ghost · 2019-09-27T12:34:34Z

See PR #7201 for a solution of the problem with the enum.

ghost · 2019-10-06T13:18:59Z

Resolved merge conflicts with PR #7201.

ghost requested a review from CyberShadow as a code owner September 25, 2019 19:46

dlang-bot added the Bug Fix label Sep 25, 2019

CyberShadow reviewed Sep 25, 2019

View reviewed changes

std/zip.d Outdated Show resolved Hide resolved

Geod24 reviewed Sep 26, 2019

View reviewed changes

std/zip.d Outdated Show resolved Hide resolved

std/zip.d Outdated Show resolved Hide resolved

CyberShadow approved these changes Sep 26, 2019

View reviewed changes

Fix issue 20239 - chameleon zip-file should be rejected by std.zip

d750324

thewilsonator added the auto-merge label Oct 7, 2019

dlang-bot merged commit 6a1a06c into dlang:master Oct 7, 2019

ghost mentioned this pull request Oct 10, 2019

Fix issue 20027 - std.zip susceptible to zip malware attacks #7223

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix issue 20239 - chameleon zip-file should be rejected by std.zip #7198

Fix issue 20239 - chameleon zip-file should be rejected by std.zip #7198

ghost commented Sep 25, 2019

dlang-bot commented Sep 25, 2019

CyberShadow left a comment

ghost commented Sep 26, 2019 •

edited by ghost

CyberShadow left a comment

ghost commented Sep 26, 2019

ghost commented Sep 27, 2019

ghost commented Oct 6, 2019

Fix issue 20239 - chameleon zip-file should be rejected by std.zip #7198

Fix issue 20239 - chameleon zip-file should be rejected by std.zip #7198

Conversation

ghost commented Sep 25, 2019

dlang-bot commented Sep 25, 2019

Bugzilla references

Testing this PR locally

CyberShadow left a comment

Choose a reason for hiding this comment

ghost commented Sep 26, 2019 • edited by ghost

CyberShadow left a comment

Choose a reason for hiding this comment

ghost commented Sep 26, 2019

ghost commented Sep 27, 2019

ghost commented Oct 6, 2019

ghost commented Sep 26, 2019 •

edited by ghost