Get the correct fileName from extra filed when decodeStrings is false #113

fpsqdb · 2019-11-15T10:32:42Z

This commit add option decodeStrings adds support return fileName or comment as buffer and fixes issue #42, but the fileName is not a buffer when decodeStrings is false.
This PR makes fileName is a buffer when decodeStrings is false.

…lse`

thejoshwolfe · 2024-02-16T14:54:59Z

Sorry for the delayed response. I'm not sure I understand the purpose or intended effect of this PR. Are you trying to bypass the security validation but still support reading the Info-ZIP Unicode Path Extra Field? If that's the case, what part of the validation is causing issues for you?

fpsqdb · 2024-02-18T01:12:46Z

@thejoshwolfe Sorry, the commit link and related issue is wrrong, i have modified my comment.

thejoshwolfe · 2024-02-18T12:03:04Z

Why do you want an undecoded buffer for the file name?

fpsqdb · 2024-02-19T01:10:27Z

Set decodeStrings to false to decode the buffer by myself.
And the code implementation does not match the document description.
https://github.com/thejoshwolfe/yauzl#filename

If decodeStrings is false (see open()), this field is the undecoded Buffer instead of a decoded String.

thejoshwolfe · 2024-02-19T01:37:55Z

I've just released yauzl 3.1.0, which includes support for decoding file names in UTF-8 without the safety validation. But it sounds like that's not actually what you're looking for.

It sounds like what you're looking for is:

Ignore General Purpose Bit 11.
Support finding the Info-ZIP Unicode Path Extra Field in the extra fields, and perform the version check and crc32 verification as required, but don't convert the Buffer into a string using UTF-8.
return either the basic fileName as a Buffer or the override filename from the Info-ZIP Unicode Path Extra Field as a Buffer if present.

Is that what you want? If so, ... I'm very curious why. Have you found zip files using the Info-ZIP Unicode Path Extra Field that use an encoding other than UTF-8? Or are you curious what the bytes were before the UTF-8 decoding? If that's all you want, you should be able to simply re-encode the value into UTF-8 (UTF-8 is bijective for non-error code points).

In any case, what you're looking for can be accomplished by copying the logic in yauzl, which is now located in getFileNameLowLevel(). It's only about 30 lines of code.

Unless I can understand the use case for this PR, I can't properly support it.

thejoshwolfe · 2024-02-19T01:40:08Z

And the code implementation does not match the document description.

What's the discrepancy that you're seeing? If you're talking about how the undecoded Buffer is always the basic name and never the one from the Info-ZIP Unicode Path Extra Field, that's mentioned in the very next sentence in the docs. Maybe that could be communicated more clearly.

fpsqdb · 2024-02-19T01:53:18Z

The latest version has fixed this problem

fpsqdb added 2 commits November 15, 2019 18:21

get the correct fileName from extra filed when decodeStrings is `fa…

7f0d766

…lse`

Merge branch 'thejoshwolfe:master' into master

a005c62

fpsqdb closed this Feb 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get the correct fileName from extra filed when decodeStrings is false #113

Get the correct fileName from extra filed when decodeStrings is false #113

fpsqdb commented Nov 15, 2019 •

edited

thejoshwolfe commented Feb 16, 2024

fpsqdb commented Feb 18, 2024

thejoshwolfe commented Feb 18, 2024

fpsqdb commented Feb 19, 2024

thejoshwolfe commented Feb 19, 2024

thejoshwolfe commented Feb 19, 2024

fpsqdb commented Feb 19, 2024

Get the correct fileName from extra filed when decodeStrings is false #113

Get the correct fileName from extra filed when decodeStrings is false #113

Conversation

fpsqdb commented Nov 15, 2019 • edited

thejoshwolfe commented Feb 16, 2024

fpsqdb commented Feb 18, 2024

thejoshwolfe commented Feb 18, 2024

fpsqdb commented Feb 19, 2024

thejoshwolfe commented Feb 19, 2024

thejoshwolfe commented Feb 19, 2024

fpsqdb commented Feb 19, 2024

fpsqdb commented Nov 15, 2019 •

edited