Unpacking zip, encoding corrupts filenames #12
Comments
No offence, but Entropy does it well. |
Thanks for the tip, will look at it. |
@tkisme can you share a file to test? |
Maybe use p7zip-natspec or unzip-iconv for encoding autodetection? |
I just deleted one of your comments because it contained a file with adult images. For the battle-net issue, here is the right issue, which already contains a test file: #2. For the zip encoding, here is an all-ages test file, found here. Just note that the bundled macOS compression utility also fails to extract it with the correct encoding. |
Just as a test, I opened the Unicode test.zip file, which you can find inside the test file posted earlier. Entropy did not extract the files with the Chinese encoding; it omitted them. Also, the adult file you posted earlier came out unusable with Entropy: there were no images or text file after extraction. |
Here is another test file. This one is encoded in Greek DOS (code page 737) and was found here. |
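For anyone who wants to see why a file like this comes out scrambled: when a zip entry does not set the UTF-8 flag, extractors commonly assume the name is CP437, so a name stored in code page 737 turns into gibberish. Below is a minimal Python sketch of that effect and of how to undo it once the real code page is known. The helper name `redecode_zip_name` and the sample file name are made up for illustration; this is not how Keka, p7zip, or unzip handle it internally.

```python
def redecode_zip_name(garbled: str, actual_encoding: str = "cp737") -> str:
    """Recover a name that was decoded as CP437 but stored in another code page."""
    # Encoding back to CP437 restores the raw bytes stored in the archive;
    # decoding those bytes with the real code page yields the intended name.
    return garbled.encode("cp437").decode(actual_encoding)


# Simulate what a CP437-assuming extractor would show for a Greek DOS (737) name.
original = "Ελληνικά.txt"  # hypothetical file name
garbled = original.encode("cp737").decode("cp437")
recovered = redecode_zip_name(garbled, "cp737")
```

The round trip only works because CP437 maps every byte value to a distinct character, so no information is lost in the wrong decoding step. The hard part, which this sketch skips entirely, is guessing the real code page.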
Some useful info here: https://sourceforge.net/p/p7zip/discussion/383044/thread/3d213124 No quick fix though. |
Maybe adding iconv is enough? Can't wait to test. |
An official fix could take some time, but as I pasted above, there are already third-party patches that add encoding autodetection support to unzip and p7zip :-) |
I think Keka is the replacement for the default 'Archive Utility.app'. Using different applications to handle different archives is just annoying. |
I don't get the point. Keka behaves much like the built-in Archive Utility (double-click and decompress); I did not see anything annoying. |
@maz-1, which natspec library did you use to compile p7zip? Can you share those files? |
Here is the natspec patch for p7zip 15.14.1 @tkisme |
@maz-1 I've already downloaded both the patch and the library files, and fixed the patch, but I'm unable to link p7zip to the natspec library. How did you do it? In case you need it, here is the patch: natspec_p7zip1602.patch.zip |
I made a Homebrew formula to build natspec: download natspec.rb and run "brew install natspec.rb" to install the library. But the encoding detection is still broken (15.14.1 has the same problem). |
Another file that gets broken, Russian content shared by Колобок via mail: IIS.zip |
We should check the encoding changes in https://github.com/ethereon/p7zip-hybrid |
This one should be fixed in 1.1.6!!! |
I am researching whether I could replace 'The Unarchiver' with this great app. My only concern is zip file name encoding. IMHO, automatic encoding detection does not work 100% of the time, so manual selection should be provided as a last resort. No offence intended, but is it implemented? |
It was implemented in 1.1.6, @ffffwh. It was inspired by the way The Unarchiver does it, so it will feel very (maybe too) familiar. You can test the feature with the example file found in the comment #12 (comment). |
It's happening to the names of files and folders within zip archives (scrambled when viewing or unzipping them), and to the content of some text files. Other files seem to fare better even though their names are gibberish.
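As a rough illustration of what "manual selection as a last resort" can look like, here is a small Python sketch that lists how an entry name would read under several candidate encodings so a user could pick the right one. It is only a sketch under stated assumptions: `candidate_names`, the `"unknown"` key, and the encoding list are hypothetical, and it does not reflect how Keka 1.1.6 or The Unarchiver actually implement the feature. It relies on Python's `zipfile` decoding unflagged names as CP437 by default.

```python
import zipfile

UTF8_FLAG = 0x800  # general purpose bit 11: the name is stored as UTF-8


def candidate_names(info: zipfile.ZipInfo, encodings: list[str]) -> dict[str, str]:
    """Map each plausible encoding to the entry name it would produce."""
    if info.flag_bits & UTF8_FLAG:
        # The archive declares UTF-8, so there is nothing to guess.
        return {"utf-8": info.filename}
    try:
        # Undo the default CP437 decoding to get the stored bytes back.
        raw = info.filename.encode("cp437")
    except UnicodeEncodeError:
        # The name was not decoded as CP437; keep it as-is (hypothetical key).
        return {"unknown": info.filename}
    candidates = {}
    for enc in encodings:
        try:
            candidates[enc] = raw.decode(enc)
        except UnicodeDecodeError:
            continue  # this encoding cannot represent the stored bytes
    return candidates


# Simulate a legacy entry whose name was stored in CP866 (Russian DOS).
legacy = zipfile.ZipInfo("Привет.txt".encode("cp866").decode("cp437"))
choices = candidate_names(legacy, ["cp866", "cp737", "gbk"])
```

Several encodings may decode without error and still give nonsense, which is exactly why the final choice is left to the user here instead of being automated.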