lrzip-bz2 performs worse than just bz2 for large json file #51
Comments
BZIP operates much better on small files. Lrzip on large ones. There are ... Interestingly, if you tar up the kernel source tree, you get much better ...
On 05/14/2016 02:41 PM, phiresky wrote:
Peter Hyman
The JSON files are very similar; they can basically be seen as a single file (same file structure, same words, just different order). The result stays the same when instead of tarring them up using
"Should this happen?" - on rare occasions the underlying compression does a better job on the file due to the tightly packed redundancy, and separating out the dictionary from the matches is counterproductive. The only thing lrzip offers for this particular archive is much faster compression and decompression.
See here:
Using just bzip2, the 880MB file compresses to 11MB. Using `lrzip -b`, it is 21MB.
How can this happen / should this happen?
Here is the file (.zip so github accepts the upload):
files.tar.zip
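For anyone who wants to reproduce the size comparison on their own data, here is a rough sketch, not the script used for the numbers above. The helper names `bz2_size` and `lrzip_bz2_size` are made up for illustration. The lrzip helper assumes an `lrzip` binary on `PATH` that accepts `-b` (bzip2 backend) and `-o` (output file), and returns `None` if the binary is missing.

```python
import bz2
import shutil
import subprocess
import tempfile
from pathlib import Path


def bz2_size(path, chunk_size=1 << 20):
    """Return the size in bytes of `path` after plain bzip2 (level 9).

    The file is streamed in chunks so a large input does not have to
    fit in memory.
    """
    compressor = bz2.BZ2Compressor(9)
    total = 0
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            total += len(compressor.compress(chunk))
    total += len(compressor.flush())
    return total


def lrzip_bz2_size(path):
    """Return the size of the `lrzip -b` output, or None without lrzip.

    Assumption: `lrzip -b -o OUT IN` writes the archive to OUT.
    """
    if shutil.which("lrzip") is None:
        return None
    with tempfile.TemporaryDirectory() as tmp:
        out = Path(tmp) / "out.lrz"
        subprocess.run(
            ["lrzip", "-b", "-o", str(out), str(path)],
            check=True,
            capture_output=True,
        )
        return out.stat().st_size
```

Calling `bz2_size("files.tar")` and `lrzip_bz2_size("files.tar")` on the extracted upload should give the two figures to compare. The exact byte counts will depend on the bzip2 and lrzip versions installed.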