-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Windows Command-Line Tools #1
Comments
For more benchmarks, 11 in total so far: TEXTORAMIC_Decompression_Showdown_2019-Feb-21.pdf: Nakamichi_Ryuugan-ditto-1TB_btree_source.pdf: |
Hi Sanmayce--sorry for the delay in responding; I need to check my Github notification settings, it seems!
Thanks for your benchmark results, by the way--very interesting! |
Thank you Jeff for detailed explanation, you see, me being an C amateur "clouds" my extra limited knowledge of other languages and what their purposes are. |
Hi Sanmayce, FWIW I'm sure there are tools that can build self-contained executables for Java programs (and probably for compilation to native code rather than bytecode, too), so while we don't provide such executables ourselves you should be able to build them if you're sufficiently motivated :) (IMO it's easy enough to just invoke it using the |
Hi Jeff, thanks again. As far as I see, this COMPUTEX 2019 (May 27) will set a new trend in CPUs - 8 cores affordable by poor-people in the long run. I myself intend to have Matisse with 16 threads. Simply, MiGz' forte ought to be shown on some modern machine, enwik9 is one superb roster, de facto THE BENCHMARK. |
Thanks for the suggestion, Sanmayce. I didn't know about this standardized dataset, but will definitely keep it in mind if/when I have time to run more benchmarks in the future. In practice we're already using MiGz on very high core count servers (e.g. 32+ logical cores) but the cores tend to be individually slower than those you'd find in higher-end desktops. |
Your German XML dump is superbly on point, but not having references (other performers) is kinda not telling. In my view, MiGz will set a Pareto Frontier (decompression rate vs compressed size) with enwik9, it is interesting to see how threaded decompression fares against well-optimized single-threaded competition. After a month or so, my toy Nakamichi will finish enwik9, my expectation is to set a Pareto Frontier: |
I have written DEFLATE compressors suite, whose backend has zlib / 7-zip / zopfli / miniz / libslz / libdeflate / zlib-ng / igzip. And I have added MiGz format frontend with parallel compression / decompression. Help yourself if interested. https://github.com/cielavenir/7bgzf/tree/dev |
Thanks, @cielavenir! Looks very nice. I'll be particularly interested to compare the performance/overhead of C multithreading vs. Java's (assuming the underlying zlib is the same). At a low compression levels, it might be that your machine is unable to stream data from disk quickly enough to benefit from multithreading--I haven't observed this myself, but my test machine has a rather fast SSD. |
@cielavenir
The above results are for i7-3630QM, the initial package is downloadable at: |
If you have written it as a benchmark suite (similarly to lzbench and turbobench) it would be great. |
perhaps I should print the processing time, but for now please:
|
I'm a Windows user; an suggestion, putting the output of above script on your homepage (as other authors do) would be informative, now no one knows how fast your code is. |
|
@cielavenir For Windows 10 64bit, laptop with I7-3630QM 4cores/8threads and 16GB DDR3, SSD Samsung 860 PRO 256GB:
Note1: Under Windows filenames are case insensitive, thus your 's' and 'S' and 'z' and 'Z' are overlapping. Could you explain why so many executables, I wish I had one only to include in future benchmarks... |
Yes, but options are case sensitive. ...Perhaps you need to change the output filename, though.
What do you mean... I can see -cS1==-cS4, -cS5==-cS6, -cS7==-cS8 though. I read such information in 7-zip documentation. cf https://sevenzip.osdn.jp/chm/cmdline/switches/method.htm#ZipX
Before, I have said |
Hi, could you provide binaries for Windows? Wanna include MiGz into my benchmark roster using 512KB and bigger blocks, by the way, how bigger they can be?
Your explanation of the acronym is not clear to me, please explain:
"... also supports multithreaded decompression, which is especially important for large files that are read repeatedly. Hence, MiGz."
What does the little 'i' stand for?
Allow me two more questions:
As you can see from the above (random) test, LzTurbo 30 decompresses 1233.67/333.83=3.69x and compresses 306.55/15.35=19.97x faster than zlib 9!
For the second random example, LzTurbo 29 is 4x faster than zlib 9 in decompression, no need of 4 cores.
The text was updated successfully, but these errors were encountered: