Parallelize marker replacement #4

Closed
mxmlnkn opened this issue Nov 14, 2022 · 1 comment
Labels: performance (Something is slower than it could be)

Comments

mxmlnkn (Owner) commented Nov 14, 2022

The decoding works in two steps:

  1. Decode with a bogus backreference buffer initialized to 16-bit indexes.
  2. Replace those 16-bit indexes (markers) with the actual backreference contents.
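
To make step 2 concrete, here is a minimal sketch of what marker replacement could look like, assuming a hypothetical encoding where values below 256 are already-resolved literal bytes and larger values index into the previously decoded window (the actual pragzip marker format differs in detail):

```cpp
#include <cstdint>
#include <vector>

// Hypothetical marker encoding: values < MARKER_BASE are literal bytes;
// values >= MARKER_BASE index into the preceding 32 KiB window.
constexpr uint16_t MARKER_BASE = 256;

void replaceMarkers( std::vector<uint16_t>& buffer,
                     const std::vector<uint8_t>& previousWindow )
{
    for ( auto& symbol : buffer ) {
        if ( symbol >= MARKER_BASE ) {
            // Resolve the marker to the actual backreference content.
            symbol = previousWindow[symbol - MARKER_BASE];
        }
    }
}
```

The loop touches every decoded symbol exactly once, so its throughput is essentially bound by memory bandwidth, which matches the multi-GB/s numbers below.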

Currently, the second step is done on the orchestrator thread, which might limit performance. In benchmarks, marker replacement runs at 12 GB/s, and compacting the buffers from the 16-bit storage type, which after replacement only contains 8-bit values, runs at 4 GB/s.
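
The compaction step mentioned above could look roughly like this sketch: once all markers are resolved, every 16-bit slot holds a value below 256, so the buffer can be narrowed to half its size (function and variable names are illustrative):

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// After marker replacement, each uint16_t holds a plain byte value,
// so the buffer can be compacted to half the memory footprint.
std::vector<uint8_t> compactToBytes( const std::vector<uint16_t>& wide )
{
    std::vector<uint8_t> narrow( wide.size() );
    std::transform( wide.begin(), wide.end(), narrow.begin(),
                    [] ( uint16_t value ) { return static_cast<uint8_t>( value ); } );
    return narrow;
}
```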

  • This is quite fast already, so parallelizing might effectively yield (only) a factor-2 speedup. Furthermore, at this point, NUMA behavior might have to be considered for the ThreadPool.
  • Another problem is load balancing. Introducing yet another thread pool would oversaturate the processor, and shrinking the decoding thread pool to compensate would underutilize it. Therefore, it might be nice to reuse the existing thread pool for marker replacement. But then it would need some kind of priority system, because marker replacement should always have higher priority. We would also have to ensure that at least one thread can always decode, or else decoding would still slow down. Maybe the orchestrator thread can keep acting as the main marker replacer while also distributing further work into the thread pool. And if, even with higher priority, no worker has begun the marker replacement by the time the orchestrator thread has finished its own work, it should be possible to steal that work packet back from the thread pool and let the orchestrator thread do it; see the sketch after this list. This would also require some kind of work package ID for querying completion and for taking work back from the thread pool.
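
As a rough illustration of the steal-back idea, the sketch below wraps each marker-replacement job in a packet that exactly one party, either a pool worker or the orchestrator, can claim via an atomic flag. The `threadPool.submit` call and `Priority::HIGH` value are assumed for illustration and do not reflect pragzip's actual API:

```cpp
#include <atomic>
#include <functional>
#include <memory>

// A work packet that is run by whoever claims it first: either a
// thread-pool worker or the orchestrator thread stealing it back.
struct StealableTask
{
    std::function<void()> work;
    std::atomic<bool>     claimed{ false };
    std::atomic<bool>     finished{ false };

    // Returns true for exactly one caller, which then executes the task.
    bool tryRun()
    {
        bool expected = false;
        if ( !claimed.compare_exchange_strong( expected, true ) ) {
            return false;  // Already claimed by someone else.
        }
        work();
        finished.store( true );
        return true;
    }
};

// Orchestrator-side usage (threadPool.submit and Priority::HIGH are
// assumed interfaces, not pragzip's real ones):
//
//     auto task = std::make_shared<StealableTask>();
//     task->work = [=] { /* replace markers for one chunk */ };
//     threadPool.submit( [task] { task->tryRun(); }, Priority::HIGH );
//     /* ... orchestrator does its own marker replacement ... */
//     task->tryRun();  // Steal the packet back if no worker started it.
//     while ( !task->finished.load() ) { /* help with other packets */ }
```

The shared_ptr keeps the packet alive no matter which side runs it, and the `finished` flag stands in for the work-package-ID completion query mentioned above.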

All in all, this is slowly becoming an academic/high-performance-computing concern rather than one of general ratarmount/pragzip usage, but it would still be nice to have.

mxmlnkn added the performance label Nov 14, 2022
mxmlnkn (Owner) commented Jan 16, 2023

Implemented with mxmlnkn/indexed_bzip2@6cb4ab6

mxmlnkn closed this as completed Jan 16, 2023