
Avoid Exception-related performance issues in LZ4BlockInputStream when stopOnEmptyBlock == false #143

Merged
merged 1 commit into from Aug 30, 2019

Conversation

JoshRosen
Contributor

This PR improves the performance of LZ4BlockInputStream when stopOnEmptyBlock = false, a mode used in Apache Spark (see #105 and #76 for background).

When stopOnEmptyBlock = false, LZ4BlockInputStream will attempt to call refill() after it's reached the end of a compressed block (because end-of-block no longer implies end-of-stream). If we reach the end of the stream then refill()'s attempt to read the next block's magic header will fail and throw an EOFException, which is then caught and ignored.

Throwing and catching this exception has a significant performance penalty, especially in applications with deep call stacks: a significant amount of time is spent collecting exception stack traces. We could try to solve this problem by throwing a static exception, but that harms users' ability to debug unexpected exceptions.
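The static-exception alternative mentioned above can be illustrated with a small sketch (not lz4-java code; class and field names are hypothetical). A pre-allocated exception that overrides `fillInStackTrace()` with a no-op skips the stack walk, which is where most of the cost of throwing lives:

```java
import java.io.EOFException;

public class StaticEofDemo {
    // Hypothetical pre-allocated ("static") exception: the no-op
    // fillInStackTrace() means throwing it never captures a stack trace.
    static final EOFException STATIC_EOF = new EOFException("Stream ended") {
        @Override
        public synchronized Throwable fillInStackTrace() {
            return this; // skip the expensive stack walk
        }
    };

    public static void main(String[] args) {
        try {
            throw STATIC_EOF;
        } catch (EOFException e) {
            // No stack trace was ever captured for this instance.
            System.out.println(e.getStackTrace().length); // 0
        }
    }
}
```

The empty stack trace is exactly the debugging downside noted above: if the exception ever escapes unexpectedly, it points nowhere.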

This PR addresses this problem by adding a private tryReadFully() method, which is like readFully() except it signals success / failure by returning a boolean instead of throwing an exception. This allows us to skip the exception overhead in the case of "expected" exceptions, significantly improving performance.
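The pattern can be sketched as follows (a minimal standalone version over a plain `InputStream`, not the actual lz4-java method, which operates on the stream's own fields):

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

public class TryReadFullyDemo {
    // Read exactly len bytes into b, returning false (instead of throwing
    // EOFException) if the stream ends before len bytes are available.
    static boolean tryReadFully(InputStream in, byte[] b, int len) throws IOException {
        int read = 0;
        while (read < len) {
            int r = in.read(b, read, len - read);
            if (r < 0) {
                return false; // EOF before len bytes: no exception constructed
            }
            read += r;
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        byte[] buf = new byte[4];
        InputStream full = new ByteArrayInputStream(new byte[]{1, 2, 3, 4});
        InputStream truncated = new ByteArrayInputStream(new byte[]{1, 2});
        System.out.println(tryReadFully(full, buf, 4));      // true
        System.out.println(tryReadFully(truncated, buf, 4)); // false
    }
}
```

The caller can then treat `false` at a block boundary as normal end-of-stream, and reserve exceptions for genuinely unexpected truncation mid-block.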

Benchmarks

Here's a quick-and-dirty microbenchmark that I ran on my Mac (written in Scala):

import java.io._
import net.jpountz.lz4._

// Construct a test input consisting of two concatenated LZ4 streams
val testBytes = "Testing!".getBytes("UTF-8")
val bytes = new ByteArrayOutputStream()
val out = new LZ4BlockOutputStream(bytes)
out.write(testBytes)
out.close()
val concatInput = (bytes.toByteArray() ++ bytes.toByteArray())

// Measure repeated decompression speed
val start = System.currentTimeMillis()
var i = 0
while (i < 1000 * 1000) {
    i += 1
    val in = new LZ4BlockInputStream(new ByteArrayInputStream(concatInput), false)
    while (in.read() != -1) {}
    in.close()
}
val end = System.currentTimeMillis()
println(end - start)

This took ~5 seconds before this change and ~600 ms after. It isn't the most scientific benchmark (no JIT warmup), but I think it's a good illustration of the performance problem in the original code. Note that the performance issue is even more pronounced with deep stack traces (the stack is very shallow in this benchmark).

@JoshRosen
Contributor Author

/cc @maropu (who worked on the original implementation of this feature) and @kiszk (who recently upgraded lz4-java in Apache Spark)

@kiszk

kiszk commented Jun 24, 2019

@JoshRosen Good catch, LGTM

@odaira WDYT? As we know, returning state to the caller by throwing an exception looks too expensive.

@odaira
Member

odaira commented Jun 24, 2019

I haven't thoroughly checked the code, but your statement makes sense. I'll try the code myself.

private void readFully(byte[] b, int len) throws IOException {
// Like readFully(), except it signals incomplete reads by returning
// false instead of throwing EOFException.
private boolean tryReadFully(byte[] b, int len) throws IOException {
Contributor


nit: drop the throws IOException at the end.

Contributor Author


I considered this, but it won't work: the underlying read() call declares that it throws IOException, so we need to keep it here.

Contributor


ah, ok.
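For context on that exchange: `InputStream.read` is declared to throw the checked `IOException`, so any method that calls it without catching must keep the `throws` clause. A minimal illustration (the helper name is hypothetical):

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

public class CheckedExceptionDemo {
    // InputStream.read(byte[], int, int) declares "throws IOException", so a
    // caller that does not catch it must propagate the checked exception.
    // Removing "throws IOException" here would be a compile-time error.
    static int readOnce(InputStream in, byte[] b) throws IOException {
        return in.read(b, 0, b.length);
    }

    public static void main(String[] args) throws IOException {
        byte[] buf = new byte[3];
        int n = readOnce(new ByteArrayInputStream(new byte[]{7, 8, 9}), buf);
        System.out.println(n); // 3
    }
}
```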

@maropu
Copy link
Contributor

maropu commented Jun 28, 2019

I checked the code and it looks quite reasonable to me.

@maropu
Copy link
Contributor

maropu commented Jul 15, 2019

kindly ping

@kiszk

kiszk commented Jul 30, 2019

@odaira gentle ping

@odaira
Member

odaira commented Jul 31, 2019

Sorry, I have some urgent things to take care of, but I'll review this by the end of August.

@odaira
Member

odaira commented Aug 30, 2019

Thanks for your contribution. I confirmed the microbenchmark was more than 2x faster with this change. Good optimization!

@maropu
Contributor

maropu commented Aug 30, 2019

Thanks!

@joshrosen-stripe

Hi @odaira,

If you have time, would it be possible to make a new release of lz4-java so Spark can pick up this performance optimization?

@odaira
Member

odaira commented Sep 10, 2019

Yes, but I am working on a few more patches. The next release will come sometime next month.

@maropu
Contributor

maropu commented Dec 9, 2019

Thanks for the v1.7 release! @odaira

HyukjinKwon pushed a commit to apache/spark that referenced this pull request Dec 10, 2019
### What changes were proposed in this pull request?

This PR upgrades lz4-java from 1.6.0 to 1.7.0.

### Why are the changes needed?

This release includes a fix for a performance issue (lz4/lz4-java#143) by JoshRosen and some improvements (e.g., an LZ4 binary update). See the link below for the full list of changes:
https://github.com/lz4/lz4-java/blob/master/CHANGES.md#170

### Does this PR introduce any user-facing change?

No

### How was this patch tested?

Existing tests.

Closes #26823 from maropu/LZ4_1_7_0.

Authored-by: Takeshi Yamamuro <yamamuro@apache.org>
Signed-off-by: HyukjinKwon <gurwls223@apache.org>