KAFKA-8717: Reuse cached offset metadata when reading from log #7081

hachikuji · 2019-07-12T08:20:08Z

Although we currently cache offset metadata for the high watermark and last stable offset, we don't use it when reading from the log. Instead we always look it up from the index. This patch pushes fetch isolation into Log.read so that we are able to reuse the cached offset metadata.

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

ijuma · 2019-07-15T04:40:05Z

Thanks for the PR. Concretely, does this mean we save one index look up in the usual consumer reading to the end of the log (hw or lso) case? Even though one would expect the index to be in the page cache, always good to do less work when possible.

hachikuji · 2019-07-15T16:34:28Z

@ijuma Yes, that's right. This came up in the context of KIP-392 because we realized that, unlike the leader, the follower would always need one lookup to fetch the high watermark or LSO. However, we saw that the leader was already doing an extra lookup whenever it read from the log, so we thought we could eliminate that lookup and bring the follower fetch at least up to parity with the leader's current behavior.

To be perfectly honest, I am skeptical of the benefit of this offset metadata caching. I did some brief consumer performance testing with this patch and saw basically no difference. But it's a bit harder to say in a more general context with more partitions and more consumers. In any case, I thought this presented a nice opportunity to simplify some of the internal fetch APIs a little bit.

hachikuji · 2019-07-15T20:57:35Z

retest this please

hachikuji · 2019-07-22T22:38:07Z

retest this please

junrao

@hachikuji : Thanks for the PR. Great cleanup patch. A few comments below. Also, since this PR is kind of tricky, it's probably useful to have a jira to track it.

core/src/main/scala/kafka/log/Log.scala

core/src/main/scala/kafka/cluster/Partition.scala

core/src/test/scala/unit/kafka/log/LogTest.scala

hachikuji · 2019-07-28T19:26:35Z

retest this please

junrao

@hachikuji : Thanks for the PR. LGTM. Just a minor comment below.

junrao · 2019-07-29T20:34:38Z

core/src/main/scala/kafka/log/Log.scala

@@ -2086,6 +2107,14 @@ class Log(@volatile var dir: File,
    }
  }

+  /**
+   * Get the largest log segment with a base offset less than the given offset, if one exists.


less than => less than or equal to ?

hachikuji force-pushed the offset-metadata-refactor branch from 54b2a62 to 6c0712a Compare July 12, 2019 08:23

junrao reviewed Jul 25, 2019

View reviewed changes

hachikuji changed the title ~~MINOR: Reuse cached offset metadata when reading from log~~ KAFKA-8717: Reuse cached offset metadata when reading from log Jul 25, 2019

hachikuji added 4 commits July 25, 2019 18:02

MINOR: Reuse cached offset metadata when reading from log

497ac02

Fix 2.11 compiler error

2819f0f

Add a few more test cases

52e0a60

Address review comments

d4886e3

hachikuji force-pushed the offset-metadata-refactor branch from 77900f2 to d4886e3 Compare July 27, 2019 19:58

junrao approved these changes Jul 29, 2019

View reviewed changes

Revise comment on floorLogSegment

15ddf03

hachikuji merged commit a48b5d9 into apache:trunk Jul 30, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KAFKA-8717: Reuse cached offset metadata when reading from log #7081

KAFKA-8717: Reuse cached offset metadata when reading from log #7081

hachikuji commented Jul 12, 2019

ijuma commented Jul 15, 2019

hachikuji commented Jul 15, 2019 •

edited

hachikuji commented Jul 15, 2019

hachikuji commented Jul 22, 2019

junrao left a comment

hachikuji commented Jul 28, 2019

junrao left a comment

junrao Jul 29, 2019

KAFKA-8717: Reuse cached offset metadata when reading from log #7081

KAFKA-8717: Reuse cached offset metadata when reading from log #7081

Conversation

hachikuji commented Jul 12, 2019

Committer Checklist (excluded from commit message)

ijuma commented Jul 15, 2019

hachikuji commented Jul 15, 2019 • edited

hachikuji commented Jul 15, 2019

hachikuji commented Jul 22, 2019

junrao left a comment

Choose a reason for hiding this comment

hachikuji commented Jul 28, 2019

junrao left a comment

Choose a reason for hiding this comment

junrao Jul 29, 2019

Choose a reason for hiding this comment

hachikuji commented Jul 15, 2019 •

edited