POC: Enhance CF prefix check #12654

Zelldon · 2023-05-03T14:06:48Z

Description

Idea:

We know that the prefixes normally look like this: [0, 0, 0, 0, 0, 0, 0, 21] (length = 8).
The keys we check them against mostly look like this: [0, 0, 0, 0, 0, 0, 0, 21, 0, 0, 0, 12] (length = 12).

Right now we check the prefix by iterating from the beginning, which means we iterate over several zero bytes before reaching a byte that can actually differ. Starting from the back of the prefix avoids this, but on its own it only improves the case where the prefix does not match. This is what I tried first.
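A minimal sketch of the reverse comparison, for illustration only. The class and method names are hypothetical and are not taken from the actual change; it assumes the prefix and key are available as plain byte arrays.

```java
public final class ReversePrefixCheck {
  private ReversePrefixCheck() {}

  /**
   * Returns true if key starts with prefix. Bytes are compared from the end of
   * the prefix towards the beginning, so a mismatch in the column family byte
   * (the last prefix byte) is found on the first comparison.
   */
  public static boolean startsWith(final byte[] prefix, final byte[] key) {
    if (prefix.length > key.length) {
      return false;
    }
    for (int i = prefix.length - 1; i >= 0; i--) {
      if (prefix[i] != key[i]) {
        return false;
      }
    }
    return true;
  }
}
```

When the prefix matches, all prefix bytes are still compared, so this only shortens the non-matching case.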

Later I thought that, to improve this for all cases, I could check the column family byte directly. We currently have fewer than 128 column families, so we only need to check the last byte of the long (we write it in ByteOrder.BIG_ENDIAN). If that byte is equal, the entry is a valid entry for the CF; otherwise it is not.

Furthermore, sometimes we don't provide any prefix. We can check whether the length of the prefix is zero and, if so, skip checking the key further. If the prefix is longer than zero, we check the key against it (starting after the CF prefix).
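The two-step check described above (column family byte first, then the optional key prefix) could look roughly like the following sketch. All names here are hypothetical and not taken from the branch. It assumes the column family is written as a big-endian long at the start of the key and that its value fits into a single byte, so the upper seven bytes are always zero and are not inspected.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

public final class ColumnFamilyPrefixCheck {
  // The column family is written as a long, so it occupies the first 8 bytes.
  private static final int CF_PREFIX_LENGTH = Long.BYTES;

  private ColumnFamilyPrefixCheck() {}

  /**
   * Checks the single column family byte first, then the optional key prefix
   * that follows the column family. Assumption: column family values are
   * below 128, so only the last byte of the big-endian long can be non-zero.
   */
  public static boolean matches(
      final byte columnFamily, final byte[] keyPrefix, final byte[] key) {
    if (key.length < CF_PREFIX_LENGTH + keyPrefix.length) {
      return false;
    }
    // Big endian: the least significant byte is the last of the eight bytes.
    if (key[CF_PREFIX_LENGTH - 1] != columnFamily) {
      return false;
    }
    // No further prefix given: every key of this column family matches.
    if (keyPrefix.length == 0) {
      return true;
    }
    for (int i = 0; i < keyPrefix.length; i++) {
      if (key[CF_PREFIX_LENGTH + i] != keyPrefix[i]) {
        return false;
      }
    }
    return true;
  }

  /** Helper for the example: builds a key of column family (long) plus an int. */
  public static byte[] encode(final long columnFamily, final int keyPart) {
    return ByteBuffer.allocate(Long.BYTES + Integer.BYTES)
        .order(ByteOrder.BIG_ENDIAN)
        .putLong(columnFamily)
        .putInt(keyPart)
        .array();
  }
}
```

With `encode(21, 12)` producing the 12-byte key shown in the description, `matches((byte) 21, new byte[0], key)` only compares a single byte. The shortcut would no longer be safe once a column family value needs more than one byte.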

The JMH benchmark results were interesting, since they show a much lower error (variance) than the other test runs.

Result "io.camunda.zeebe.engine.perf.EnginePerformanceTest.measureProcessExecutionTime":
  223.584 ±(99.9%) 2.052 ops/s [Average]
  (min, avg, max) = (196.934, 223.584, 238.563), stdev = 8.686
  CI (99.9%): [221.533, 225.636] (assumes normal distribution)

Benchmark                                           Mode  Cnt    Score   Error  Units
EnginePerformanceTest.measureProcessExecutionTime  thrpt  200  223.584 ± 2.052  ops/s

I want to start a benchmark that maxes out performance to see whether there is any difference.

Related issues

related #12241

Definition of Done

Not all items need to be done depending on the issue and the pull request.

Code changes:

The changes are backwards compatible with previous versions
If it fixes a bug then PRs are created to backport the fix to the last two minor versions. You can trigger a backport by assigning labels (e.g. backport stable/1.3) to the PR; if that fails, you need to create the backports manually.

Testing:

There are unit/integration tests that verify all acceptance criteria of the issue
New tests are written to ensure backwards compatibility with future versions
The behavior is tested manually
The change has been verified by a QA run
The impact of the changes is verified by a benchmark

Documentation:

The documentation is updated (e.g. BPMN reference, configuration, examples, get-started guides, etc.)
If the PR changes how BPMN processes are validated (e.g. support new BPMN element) then the Camunda modeling team should be informed to adjust the BPMN linting.

Other teams:
If the change impacts another team an issue has been created for this team, explaining what they need to do to support this change.

Please refer to our review guidelines.

github-actions · 2023-05-03T14:34:44Z

Setup

Deployed to measurement-4872866750

camunda-platform:
  zeebe:
    image:
      repository: gcr.io/zeebe-io/zeebe
      tag: zell-improve-prefix-check-benchmark-bccd00e
  zeebe-gateway:
    image:
      repository: gcr.io/zeebe-io/zeebe
      tag: zell-improve-prefix-check-benchmark-bccd00e
global:
  image:
    tag: zell-improve-prefix-check-benchmark-bccd00e

Measurement before

Process Instance Execution Time: p99=0.915 p90=0.337 p50=0.098
Throughput: 149.971 PI/s
Grafana

Chaos injection

Deployed chaos network-latency-5

Measurement after

Process Instance Execution Time: p99=4.183 p90=2.250 p50=0.941
Throughput: 74.911 PI/s
Grafana

Details

See https://github.com/camunda/zeebe/actions/runs/4872866750

Zelldon · 2023-05-05T11:46:31Z

Closing for now; the branch will stay so we can apply it if we need it.

Zelldon added 2 commits May 3, 2023 14:13

fix: reverse prefix check

9a3dde1

wip: check CF byte and prefix separate

bccd00e

Zelldon added the benchmark label May 3, 2023

Zelldon changed the title ~~POC: Reverse prefix check~~ POC: Enhance CF prefix check May 3, 2023

This was referenced May 4, 2023

Implement JMH benchmark for support process instance creation on larger state #12241

Closed

[EPIC] Support stable performance for new instances even on larger state #12033

Closed

Zelldon removed the benchmark label May 5, 2023

Zelldon closed this May 5, 2023

Zelldon deleted the zell-improve-prefix-check branch March 28, 2024 15:53
