
Added module to enable CPU affinity #1641

Merged: 14 commits, merged Nov 13, 2018
Conversation


@merlimat commented Sep 4, 2018

Motivation

This is part of a set of changes aimed at reducing latency in BK at the expense of other aspects (e.g., max throughput). While not intended to be used as default settings, these options are good to have whenever latency becomes critical.

Pinning a thread to a particular CPU ensures that no other process will execute on that CPU, eliminating the scheduler-induced context switches that cause latency jitter.

A given thread that wants to be pinned to a CPU just needs to call:

```java
CpuAffinity.acquireCore();
```

It's called acquireCore() because it will also disable hyper-threading on the pinned CPU.
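For illustration, a minimal usage sketch (the thread name and loop body here are hypothetical, not part of this PR):

```java
// Hypothetical usage: pin a latency-critical consumer thread to its own core.
Thread ioThread = new Thread(() -> {
    // Must be called from the thread to be pinned: acquires a free isolated
    // core for the current thread and disables its hyper-threading sibling.
    CpuAffinity.acquireCore();

    while (!Thread.currentThread().isInterrupted()) {
        // ... latency-critical work, e.g. busy-polling a request queue ...
    }
}, "pinned-io-thread");
ioThread.start();
```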

Subsequent PRs will use this module to have the option to pin critical threads to available CPUs.

Changes

  • Added a JNI module that calls sched_setaffinity() to pin a thread to a particular CPU
  • Automatically discover the available isolated CPUs
  • Acquire file-based locks so that multiple processes on the same machine can acquire CPUs independently (a sketch of these last two steps follows this list)
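The last two bullets can be sketched roughly as follows. This is an illustrative reimplementation, not the PR's actual code: the sysfs path is the standard kernel location for the isolcpus= list, but the lock-file directory and all names here are assumptions.

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardOpenOption;
import java.util.ArrayList;
import java.util.List;

public class IsolatedCpus {

    // Standard sysfs file listing CPUs isolated with the isolcpus= boot
    // parameter, e.g. "2-5" or "1,3,5". Whether the PR reads exactly this
    // file is an assumption.
    private static final Path ISOLATED = Paths.get("/sys/devices/system/cpu/isolated");

    // Hypothetical lock directory; one lock file per CPU.
    private static final String LOCK_DIR = "/tmp";

    /** Parse the kernel's isolated-CPU list into individual CPU ids. */
    static List<Integer> discover() throws IOException {
        String spec = new String(Files.readAllBytes(ISOLATED)).trim();
        List<Integer> cpus = new ArrayList<>();
        if (spec.isEmpty()) {
            return cpus;
        }
        for (String range : spec.split(",")) {
            String[] bounds = range.split("-");
            int start = Integer.parseInt(bounds[0]);
            int end = bounds.length > 1 ? Integer.parseInt(bounds[1]) : start;
            for (int cpu = start; cpu <= end; cpu++) {
                cpus.add(cpu);
            }
        }
        return cpus;
    }

    /**
     * Try to claim a CPU via an OS-level file lock, so that independent
     * processes on the same machine never pin themselves to the same core.
     * Returns null if another process already owns this CPU.
     */
    static FileLock tryAcquire(int cpu) throws IOException {
        FileChannel channel = FileChannel.open(
                Paths.get(LOCK_DIR, "cpu-" + cpu + ".lock"),
                StandardOpenOption.CREATE, StandardOpenOption.WRITE);
        FileLock lock = channel.tryLock();
        if (lock == null) {
            channel.close(); // held by another process
        }
        return lock;
    }

    // The actual pinning goes through JNI; a plausible binding would be a
    // native method backed by sched_setaffinity(2) for the calling thread:
    // private static native int pinCurrentThread(int cpu);
}
```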

@merlimat merlimat added this to the 4.8.0 milestone Sep 4, 2018
@merlimat merlimat self-assigned this Sep 4, 2018
@eolivelli eolivelli modified the milestones: 4.8.0, 4.9.0 Sep 6, 2018
@merlimat

Ping. Please take a look again.

@ivankelly left a comment

The patch itself looks good. A few nits.

However, I think we need an overarching document for these "performance" improvements, with benchmark results, detailed instructions to show how to reproduce the scenario in which performance changes, and the overall goals and approach for the perf improvements.

The current patches feel very piecemeal and hand-wavy, and this becomes a problem later when someone wants to change something and the change is objected to on performance grounds without any solid facts on why it can't change.

Inline comment on cpu-affinity/pom.xml:
```xml
</profile>

<profile>
<id>Linux</id>
```
Contributor:

Why is the profile capitalized, while the other isn't?

Contributor Author:

This pom was copied from the existing circe-checksum one. The profile name is not that important, since the profile is activated automatically based on the current OS.

@merlimat commented Nov 8, 2018

@ivankelly @eolivelli @sijie Please take another look.

> However, I think we need an overarching document for these "performance" improvements, with benchmark results, detailed instructions to show how to reproduce the scenario in which performance changes, and the overall goals and approach for the perf improvements.

This is just the underlying implementation. I will work on the docs and instructions when the whole busy-wait change set is ready.

@ivankelly

> > However, I think we need an overarching document for these "performance" improvements, with benchmark results, detailed instructions to show how to reproduce the scenario in which performance changes, and the overall goals and approach for the perf improvements.
>
> This is just the underlying implementation. I will work on the docs and instructions when the whole busy-wait change set is ready.

Do you have some preliminary numbers then to show why this change is valuable?

@merlimat commented Nov 9, 2018

> Do you have some preliminary numbers then to show why this change is valuable?

@ivankelly

These are the numbers from my local tests. It's not using the BK client lib directly, but rather going through ManagedLedger (from Pulsar), which adds one more thread handoff in order to serialize callbacks to a single thread hashed on the topic name.

Conditions:

  • Client and server on the same machine, connected over TCP
  • Tested on a bare-metal node
  • Client sends 1K rps (1KB payloads) and waits for the responses
  • Multiple requests outstanding, responses in order
  • Latency measured in milliseconds
| Latency (ms)         | 50pct | 95pct | 99pct | 99.9pct | 99.99pct | 99.999pct | max   |
|----------------------|-------|-------|-------|---------|----------|-----------|-------|
| Regular queue        | 0.078 | 0.092 | 0.110 | 0.152   | 0.232    | 0.430     | 0.538 |
| Spin-Wait Journal    | 0.065 | 0.073 | 0.087 | 0.132   | 0.428    | 0.483     | 0.819 |
| Spin-Wait Workers    | 0.049 | 0.059 | 0.066 | 0.095   | 0.449    | 0.465     | 0.466 |
| Spin-Wait IO Threads | 0.036 | 0.047 | 0.052 | 0.075   | 0.436    | 0.445     | 0.452 |
| Disable HyperThread  | 0.029 | 0.038 | 0.041 | 0.054   | 0.204    | 0.388     | 0.396 |
| CPU-Affinity         | 0.029 | 0.037 | 0.040 | 0.050   | 0.066    | 0.090     | 0.104 |

I think the remaining bulk of the median latency is related to SO_BUSY_POLL (netty/netty#8268) being ineffective on the loopback interface.

Charts: (latency percentile charts; image not reproduced here)

@merlimat commented Nov 9, 2018

run pr validation

@ivankelly

@merlimat thanks. Do you have any numbers for just this change? Like without the spin-wait stuff and HT disablement?

@merlimat

@ivankelly All the changes in the table above are incremental, so the last two rows show the effect of CPU isolation:

| Latency (ms)        | 50pct | 95pct | 99pct | 99.9pct | 99.99pct | 99.999pct | max   |
|---------------------|-------|-------|-------|---------|----------|-----------|-------|
| Disable HyperThread | 0.029 | 0.038 | 0.041 | 0.054   | 0.204    | 0.388     | 0.396 |
| CPU-Affinity        | 0.029 | 0.037 | 0.040 | 0.050   | 0.066    | 0.090     | 0.104 |

I don't think it makes sense to test CPU isolation without the other changes, because the baseline noise would mask its effects.

To summarize: CPU affinity is used to shield a particular thread from interference by other tasks (OS threads and other processes), in order to avoid context switches and reduce long-tail latency.

Without busy-polling, the bulk of the context switches will still be there (e.g., sleeping while waiting for items in a queue), so CPU isolation on its own won't provide much benefit.
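To make the interaction concrete, here is a generic spin-wait consumer sketch (not code from this change set): a blocking queue take() parks the thread and forces a context switch on wake-up, while busy-polling keeps the pinned core hot at the cost of burning it.

```java
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;

public class SpinConsumer implements Runnable {
    private final Queue<Runnable> tasks = new ConcurrentLinkedQueue<>();
    private volatile boolean running = true;

    @Override
    public void run() {
        // With CPU affinity, this loop owns an isolated core: spinning on an
        // empty queue disturbs nobody and avoids the sleep/wake context
        // switches that a blocking queue would incur.
        while (running) {
            Runnable task = tasks.poll();
            if (task != null) {
                task.run();
            } else {
                Thread.onSpinWait(); // JDK 9+ CPU-friendly spin hint
            }
        }
    }

    public void submit(Runnable task) {
        tasks.offer(task);
    }

    public void stop() {
        running = false;
    }
}
```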
