optimize CPU inference with Array-Based Tree Traversal #11519
Conversation
Thank you for the inference optimization. Please unmark the "draft" status and ping me when the PR is ready for testing.
Cosmetic changes.
The next possible step would be to convert the trees into the array-based representation only once, rather than doing it for each block of data.
It sounds reasonable and will further improve performance (at the cost of increased memory consumption).
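To illustrate the trade-off being discussed, here is a rough, self-contained sketch (made-up names such as FlatTreeTop and CachedLayoutPredictor, not the actual xgboost classes): the flattened layout would be built once per tree when the predictor is set up and then reused for every block, spending extra memory to skip the per-block conversion.

```cpp
// Sketch only: one-time conversion of each tree's top levels, cached across
// data blocks, instead of re-flattening the trees for every block.
#include <cstddef>
#include <vector>

struct TreeModel {};  // stands in for the original tree representation

struct FlatTreeTop {  // flattened (array-layout) top levels of one tree
  std::vector<float> split_cond;
  std::vector<int> split_feature;
};

// Hypothetical conversion; the real routine would walk the tree's top levels.
FlatTreeTop FlattenTopLevels(TreeModel const& /*tree*/) { return {}; }

class CachedLayoutPredictor {
 public:
  explicit CachedLayoutPredictor(std::vector<TreeModel> const& trees) {
    // Pay the memory cost once: one FlatTreeTop per tree, kept alive.
    flat_tops_.reserve(trees.size());
    for (TreeModel const& tree : trees) {
      flat_tops_.push_back(FlattenTopLevels(tree));
    }
  }

  void PredictBlock(std::size_t /*block_size*/) const {
    // Each block reuses flat_tops_ directly; no per-block conversion here.
  }

 private:
  std::vector<FlatTreeTop> flat_tops_;
};

int main() {
  std::vector<TreeModel> trees(4);
  CachedLayoutPredictor predictor(trees);
  predictor.PredictBlock(64);
}
```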
Hi @trivialfis, the PR is ready for review.
cc @hcho3
Still trying to understand the code, will give it a try later. In the meantime, could you please craft some specific unit tests for the new inference algorithm?
*/
std::array<bst_node_t, kNodesCount + 1> nidx_in_tree_;

inline static bool IsLeaf(const RegTree& tree, bst_node_t nidx) {
No need for the `inline` keyword; the same goes for all following methods.
* We transform trees to the array layout for each block of data to avoid memory overheads.
* This makes the array layout inefficient for block_size == 1.
*/
const bool use_array_tree_layout = block_size > 1;
What happens if this is a small online inference call? The input size could be a few samples per call.
for (std::size_t i = 0; i < block_size; ++i) {
  bst_node_t nidx = 0;
  if constexpr (use_array_tree_layout) {
    nidx = p_nidx[i];
unused?
This PR introduces an optimization for CPU inference. For each tree, the top N levels are transformed into a compact array-based layout. This allows a branchless node-indexing rule: idx = 2 * idx + int(val < split_cond). To minimize memory overhead, the transformation from the standard tree structure to the array layout is performed on the fly for each block of data being processed. Even with these additional calculations, the improved data locality of the cache-friendly array layout speeds up inference by up to ~2x (1.4x on average).
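To make the indexing rule concrete, here is a minimal, self-contained sketch (hypothetical names such as ArrayTreeTop and TraverseTop, not the PR's actual data structures): the top levels of a tree are flattened into heap-ordered arrays, and each level is descended with the branchless update idx = 2 * idx + int(val < split_cond).

```cpp
// Minimal sketch, not the PR's implementation: the top kDepth levels of one
// tree are flattened into arrays indexed like an implicit heap (root at 1,
// children of node i at 2*i and 2*i + 1), so descending a level is branchless.
#include <iostream>
#include <vector>

constexpr int kDepth = 3;            // number of flattened levels
constexpr int kNodes = 1 << kDepth;  // 1-based storage, slot 0 unused

struct ArrayTreeTop {
  std::vector<float> split_cond = std::vector<float>(kNodes, 0.0f);
  std::vector<int> split_feature = std::vector<int>(kNodes, 0);
  // Layout convention in this sketch: the child taken when (val < split_cond)
  // is stored at 2*i + 1, the other child at 2*i.
};

// Walks the flattened top levels for one row and returns the slot reached in
// the deepest flattened level; a full implementation would then map this slot
// back to a node of the original tree and continue the normal traversal.
int TraverseTop(ArrayTreeTop const& tree, std::vector<float> const& row) {
  int idx = 1;
  for (int level = 0; level + 1 < kDepth; ++level) {
    float val = row[tree.split_feature[idx]];
    idx = 2 * idx + static_cast<int>(val < tree.split_cond[idx]);  // branchless
  }
  return idx;
}

int main() {
  ArrayTreeTop tree;  // toy tree: every node splits on feature 0 at 0.5
  for (int i = 1; i < kNodes; ++i) {
    tree.split_feature[i] = 0;
    tree.split_cond[i] = 0.5f;
  }
  std::vector<float> row = {0.3f};  // 0.3 < 0.5, so the "true" child is taken twice
  std::cout << "slot in deepest flattened level: " << TraverseTop(tree, row) << "\n";
}
```

The speedup reported above comes from the improved data locality of this kind of compact, contiguous layout compared with pointer-chasing through the standard node structure.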