refactor: replace merge-sort with heapsort #405

ali-behjati · 2024-05-15T11:58:01Z

This change replaces the merge-sort with a heapsort that uses much less CU than merge-sort.

The previous algorithm is very fast running on a normal CPU but doesn't work very well in BPF because all instructions (like load, move, ..) have the same cost and there is no cache and optimizations for memory-alignment have no real benefit.

The major benefits of heapsort are being non-recursive that reduces the high stackframe overhead in BPF and being inplace which minimizes number of copies.

Unfortunately there is no way to systematically get the the compute usage out of program test. The test_benchmark file has a simple code that helps running benchmarks on various number of publishers.

In 32-publisher setup, heapsort reduces the CU from 16.5k to 12k and in the 64-publisher setup 37k to 20.5k. The numbers are the worst cases running on randomized input. The result of running in a highly similar input (two distinct prices, and two distinct confidences) is 14.7k and 25k respectively, which is still better and is very unlikely in practice.

This change replaces the merge-sort with a heapsort that uses much less CU than merge-sort. The previous algorithm is very fast running on a normal CPU but doesn't work very well in BPF because all instructions (like load, move, ..) have the same cost and there is no cache and optimizations for memory-alignment have no real benefit. A major benefit of heapsort is being non-recursive that reduces the high stackframe overhead in BPF and is inplace which minimizes number of copies. Unfortunately there is no way to systematically get the the compute usage out of program test. The `test_benchmark` file has a simple code that helps running benchmarks on various number of publishers. In 32-publisher setup, heapsort reduces the CU from 16.5k to 12k and in the 64-publisher setup 37k to 20.5k. The numbers are the worst cases running on randomized input.

jayantk

very nicely done

program/c/src/oracle/sort/tmpl/sort_stable.c

program/c/src/oracle/model/price_model.h

guibescos · 2024-05-15T18:40:31Z

program/c/src/oracle/upd_aggregate.h

@@ -188,8 +188,7 @@ static inline bool upd_aggregate( pc_price_t *ptr, uint64_t slot, int64_t timest
    // note: numv>0 and nprcs = 3*numv at this point
    int64_t agg_p25;
    int64_t agg_p75;
-    int64_t scratch[ PC_NUM_COMP * 3 ]; // ~0.75KiB for current PC_NUM_COMP (FIXME: DOUBLE CHECK THIS FITS INTO STACK FRAME LIMIT)


Great that it's in place now

program/c/src/oracle/sort/test_sort_stable.c

program/c/src/oracle/model/test_price_model.c

guibescos

tyvm

This reverts commit 2ea67f3.

ali-behjati requested review from Reisen, jayantk and guibescos May 15, 2024 11:58

jayantk previously approved these changes May 15, 2024

View reviewed changes

program/c/src/oracle/sort/tmpl/sort_stable.c Show resolved Hide resolved

program/c/src/oracle/model/price_model.h Show resolved Hide resolved

ali-behjati dismissed jayantk’s stale review via d455638 May 15, 2024 12:54

ali-behjati force-pushed the optimize/use-heapsort branch 2 times, most recently from f04cafb to 0302196 Compare May 15, 2024 13:19

ali-behjati added 2 commits May 15, 2024 15:20

fix: use uint64_t for indices

3b469c7

chore: add more docs

c581da7

ali-behjati force-pushed the optimize/use-heapsort branch from 0302196 to c581da7 Compare May 15, 2024 13:20

guibescos reviewed May 15, 2024

View reviewed changes

program/c/src/oracle/sort/test_sort_stable.c Show resolved Hide resolved

ali-behjati force-pushed the optimize/use-heapsort branch from 3e13613 to 9afe6e8 Compare May 16, 2024 07:49

refactor: add more tests and benchmarks

5647ad0

ali-behjati force-pushed the optimize/use-heapsort branch from 9afe6e8 to 5647ad0 Compare May 16, 2024 07:50

refactor: improve comments

884ff7d

Reisen previously approved these changes May 16, 2024

View reviewed changes

program/c/src/oracle/model/test_price_model.c Show resolved Hide resolved

fix: use right array for populating testdata

9d489c1

ali-behjati dismissed Reisen’s stale review via 9d489c1 May 16, 2024 11:19

guibescos approved these changes May 16, 2024

View reviewed changes

ali-behjati merged commit 2ea67f3 into main May 16, 2024

ali-behjati deleted the optimize/use-heapsort branch May 16, 2024 15:24

ali-behjati added a commit that referenced this pull request May 16, 2024

Revert "refactor: replace merge-sort with heapsort (#405)"

ebc583e

This reverts commit 2ea67f3.

ali-behjati mentioned this pull request May 16, 2024

revert: bring merge-sort back #406

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor: replace merge-sort with heapsort #405

refactor: replace merge-sort with heapsort #405

Uh oh!

ali-behjati commented May 15, 2024 •

edited

Loading

Uh oh!

jayantk left a comment

Uh oh!

Uh oh!

Uh oh!

guibescos May 15, 2024

Uh oh!

Uh oh!

Uh oh!

guibescos left a comment

Uh oh!

Uh oh!

refactor: replace merge-sort with heapsort #405

refactor: replace merge-sort with heapsort #405

Uh oh!

Conversation

ali-behjati commented May 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jayantk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

guibescos May 15, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

guibescos left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ali-behjati commented May 15, 2024 •

edited

Loading