
Integrate CI for Running Benchmarks #436

Closed
tusharmath opened this issue Oct 7, 2023 · 24 comments
Labels: 💎 Bounty · type: chore (Routine tasks like conversions, reorganization, and maintenance work.)

Comments

tusharmath (Contributor) commented Oct 7, 2023

Overview

We need to enhance our CI pipeline to not only run our existing Criterion benchmarks but also ensure that the build time stays within acceptable limits. Moreover, a clear and concise report should be generated and published on the PR for quick and easy understanding.

Requirements

  1. Run Criterion Benchmarks: Our CI should execute the existing Criterion benchmarks as part of the pipeline.
  2. Fail on Degradation: The CI should fail if there's a performance degradation when compared to the main branch.
  3. Build Time Check: The CI should compare the build time of the current PR with the average build time of the last few successful builds on the main branch. If the build time increases by more than 10%, the CI should fail.
  4. Benchmark Report: A tabular comparison of benchmarks should be published on the PR, similar to the Codecov report. It should display the current benchmark results alongside the results from the main branch for easy comparison.
  5. Conditional Execution: Benchmarks and build time checks should only run when the benchmark label is added to a PR or commit (see the workflow sketch after this list).
  6. Support for Forks: The CI process should be designed to run benchmarks on forks as well.
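
A minimal GitHub Actions sketch of how requirements 1, 2, and 5 could fit together. The `benchmark` label comes from requirement 5; the workflow layout and the baseline names are assumptions, not the project's actual config:

name: benchmark

on:
  pull_request:
    types: [opened, synchronize, labeled]

jobs:
  bench:
    # Requirement 5: run only when the PR carries the `benchmark` label.
    if: contains(github.event.pull_request.labels.*.name, 'benchmark')
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0  # main's history is needed for the baseline run
      - uses: dtolnay/rust-toolchain@stable
      # Requirement 2: benchmark main first to establish the baseline...
      - run: |
          git checkout origin/main
          cargo bench -- --save-baseline main
      # ...then re-run on the PR head; Criterion compares each result
      # against the saved baseline and reports regressions.
      - run: |
          git checkout ${{ github.event.pull_request.head.sha }}
          cargo bench -- --baseline main

Criterion only reports the comparison; actually failing the job on a regression (requirement 2) and posting the tabular report on the PR (requirement 4) would need an extra step, for example via benchmark-action/github-action-benchmark.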

Rationale

Performance is paramount for our project. It's essential to ensure that any changes do not adversely affect either the runtime performance (tracked by benchmarks) or the build time. By integrating these checks into our CI, we can maintain a high standard of performance and ensure that contributors receive immediate feedback on any potential issues.

tusharmath changed the title from "CI for Benchmarks" to "Integrate CI for Running Benchmarks" on Oct 7, 2023
tusharmath (Contributor Author) commented:

/bounty 150$

algora-pbc bot commented Oct 7, 2023

💎 $150 bounty created by tailcallhq
🙋 If you'd like to work on this issue, comment below to get assigned
👉 To claim this bounty, submit a pull request that includes the text /claim #436 somewhere in its body
📝 Before proceeding, please make sure you can receive payouts in your country
💵 Payment arrives in your account 2-5 days after the bounty is rewarded
💯 You keep 100% of the bounty award
🙏 Thank you for contributing to tailcallhq/tailcall!
🙋‍♂️ Join our Discord channel if you need help.

5war00p commented Oct 7, 2023

/attempt #436


algora-pbc bot commented Oct 8, 2023

The bounty is up for grabs! Everyone is welcome to /attempt #436 🙌

algora-pbc bot commented Oct 8, 2023

Note: The user @cheikh2shift is already attempting to complete issue #436 and claim the bounty. If you attempt to complete the same issue, there is a chance that @cheikh2shift will complete the issue first, and be awarded the bounty. We recommend discussing with @cheikh2shift and potentially collaborating on the same solution versus creating an alternate solution.

algora-pbc bot commented Oct 9, 2023

The bounty is up for grabs! Everyone is welcome to /attempt #436 🙌

neo773 (Contributor) commented Oct 9, 2023

/attempt #436

meskill (Contributor) commented Oct 10, 2023

Using GitHub Actions to measure performance can be quite unreliable because of the way the runners execute; this is even mentioned in Criterion's FAQ.

tusharmath (Contributor Author) commented:

Interesting @meskill. Have you used iai? It seems like it's not actively maintained.

meskill (Contributor) commented Oct 10, 2023

> Interesting @meskill. Have you used iai? It seems like it's not actively maintained.

I have only heard about this kind of benchmarking tool but have never used one. Indeed, the project seems abandoned, but the question links to other projects that look more actively maintained.

tusharmath (Contributor Author) commented:

This seems like an alternative to iai: https://github.com/Joining7943/iai-callgrind

shambu2k commented Oct 22, 2023

/attempt #436


tusharmath (Contributor Author) commented:

The approach that I have in mind is to use iai-style tools to capture the following —

  Instructions:                1733
  L1 Hits:                     2358
  L2 Hits:                        0
  RAM Hits:                       3
  Total read+write:            2361
  Estimated Cycles:            2463

Since these are exact values, we could fail the build whenever any of these parameters increases. A sketch of such a check follows.
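
A minimal sketch of that gate, assuming it runs as a small helper binary in CI: it reads iai-callgrind's output (the format shown above) from stdin and exits non-zero if any metric grew. The helper itself and its wiring are hypothetical, not part of tailcall:

use std::io::{self, Read};
use std::process::exit;

fn main() {
  let mut output = String::new();
  io::stdin()
    .read_to_string(&mut output)
    .expect("failed to read benchmark output");

  let mut regressed = false;
  for line in output.lines() {
    // iai-callgrind prints lines like "Instructions: 10798 (-89.16146%)";
    // pull the signed percentage out from between "(" and "%)".
    if let (Some(open), Some(close)) = (line.find('('), line.find("%)")) {
      if open + 1 < close {
        if let Ok(delta) = line[open + 1..close].parse::<f64>() {
          if delta > 0.0 {
            eprintln!("regression: {}", line.trim());
            regressed = true;
          }
        }
      }
    }
  }
  if regressed {
    exit(1); // any increase in any parameter fails the build
  }
}

Usage would be something like cargo bench --bench json_like_bench 2>&1 | regression-gate, where regression-gate is a made-up name for this helper.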

algora-pbc bot commented Oct 23, 2023

The bounty is up for grabs! Everyone is welcome to /attempt #436 🙌

alankritdabral (Contributor) commented:

/attempt #436


algora-pbc bot commented Nov 4, 2023

@alankritdabral: The Tailcall Inc. team prefers to assign a single contributor to the issue rather than let anyone attempt it right away. We recommend waiting for a confirmation from a member before getting started.

tusharmath (Contributor Author) commented:

@alankritdabral would you be taking the approach that I have described above, using iai?

alankritdabral (Contributor) commented:

Yes, I am taking the same approach @tusharmath.

alankritdabral (Contributor) commented Nov 5, 2023

@tusharmath I am thinking of converting each bench file from Criterion to iai-callgrind so I can get exact data to compare. Just wanted to know if you agree with the approach.
benches/json_like_bench.rs
Original code:

use criterion::{black_box, criterion_group, criterion_main, Criterion};
use serde_json::json;

fn benchmark_batched_body(c: &mut Criterion) {
  c.bench_function("test_batched_body", |b| {
    b.iter(|| {
      let input = json!({
          "data": [
              {"user": {"id": "1"}},
              {"user": {"id": "2"}},
              {"user": {"id": "3"}},
              {"user": [
                  {"id": "4"},
                  {"id": "5"}
                  ]
              },
          ]
      });

      black_box(
        serde_json::to_value(tailcall::json::gather_path_matches(
          &input,
          &["data".into(), "user".into(), "id".into()],
          vec![],
        ))
        .unwrap(),
      );
    })
  });
}

criterion_group!(benches, benchmark_batched_body);
criterion_main!(benches);

Edited Code:

use iai_callgrind::{black_box, library_benchmark, library_benchmark_group, main};
use serde_json::json;

// Simplified local stand-in for tailcall::json::gather_path_matches:
// it only follows plain object keys along the path.
fn gather_path_matches(input: &serde_json::Value, path: &[&str]) -> Option<serde_json::Value> {
  let mut current = input;
  for key in path {
    current = match current.get(key) {
      Some(value) => value,
      None => return None, // the key doesn't exist at this level
    };
  }
  Some(current.clone())
}

#[library_benchmark]
fn benchmark_batched_body() {
  let input = json!({
      "data": [
          {"user": {"id": "1"}},
          {"user": {"id": "2"}},
          {"user": {"id": "3"}},
          {"user": [
              {"id": "4"},
              {"id": "5"}
          ]},
      ]
  });

  black_box(gather_path_matches(&input, &["data", "user", "id"]));
}

library_benchmark_group!(
    name = batched_body;
    benchmarks = benchmark_batched_body
);

main!(library_benchmark_groups = batched_body);

Results:

~/Desktop/git/tailcall$ cargo bench --bench json_like_bench
   Compiling tailcall v0.1.0 (/home/aloo/Desktop/git/tailcall)
    Finished bench [optimized] target(s) in 5.80s
     Running benches/json_like_bench.rs (target/release/deps/json_like_bench-6b149ebecce48594)
json_like_bench::batched_body::benchmark_batched_body
  Instructions:               10798 (-89.16146%)
  L1 Hits:                    15605 (-87.84515%)
  L2 Hits:                        1 (-99.35065%)
  RAM Hits:                     137 (-70.02188%)
  Total read+write:           15743 (-87.79575%)
  Estimated Cycles:           20405 (-85.94213%)



tusharmath (Contributor Author) commented:

@alankritdabral As discussed on Discord, we need both kinds of benchmarks. They signal orthogonal things, and it's possible that a reduction in instructions still results in a performance regression. A sketch of how both can coexist follows.
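
For illustration, a sketch of how both could live side by side in Cargo.toml, keeping the existing Criterion bench and adding a separate iai-callgrind target (the second file name is hypothetical). Both frameworks supply their own harness, so each target needs harness = false:

[[bench]]
name = "json_like_bench"      # Criterion: wall-clock time
harness = false

[[bench]]
name = "json_like_bench_iai"  # iai-callgrind: instruction counts
harness = false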

tusharmath (Contributor Author) commented:

Fixed in #762.

github-actions bot added the type: chore label (Routine tasks like conversions, reorganization, and maintenance work.) on Dec 31, 2023
epompeii (Contributor) commented Feb 8, 2024

@tusharmath I'm a little late to the party here, but down the road, if you want a more robust continuous benchmarking solution, you might consider checking out Bencher: https://github.com/bencherdev/bencher

tusharmath (Contributor Author) commented:

> @tusharmath I'm a little late to the party here, but down the road, if you want a more robust continuous benchmarking solution, you might consider checking out Bencher: https://github.com/bencherdev/bencher

This is a nice tool. We are a small team, so we take help from contributors to get things done. Happy to add a bounty for integrating the tool into our CI.

epompeii (Contributor) commented Feb 8, 2024

Thank you for the kind words! I would be more than happy to help with the integration. 😃
