CI: Continuous Benchmarking #166

jserv · 2023-07-19T09:28:29Z

github-action-benchmark provides a GitHub Action for continuous benchmarking, and we can utilize it to track performance regressions.

Expected output:

Integrate github-action-benchmark into the CI pipeline.
Deploy self-hosted runners, on both Arm64 and x86-64 hosts, for stable benchmarking environments.
Ensure that all benchmark charts from the above workflows are gathered on GitHub Pages.

Reference:

How to make GitHub Actions 22x faster with bare-metal Arm

ChinYikMing · 2023-08-29T17:16:18Z

Support pull requests. Instead of updating GitHub pages, add a comment to the pull request to explain benchmark results.

Quote from github-action-benchmark's future work.

If this CI run is enabled for PRs, the benchmark charts will update with each pull request. Since we must first confirm a performance improvement before updating the benchmark charts, this might not be what we want. In other words, we need to discuss how the PR performs before updating the benchmark chart. Or is it acceptable to update the benchmark chart with each PR?

And one more thing, may I know if @qwe661234 is working on this issue? If not, I could help to write the CI script if the above concern is OK.

jserv · 2023-08-29T17:23:41Z

And one more thing, may I know if @qwe661234 is working on this issue? If not, I could help to write the CI script if the above concern is OK.

Please move forward. Before submitting pull request(s), do write down your plans and communicate here.

jserv · 2023-08-29T17:27:59Z

If this CI run is enabled for PRs, the benchmark charts will update with each pull request. Since we must first confirm a performance improvement before updating the benchmark charts, this might not be what we want. In other words, we need to discuss how the PR performs before updating the benchmark chart. Or is it acceptable to update the benchmark chart with each PR?

If the results generated by the CI are deterministic, we can certainly treat them as references for incremental development. However, due to our current lack of confidence, it is premature to define a clear strategy at this stage.

ChinYikMing · 2023-09-02T15:59:56Z

My plan is as follows:

Run the benchmark workflow and update GitHub Pages on two events: a push to the master branch, and the opening or synchronizing of a pull request that targets the master branch. The workflow runs only when "emulate.c" or "rv32_template.c" is edited, because these two files can directly affect the benchmark results. This constraint avoids needless benchmark runs for changes to unrelated files, such as those in docs/. Do you have any suggestions for other files that might impact the benchmark?
The benchmark comparison between the most recent commit and the previous commit will be reported, so that the maintainer and the committer can discuss it. An example of a benchmark comparison can be seen in the benchmark action README: link
The main benchmarks will be “Dhrystone” and “CoreMark” in tests/ for now, because I can easily modify the script to generate the JSON file required by the benchmark action.
The x-axis of the benchmark graph will be the commit SHA, and the y-axis will be the unit of the benchmark. Taking “Dhrystone” as an example, the y-axis will be the average MIPS over 10 runs. An example benchmark graph can be seen in the benchmark action README: link
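A minimal sketch of the JSON generation step in the plan above, assuming the benchmark action is fed through its custom "bigger is better" JSON input (a list of objects with "name", "unit", and "value"). The function name, the unit strings, and the sample numbers are hypothetical and only illustrate averaging over 10 runs; the project's actual script may differ.

```python
import json
from statistics import mean


def to_benchmark_entries(samples):
    """Turn {benchmark name: (unit, [per-run values])} into a list of
    {"name", "unit", "value"} objects, one per benchmark.

    The value is the arithmetic mean of the runs, rounded to 3 decimals.
    """
    entries = []
    for name, (unit, runs) in samples.items():
        if not runs:
            raise ValueError(f"no samples recorded for {name}")
        entries.append({"name": name, "unit": unit, "value": round(mean(runs), 3)})
    return entries


if __name__ == "__main__":
    # Made-up numbers for illustration only; real values would come from
    # running each benchmark 10 times under the emulator.
    samples = {
        "Dhrystone": (
            "Average MIPS over 10 runs",
            [1010.0, 1005.0, 998.0, 1012.0, 1001.0,
             1003.0, 999.0, 1008.0, 1002.0, 1006.0],
        ),
        "CoreMark": (
            "Average iterations/sec over 10 runs",
            [950.0, 948.0, 955.0, 951.0, 949.0,
             953.0, 952.0, 947.0, 954.0, 950.0],
        ),
    }
    # The workflow would write this to a file and pass its path to the action.
    print(json.dumps(to_benchmark_entries(samples), indent=2))
```

Keeping the averaging in one small function makes it easy to change the number of runs or add another benchmark without touching the output format.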

ChinYikMing · 2023-09-03T05:06:50Z

The benchmark comparison between the most recent commit and the previous commit will be reported, so that the maintainer and the committer can discuss it. An example of a benchmark comparison can be seen in the benchmark action README: link

We can set the threshold that triggers the generation of the benchmark comparison, for example 0.5x worse than before; the default is 2x worse than before.
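To make the threshold idea concrete, here is a rough sketch of how a "how many times worse" ratio could be computed and checked. It is not the benchmark action's own code: the function name and the convention that MIPS-style metrics are "bigger is better" are assumptions for illustration. In the action itself the threshold is given as a percentage string such as '200%', which corresponds to the 2x default mentioned above.

```python
def is_regression(previous, current, threshold=2.0, bigger_is_better=True):
    """Return True when `current` is more than `threshold` times worse
    than `previous`.

    For bigger-is-better metrics (e.g. MIPS) "worse" means smaller, so the
    ratio is previous / current; for smaller-is-better metrics (e.g. run
    time) it is current / previous.
    """
    if previous <= 0 or current <= 0:
        raise ValueError("benchmark values must be positive")
    ratio = previous / current if bigger_is_better else current / previous
    return ratio > threshold


if __name__ == "__main__":
    # Dropping from 1000 to 400 MIPS is 2.5x worse: flagged at the 2x default.
    print(is_regression(1000.0, 400.0))
    # Dropping from 1000 to 600 MIPS is about 1.67x worse: only flagged
    # once the threshold is tightened to 1.5x.
    print(is_regression(1000.0, 600.0))
    print(is_regression(1000.0, 600.0, threshold=1.5))
```

A lower threshold produces comparison comments for smaller slowdowns, at the cost of more noise when the runner itself is not stable.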

jserv · 2023-09-06T04:40:35Z

Run the benchmark workflow and update GitHub Pages on two events: a push to the master branch, and the opening or synchronizing of a pull request that targets the master branch. The workflow runs only when "emulate.c" or "rv32_template.c" is edited, because these two files can directly affect the benchmark results. This constraint avoids needless benchmark runs for changes to unrelated files, such as those in docs/. Do you have any suggestions for other files that might impact the benchmark?

We shall track decode.c as well.
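The file-based trigger discussed here can be sketched as a small predicate over the list of changed paths. This only illustrates the rule; in the workflow the same effect would normally come from a path filter in the CI configuration. The function name is hypothetical, and matching on the file name alone is an assumption made because the thread does not state which directory the files live in.

```python
from pathlib import PurePosixPath

# Files named in this thread as able to affect benchmark results.
TRACKED_FILES = {"emulate.c", "rv32_template.c", "decode.c"}


def should_run_benchmark(changed_paths):
    """Return True if any changed path is one of the tracked source files.

    Only the final path component is compared, so "src/decode.c" and
    "decode.c" both match.
    """
    return any(PurePosixPath(p).name in TRACKED_FILES for p in changed_paths)


if __name__ == "__main__":
    print(should_run_benchmark(["docs/README.md"]))               # False
    print(should_run_benchmark(["docs/README.md", "src/decode.c"]))  # True
```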

The main benchmarks will be “Dhrystone” and “CoreMark” in tests/ for now, because I can easily modify the script to generate the JSON file required by the benchmark action.

Agree.

The 'pull_request_target' event is used for the benchmark action because it can update data in the 'gh-pages' branch, which is required for visualization on GitHub Pages. For pull requests from forks, the 'pull_request' event only receives a read-only 'GITHUB_TOKEN', which is not sufficient for this task. Furthermore, the 'workflow_dispatch' event is added to enable the initial setup and run of the benchmark, which stores the base benchmark data for future comparisons. To prevent redundant executions, a filter has been implemented to exclude the merging push event. Close #166

jserv assigned qwe661234 Jul 19, 2023

jserv assigned ChinYikMing Aug 29, 2023

jserv assigned ChinYikMing and unassigned qwe661234 and ChinYikMing Aug 29, 2023

ChinYikMing mentioned this issue Sep 9, 2023

CI: Implementing benchmark regression tests #213

Merged

jserv closed this as completed in #213 Sep 11, 2023
