ci(bench): bump per-target timeout_minutes to 45/60/45 (fixes #152) by polaz · Pull Request #153 · structured-world/structured-zstd

polaz · 2026-05-17T17:17:45Z

Summary

Today's main pushes (#148, #151, #138 → v0.0.21) all failed to publish bench data to https://structured-world.github.io/structured-zstd/dev/bench/ because 7 of the 21 bench shards hit the per-target timeout_minutes cap. When any shard fails, benchmark-aggregate and benchmark-pages never run, so no points land on the dashboard.

Which shards timed out

The bench-matrix runs 7 strategy shards on 3 targets in parallel. Of the 21 shards:

14 passed in 8-25 minutes
7 hit their cap exactly:
- lazy on all 3 targets (11 levels: 5..15)
- fast on all 3 targets (8 levels: -7..-1, 1)
- btultra2 on i686-gnu (3 levels but slow 32-bit target)

Each level takes ~2-3 minutes through the bench binary; 11 levels exceed the 25-30 min caps.

Fix

Bump per-target timeout_minutes:

x86_64-gnu: 25 → 45
i686-gnu: 30 → 60 (slower 32-bit target)
x86_64-musl: 25 → 45

This was user's explicit choice over splitting heavy shards (which would parallelize better but increase the CI surface). The bumped caps leave ~50% headroom over observed worst cases:

Observed worst: i686-gnu/btultra2 at 30m16s, i686-gnu/fast/lazy at ~30m
New i686 cap: 60m — fits with margin

Test plan

CI green on this PR (lint, tests, all 21 bench shards complete)
After merge, benchmark-pages job publishes a new data point on the dashboard

Closes #152

Summary by CodeRabbit

Chores
- Updated CI benchmark timeout configurations to allow extended test execution times on multiple target platforms.

… shards finish The bench-matrix runs seven strategy shards per target. The heaviest shards exceed the previous per-target caps on every main push: * lazy shard runs 11 levels (5..15) sequentially through one bench binary invocation. On any target this takes 25-30+ minutes, hitting the cap before the benchmark frame even starts on the last level. * fast shard runs 8 levels (-7..-1, 1). On i686-gnu (32-bit, ~30% slower) this consistently exceeds the previous 30m cap. * btultra2 shard runs 3 expensive levels (20..22). On i686-gnu the matcher passes alone push it past 30m. When a single shard hits its job timeout the whole CI run is marked failed, so benchmark-aggregate and benchmark-pages never run and the dashboard at https://structured-world.github.io/structured-zstd/dev/bench/ gets no new data point for that commit. This affected every main push today (#148, #151, #138 → v0.0.21 release). Bump per-target timeout_minutes: x86_64-gnu 25 -> 45, i686-gnu 30 -> 60, x86_64-musl 25 -> 45. The 32-bit target gets a higher absolute cap because its bench loop is consistently ~30% slower. Closes #152

coderabbitai · 2026-05-17T17:17:56Z

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: a7e0c0bc-1a9b-4915-8104-53f3a7242751

📥 Commits

Reviewing files that changed from the base of the PR and between 32475d3 and db5b0a4.

📒 Files selected for processing (1)

.github/workflows/ci.yml

📝 Walkthrough

Walkthrough

The CI workflow configuration increases timeout limits for three benchmark targets in the bench-matrix job. x86_64-gnu and x86_64-musl timeouts increase from 25 to 45 minutes, and i686-gnu increases from 30 to 60 minutes, allowing slower benchmark shards (lazy, fast, and btultra2 compression levels) to complete without premature termination.

Changes

Benchmark CI Timeout Configuration

Layer / File(s)	Summary
Benchmark target timeout adjustment `.github/workflows/ci.yml`	Timeout values updated for `x86_64-gnu` (25→45 min), `i686-gnu` (30→60 min), and `x86_64-musl` (25→45 min) to prevent lazy, fast, and btultra2 shards from timing out during benchmarks.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~5 minutes

Poem

A rabbit hops through CI flows with care,
When benchmarks dragged and timeout struck despair,
Now 60 minutes, plenty time to race—
The shards complete without a time-out chase! 🐰⏱️

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/#152-bench-timeouts

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Copilot

Pull request overview

Adjusts the GitHub Actions benchmark matrix job time budgets so long-running benchmark shards (notably fast, lazy, and btultra2 on i686) complete successfully, allowing downstream aggregation and pages publishing to run reliably.

Changes:

Increased per-target timeout_minutes for benchmark shards to reduce shard timeouts and unblock benchmark-aggregate/benchmark-pages.

codecov · 2026-05-17T17:20:25Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

sw-release-bot

⚠️ Performance Alert ⚠️

Possible performance regression was detected for benchmark 'structured-zstd vs C FFI (x86_64-gnu)'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.30.

Benchmark suite	Current: `db5b0a4`	Previous: `1e0f695`	Ratio
`compress/level_22_btultra2/small-4k-log-lines/matrix/c_ffi`	`0.112` ms	`0.067` ms	`1.67`
`compress/level_22_btultra2/decodecorpus-z000033/matrix/c_ffi`	`227.505` ms	`171.961` ms	`1.32`

This comment was automatically generated by workflow using github-action-benchmark.

CC: @polaz

The 45/60/45 caps from PR #153 still aren't enough for the slowest shards (lazy, fast). Log inspection of cancelled jobs (job 76419168123, sha f3a6dad) shows linear progress with ~9s per criterion iteration and no stalls — the bench suite simply has more combinations (level × scenario × codec_side × stream_variant × {compress, decompress}) than fit in 45 min: ~11 levels × ~5 scenarios × 2 sides × 2 stream variants × 2 ops ≈ 440 iterations × 9s ≈ 66 min worst-case shard Uniform 120 min cap (GH-hosted limit is 360 min) gives ~50% headroom on the slowest shards and unblocks the publish chain so the dev/bench dashboard stays current.

Copilot AI review requested due to automatic review settings May 17, 2026 17:17

polaz merged commit 2a077c9 into main May 17, 2026
5 of 6 checks passed

polaz deleted the fix/#152-bench-timeouts branch May 17, 2026 17:17

Copilot started reviewing on behalf of polaz May 17, 2026 17:18 View session

Copilot AI reviewed May 17, 2026

View reviewed changes

sw-release-bot Bot reviewed May 17, 2026

View reviewed changes

coderabbitai Bot mentioned this pull request May 17, 2026

ci(bench): remove fail-on-alert — regression is informational, not a gate #158

Closed

This was referenced May 17, 2026

ci(bench): bump per-target timeout_minutes to 120 — current 45/60 caps still hit by lazy/fast shards #160

Closed

ci(bench): bump bench shard timeout_minutes to 120 — fixes stuck publish #161

Merged

coderabbitai Bot mentioned this pull request May 18, 2026

perf(fse): replace next_state linear search with donor-parity flat tables + tune CI bench budgets #165

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci(bench): bump per-target timeout_minutes to 45/60/45 (fixes #152)#153

ci(bench): bump per-target timeout_minutes to 45/60/45 (fixes #152)#153
polaz merged 1 commit into
mainfrom
fix/#152-bench-timeouts

polaz commented May 17, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

Uh oh!

coderabbitai Bot commented May 17, 2026 •

edited

Loading

Review failed

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

Copilot AI left a comment

Uh oh!

codecov Bot commented May 17, 2026

Uh oh!

sw-release-bot Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

polaz commented May 17, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Which shards timed out

Fix

Test plan

Summary by CodeRabbit

Uh oh!

Uh oh!

coderabbitai Bot commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

codecov Bot commented May 17, 2026

Codecov Report

Uh oh!

sw-release-bot Bot left a comment

Choose a reason for hiding this comment

⚠️ Performance Alert ⚠️

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

polaz commented May 17, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 17, 2026 •

edited

Loading