Fix benchmarks.yml: use ec2-gha, remove FOMO benchmark #399

Merged: ryan-williams merged 5 commits into main from rw/ec2 on Sep 24, 2025


@ryan-williams (Collaborator) commented Sep 24, 2025

  • In #308 I neglected to update benchmarks.yml to use ec2-gha, which left it as an invalid workflow file.
  • Separately, benchmarks.yml has been broken on main since #384, which added a FOMO model benchmark that needs far more RAM and GPU memory and fails with out-of-memory errors.

This PR fixes the former by updating benchmarks.yml to use ec2-gha, and works around the latter by restoring the benchmark configs to their pre-#384 state.

benchmarks#172 is a passing run from f313865

ryan-williams and others added 4 commits September 23, 2025 16:39
The addition of FOMO model testing in PR #384 caused benchmark failures
due to excessive memory usage. FOMO with Perceiver layers uses ~170GB RAM
for CPU benchmarks and exceeds 22GB GPU memory.

This commit removes FOMO from the benchmark suite, restoring it to the
last known working state. A separate commit will follow to add a
lightweight FOMO configuration suitable for CI benchmarks.

Fixes benchmark OOM failures that started with #384.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@ryan-williams marked this pull request as ready for review September 24, 2025 20:00

@jder (Member) left a comment

Thanks! Looking into why these were broken was next on my list so hope it wasn't too bad :)

Comment thread .github/workflows/benchmarks.yml Outdated
Co-authored-by: Jesse Rusak <jesse@openathena.ai>
@ryan-williams merged commit 1f7cac1 into main Sep 24, 2025
4 checks passed
@ryan-williams deleted the rw/ec2 branch September 24, 2025 20:39