feat: Introduce Horizontal Pod Autoscaler, offload blocking evaluatio… by IsmailMehdi · Pull Request #269 · GoogleCloudPlatform/evalbench

IsmailMehdi · 2026-03-17T18:17:42Z

This pull request introduces infrastructure and code enhancements to support horizontal scaling, improve concurrency, and increase the robustness of session management within the evalbench service.

Key Changes
Scalability & Orchestration

Horizontal Pod Autoscaler (HPA): Added a new hpa.yaml configuration to automatically scale the evaluation server based on CPU utilization (target 50%).

Resource Management: Defined explicit CPU and memory requests and limits for the evalbench-eval container to ensure predictable scaling behavior.

Makefile Updates: Updated the deploy target to include the HPA configuration during deployment.

Performance & Concurrency

Async Offloading: Modified eval_service.py to offload blocking operations, such as evaluator.evaluate and _process_results, to a thread pool executor. This prevents the gRPC event loop from blocking during intensive evaluation tasks.

Container Optimization: Updated the Dockerfile to streamline supervisord configuration and set a fixed BUILD_TIME environment variable.

Session Management Robustness

Safe Lookups: Updated sessionmgr.py to use .get() for session lookups and added existence checks before deletion to prevent KeyError exceptions.

Improved Reaper Logic: Refactored the session reaper to be more efficient, using list comprehensions for identifying expired sessions and increasing the sleep interval to 10 seconds to reduce overhead.

…n tasks to a thread pool, and enhance session manager robustness.

…ve the build time environment variable.

IsmailMehdi · 2026-03-17T18:26:28Z

/gcbrun

IsmailMehdi · 2026-03-17T18:43:04Z

/gcbrun

…include additional metadata for sdist and wheel URLs.

IsmailMehdi · 2026-03-17T19:03:42Z

/gcbrun

…ndler.

IsmailMehdi · 2026-03-17T19:45:47Z

/gcbrun

…for UI and metrics.

…n an `UncloseableStream`.

…llations and removing NVM.

…hin the Docker container.

…e CSV reporter outputs to a shared volume when running in server mode.

…ve an extra blank line in `csv.py`.

…line length.

IsmailMehdi · 2026-03-18T01:30:41Z

/gcbrun

feat: Introduce Horizontal Pod Autoscaler, offload blocking evaluatio…

6024fb3

…n tasks to a thread pool, and enhance session manager robustness.

IsmailMehdi requested a review from mahyareb as a code owner March 17, 2026 18:17

IsmailMehdi and others added 2 commits March 17, 2026 11:17

Merge branch 'main' into scaling

53614d4

chore: Update Python package sources to an internal registry and remo…

900fb1e

…ve the build time environment variable.

IsmailMehdi requested a review from totoleon March 17, 2026 18:43

chore: Update uv.lock to use PyPI as the package source registry and …

480191b

…include additional metadata for sdist and wheel URLs.

Ismail Mehdi added 2 commits March 17, 2026 19:23

fix: Configure absl.logging to output to stdout and initialize its ha…

560d0ee

…ndler.

style: add blank lines for improved readability.

d3bd203

IsmailMehdi and others added 12 commits March 17, 2026 13:33

Merge branch 'main' into scaling

1b9c5bf

feat: Configure GCS FUSE for session management and expose new ports …

b02489e

…for UI and metrics.

Merge branch 'main' into scaling

98a919f

fix: Prevent logging handler from closing sys.stdout by wrapping it i…

d7c453e

…n an `UncloseableStream`.

feat: Install Node.js via NodeSource PPA, consolidating package insta…

a9f2741

…llations and removing NVM.

feat: Configure a dedicated home directory and user for evalbench wit…

89238f5

…hin the Docker container.

Merge branch 'main' into scaling

57dae61

feat: Enhance results directory discovery in the viewer and ensure th…

a4761e1

…e CSV reporter outputs to a shared volume when running in server mode.

Merge branch 'main' into scaling

b206d03

chore: Exclude evalbench/evalproto from pycodestyle checks and remo…

409191f

…ve an extra blank line in `csv.py`.

config: Adjust pycodestyle exclusions and enforce the configured max …

06eda69

…line length.

style: adjust whitespace and formatting for improved readability.

fe0d76f

totoleon approved these changes Mar 18, 2026

View reviewed changes

IsmailMehdi merged commit a639282 into main Mar 18, 2026
4 checks passed

release-please bot mentioned this pull request Mar 18, 2026

chore(main): release 1.1.0 #277

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Introduce Horizontal Pod Autoscaler, offload blocking evaluatio…#269

feat: Introduce Horizontal Pod Autoscaler, offload blocking evaluatio…#269
IsmailMehdi merged 18 commits intomainfrom
scaling

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 17, 2026

Uh oh!

IsmailMehdi commented Mar 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants