ci: switch integration tests to use eval proxy #1985
Merged
Change LLM_BASE_URL from llm-proxy.app.all-hands.dev to llm-proxy.eval.all-hands.dev and use LLM_API_KEY_EVAL secret. This enables testing models that are only available on the eval proxy. Co-authored-by: openhands <openhands@all-hands.dev>
2faedde to f490f2d
xingyaoww approved these changes Feb 10, 2026
🧪 Integration Tests Results
Overall Success Rate: 100.0%
📊 Summary
📋 Detailed Results
litellm_proxy_jade_spark_2862
Skipped Tests:
Summary
Switch integration tests from using the app proxy to the eval proxy.
Changes
• LLM_BASE_URL from llm-proxy.app.all-hands.dev to llm-proxy.eval.all-hands.dev
• LLM_API_KEY secret to EVAL_LLM_API_KEY
Motivation
Some models in resolve_model_config.py are only available on the eval proxy (e.g., jade-spark-2862). Using the app proxy causes integration tests to fail with "Invalid model name" errors for these models.
Required Action
The EVAL_LLM_API_KEY secret needs to be configured in the repository settings with a key that has access to the eval proxy.
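If the secret is missing, GitHub Actions substitutes an empty string, so the failure would otherwise only surface later as an authentication error from the proxy. A minimal sketch of a preflight check that a test runner could call before starting is shown below. It assumes the workflow exposes the settings to the tests as environment variables named LLM_BASE_URL and LLM_API_KEY; `check_llm_env` is a hypothetical helper, not something this PR adds.

```python
from urllib.parse import urlparse

# Host taken from the PR description above.
EXPECTED_PROXY_HOST = "llm-proxy.eval.all-hands.dev"


def check_llm_env(env):
    """Return a list of problems found in the LLM settings of `env`.

    Hypothetical helper: the variable names LLM_BASE_URL and LLM_API_KEY
    are assumptions about how the workflow passes settings to the tests.
    """
    problems = []

    base_url = env.get("LLM_BASE_URL", "").strip()
    api_key = env.get("LLM_API_KEY", "").strip()

    if not base_url:
        problems.append("LLM_BASE_URL is not set")
    else:
        # urlparse only fills in hostname when a scheme is present,
        # so fall back to the raw value for scheme-less settings.
        host = urlparse(base_url).hostname or base_url
        if host != EXPECTED_PROXY_HOST:
            problems.append(
                f"LLM_BASE_URL points at {host}, expected {EXPECTED_PROXY_HOST}"
            )

    if not api_key:
        # An unconfigured repository secret arrives as an empty string.
        problems.append("LLM_API_KEY is empty; is the eval secret configured?")

    return problems
```

Calling `check_llm_env(os.environ)` at the start of the test run and exiting non-zero when the returned list is non-empty would report a missing secret before any model is queried.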
Agent Server images for this PR
• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server
Variants & Base Images
• eclipse-temurin:17-jdk
• nikolaik/python-nodejs:python3.12-nodejs22
• golang:1.21-bookworm
Pull (multi-arch manifest)
# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:db248ea-python
Run
All tags pushed for this build
About Multi-Architecture Support
• Each variant tag (e.g., db248ea-python) is a multi-arch manifest supporting both amd64 and arm64
• Architecture-specific tags (e.g., db248ea-python-amd64) are also available if needed