fix(tests): add CI failure tolerance and fix 4 embedding tests (Section 5/5) #623

yehudit1987 · 2025-11-10T14:47:44Z

Disable Gemma embedding model in CI tests since it's a gated model
requiring HF_TOKEN. Tests now use Qwen3-Embedding-0.6B exclusively.

This approach was discussed and approved by maintainers who decided
to focus on Qwen3 (non-gated) for CI tests.
See: https://github.com/vllm-project/semantic-router/issues/573#issuecomment-3607352121

Changes in candle-binding/semantic-router_test.go:

- Set GemmaEmbeddingModelPath to empty string (disable Gemma)
- Update dimension expectations from 768/1024 to 1024 (Qwen3)
- Skip InitGemmaOnly test with clear explanation
- Remove conditional skip logic that was masking test failures
- Update comments to clarify Qwen3-only and Matryoshka usage

Changes in tools/make/models.mk:

- Add Qwen3-Embedding-0.6B to minimal download target for CI
- Remove Gemma from lora download target
- Improve download tracking with .downloaded marker files

Qwen3-Embedding-0.6B is fully open (no gating) and supports
Matryoshka dimension truncation (768/512/256/128) from its
native 1024 dimensions.

Resolves #573 (Section 5: Embedding Model Tests)

netlify · 2025-11-10T14:47:51Z

✅ Deploy Preview for vllm-semantic-router ready!

| Name | Link |
|---|---|
| 🔨 Latest commit | `b9af3bc` |
| 🔍 Latest deploy log | https://app.netlify.com/projects/vllm-semantic-router/deploys/693539b1ecdd7b00088e867b |
| 😎 Deploy Preview | https://deploy-preview-623--vllm-semantic-router.netlify.app |


github-actions · 2025-11-10T14:47:57Z

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 `Root Directory`

Owners: @rootfs, @Xunzhuo
Files changed:

.github/workflows/integration-test-k8s.yml

📁 `candle-binding`

Owners: @rootfs
Files changed:

candle-binding/semantic-router_test.go
candle-binding/src/classifiers/unified.rs
candle-binding/src/ffi/embedding.rs

📁 `e2e`

Owners: @Xunzhuo
Files changed:

e2e/profiles/aibrix/profile.go

📁 `tools`

Owners: @yuluo-yx, @rootfs, @Xunzhuo
Files changed:

tools/make/models.mk
tools/make/rust.mk

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

Signed-off-by: Yehudit Kerido <ykerido@ykerido-thinkpadp1gen7.raanaii.csb>

rootfs · 2025-12-07T15:52:40Z

.github/workflows/integration-test-k8s.yml

        run: |
          make build-e2e

+      - name: Free up disk space


This is a good start. I think the /mnt directory can be used if possible (e.g., move the models there and symlink them).

rootfs · 2025-12-07T15:53:39Z

candle-binding/src/classifiers/unified.rs

            model_type
        };

+        // Validate model availability and fall back if necessary


May not be the best option to fall back to a different model. Would you mind disabling the Gemma test and model download in CI?

rootfs · 2025-12-07T15:54:58Z

merging it to unblock other PRs

github-actions bot assigned rootfs and Xunzhuo Nov 10, 2025

yehudit1987 force-pushed the fix_skipped_tests_5 branch 2 times, most recently from cb26fa5 to 5cc12a3 Compare November 10, 2025 14:56

github-actions bot deleted a comment Nov 15, 2025

yehudit1987 force-pushed the fix_skipped_tests_5 branch 2 times, most recently from a925516 to ba72fb3 Compare December 4, 2025 11:55

yehudit1987 marked this pull request as ready for review December 4, 2025 12:51

yehudit1987 requested review from Xunzhuo and rootfs as code owners December 4, 2025 12:51

yehudit1987 marked this pull request as draft December 4, 2025 14:09

yehudit1987 force-pushed the fix_skipped_tests_5 branch from ba72fb3 to fdf4b91 Compare December 4, 2025 15:01

yehudit1987 marked this pull request as ready for review December 4, 2025 17:37

yehudit1987 force-pushed the fix_skipped_tests_5 branch 2 times, most recently from ae7829d to 1f5c6f9 Compare December 7, 2025 06:25

github-actions bot assigned yuluo-yx Dec 7, 2025

yehudit1987 force-pushed the fix_skipped_tests_5 branch 2 times, most recently from beb4c7c to e634bc6 Compare December 7, 2025 07:53

yehudit1987 marked this pull request as draft December 7, 2025 08:04

yehudit1987 force-pushed the fix_skipped_tests_5 branch from e634bc6 to f19a1ab Compare December 7, 2025 08:08

fix skipped tests

b9af3bc

Signed-off-by: Yehudit Kerido <ykerido@ykerido-thinkpadp1gen7.raanaii.csb>

yehudit1987 force-pushed the fix_skipped_tests_5 branch from f19a1ab to b9af3bc Compare December 7, 2025 08:24

yehudit1987 marked this pull request as ready for review December 7, 2025 09:04

rootfs reviewed Dec 7, 2025


rootfs merged commit 912fe2a into vllm-project:main Dec 7, 2025
35 checks passed

yehudit1987 mentioned this pull request Dec 7, 2025

Workflow: CI Disk Optimization #782

Closed

yehudit1987 deleted the fix_skipped_tests_5 branch December 8, 2025 07:49

liavweiss mentioned this pull request Dec 9, 2025

Move model storage to the /mnt directory on both the host and the Kin… #792

Merged

4 participants
