Skip to content

Add Windows CUDA CI coverage for Voxtral Realtime#17852

Merged
larryliu0820 merged 1 commit intomainfrom
codex/voxtral-realtime-windows-ci
Mar 4, 2026
Merged

Add Windows CUDA CI coverage for Voxtral Realtime#17852
larryliu0820 merged 1 commit intomainfrom
codex/voxtral-realtime-windows-ci

Conversation

@larryliu0820
Copy link
Contributor

@larryliu0820 larryliu0820 commented Mar 4, 2026

Summary

This PR adds mistralai/Voxtral-Mini-4B-Realtime-2602 coverage to Windows CUDA CI by following the existing cuda-windows.yml pattern (Linux export artifact + Windows runner E2E).

cuda-windows CI previously covered Voxtral 3B and Parakeet, but not Voxtral Realtime.
Voxtral Realtime also needed explicit cuda-windows export support in its exporter to generate Windows-targeted CUDA artifacts.

What changed

  • Updated Windows CUDA workflow matrix to include Voxtral Realtime with supported quantization:
    • quantized-int4-tile-packed
  • Kept Voxtral Realtime in offline mode for CI:
    • passes vr-offline in export job
  • Extended Windows E2E script support for Voxtral Realtime:
    • builds voxtral_realtime_runner
    • runs with --preprocessor_path, tokenizer, audio, and CUDA --data_path
    • validates expected runtime output string
  • Added cuda-windows backend support in export_voxtral_rt.py:
    • accepts --backend cuda-windows
    • lowers using CUDA partitioner with Windows compile spec (platform=windows)
    • normalizes model/export logic to CUDA internals where appropriate
  • Added export artifact validation:
    • asserts aoti_cuda_blob.ptd exists for Voxtral Realtime on cuda/cuda-windows

@larryliu0820 larryliu0820 requested a review from lucylq as a code owner March 4, 2026 19:01
@pytorch-bot
Copy link

pytorch-bot bot commented Mar 4, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17852

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 3 Cancelled Jobs, 3 Unrelated Failures

As of commit f45642a with merge base 5ddbab2 (image):

NEW FAILURES - The following jobs have failed:

CANCELLED JOBS - The following jobs were cancelled. Please retry:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 4, 2026
@github-actions
Copy link

github-actions bot commented Mar 4, 2026

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@larryliu0820 larryliu0820 temporarily deployed to upload-benchmark-results March 4, 2026 20:15 — with GitHub Actions Inactive
@larryliu0820 larryliu0820 merged commit 2dd9034 into main Mar 4, 2026
362 of 372 checks passed
@larryliu0820 larryliu0820 deleted the codex/voxtral-realtime-windows-ci branch March 4, 2026 23:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants