Voxtral Realtime: enable streaming mode in CUDA CI#17844

Merged
mergennachin merged 1 commit into main from enable_streaming
Mar 4, 2026


@mergennachin
Contributor

Remove the vr-offline override so the CUDA CI runs Voxtral Realtime
in streaming mode (the default). The streaming encoder path exercises
the full pipeline including ring buffer KV cache and incremental mel
processing.
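
For readers unfamiliar with the term, the ring-buffer KV cache mentioned above can be pictured with the minimal sketch below. It is purely illustrative and is not taken from the ExecuTorch or Voxtral Realtime code: the class name `RingBufferKVCache`, its methods, and the string keys and values are hypothetical. It only shows the general idea that a fixed-capacity buffer overwrites its oldest entry as new streaming steps arrive.

```python
class RingBufferKVCache:
    """Hypothetical fixed-capacity cache that overwrites the oldest entry once full."""

    def __init__(self, capacity: int) -> None:
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self.keys = [None] * capacity
        self.values = [None] * capacity
        self.total_written = 0  # number of entries ever appended

    def append(self, key, value) -> None:
        # The write position wraps around, so memory use stays constant.
        slot = self.total_written % self.capacity
        self.keys[slot] = key
        self.values[slot] = value
        self.total_written += 1

    def window(self):
        """Return (keys, values) in chronological order, oldest first."""
        n = min(self.total_written, self.capacity)
        start = (self.total_written - n) % self.capacity
        order = [(start + i) % self.capacity for i in range(n)]
        return [self.keys[i] for i in order], [self.values[i] for i in order]


# Five streaming steps into a cache that holds three: the two oldest are dropped.
cache = RingBufferKVCache(capacity=3)
for step in range(5):
    cache.append(f"k{step}", f"v{step}")
keys, values = cache.window()
print(keys)  # ['k2', 'k3', 'k4']
```

An offline run that sees the whole input at once would not necessarily hit the wrap-around case, which is one plausible reason streaming mode gives broader coverage in CI.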

Copilot AI review requested due to automatic review settings March 4, 2026 14:24
@pytorch-bot

pytorch-bot Bot commented Mar 4, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17844

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 9614d5c with merge base 6db7f4c (image):

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla Bot added the CLA Signed label Mar 4, 2026
@github-actions

github-actions Bot commented Mar 4, 2026

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e., would users of this library care about this change?), please use a label starting with `release notes:`. This helps us keep track of your work and include it in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@mergennachin mergennachin requested a review from Gasoonjia March 4, 2026 14:25
Contributor

Copilot AI left a comment

Pull request overview

This PR updates the CUDA CI workflow to run Voxtral Realtime in its default streaming mode by removing the explicit vr-offline override. This helps exercise the streaming encoder pipeline in CUDA CI (e.g., incremental mel + ring-buffer KV cache).

Changes:

  • Remove the workflow logic that forces Voxtral Realtime into vr-offline mode during CUDA artifact export.
  • Remove the workflow logic that forces Voxtral Realtime into vr-offline mode during CUDA e2e testing.
  • Update related workflow comments to no longer mention “offline mode”.

@mergennachin mergennachin merged commit 6ad5f75 into main Mar 4, 2026
221 of 225 checks passed
@mergennachin mergennachin deleted the enable_streaming branch March 4, 2026 18:56
jpiat pushed a commit to jpiat/executorch that referenced this pull request Mar 17, 2026
Remove the vr-offline override so the CUDA CI runs Voxtral Realtime
in streaming mode (the default). The streaming encoder path exercises
the full pipeline including ring buffer KV cache and incremental mel
processing.
Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

3 participants