feat(llm): add QMD_FORCE_CUDA env var to disable Vulkan offloading by JasonOA888 · Pull Request #338 · tobi/qmd

JasonOA888 · 2026-03-09T07:42:33Z

Problem

On Windows VMs with para-virtualized GPUs (e.g., ExHyperV RTX 4090), QMD may use Vulkan offloading instead of pure CUDA mode even when CUDA is working correctly:

$ qmd status
GPU: vulkan (offloading: yes)

Solution

Add QMD_FORCE_CUDA environment variable to force CUDA and disable Vulkan:

export QMD_FORCE_CUDA=1
qmd query "test"

This sets gpu: "cuda" in getLlama() options, bypassing the auto-detection that might choose Vulkan.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(llm): add QMD_FORCE_CUDA env var to disable Vulkan offloading#338

feat(llm): add QMD_FORCE_CUDA env var to disable Vulkan offloading#338
JasonOA888 wants to merge 1 commit into
tobi:mainfrom
JasonOA888:feat/force-cuda-env

JasonOA888 commented Mar 9, 2026

Uh oh!

tobi commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

JasonOA888 commented Mar 9, 2026

Problem

Solution

Related

Uh oh!

tobi commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants