Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to override with two mdev profiles within one VM #24

Open
nyacat opened this issue Dec 19, 2022 · 1 comment
Open

Unable to override with two mdev profiles within one VM #24

nyacat opened this issue Dec 19, 2022 · 1 comment

Comments

@nyacat
Copy link

nyacat commented Dec 19, 2022

OS: Proxmox 7.3
Kernel: 5.15.74-1-pve
Host Driver: 525.60.12
Guest Driver: 525.60.13
Hardware: RTX 2060 12G * 2

Host Config:

[mdev.00000000-0000-0000-0000-000000000102]
num_displays = 1
display_width = 1920
display_height = 1080
max_pixels = 2073600
cuda_enabled = 1
frl_enabled = 0
framebuffer = 0x162000000

[mdev.00000001-0000-0000-0000-000000000102]
num_displays = 1
display_width = 1920
display_height = 1080
max_pixels = 2073600
cuda_enabled = 1
frl_enabled = 0
framebuffer = 0x162000000

VM Config

hostpci0: 0000:82:00.0,mdev=nvidia-259,pcie=1
hostpci1: 0000:01:00.0,mdev=nvidia-259,pcie=1

PVE vm Start

kvm: -device vfio-pci,sysfsdev=/sys/bus/mdev/devices/00000000-0000-0000-0000-000000000102,id=hostpci0,bus=ich9-pcie-port-1,addr=0x0: warning: vfio 00000000-0000-0000-0000-000000000102: Could not enable error recovery for the device
kvm: -device vfio-pci,sysfsdev=/sys/bus/mdev/devices/00000001-0000-0000-0000-000000000102,id=hostpci1,bus=ich9-pcie-port-2,addr=0x0: warning: vfio 00000001-0000-0000-0000-000000000102: Could not enable error recovery for the device
TASK OK

Host journalctl nvidia-vgpu-mgr.service
pastebin

Problem:
When two vGPUs are added to a virtual machine, the mdev override does not work properly, but if override with a profile it works fine.

@mbilker
Copy link
Owner

mbilker commented Dec 20, 2022

From the system log, I see that the second start call is not recognized as a start call so it might be using a different command code so it likely needs additional support in vgpu_unlock-rs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants