VCS licenses are acquired per-cluster rather than per-gpu #267

@kralicky

1. Issue or feature description

I created four clusters and installed the GPU Operator in each of them. Every cluster contains a single node, and each node has been given 1 of the 8 vGPUs available on the host: two physical GPUs providing 4 vGPUs each (everything runs on one machine, with the nodes as VMs). nvidia-gridd leases four licenses from the NLS, but it should lease only two, one per physical GPU.

nvidia-smi output on the host:

❯ nvidia-smi vgpu
Fri Oct  8 18:41:01 2021       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.63                 Driver Version: 470.63                    |
|---------------------------------+------------------------------+------------+
| GPU  Name                       | Bus-Id                       | GPU-Util   |
|      vGPU ID     Name           | VM ID     VM Name            | vGPU-Util  |
|=================================+==============================+============|
|   0  Tesla T4                   | 00000000:85:00.0             |   0%       |
|      3251649543  GRID T4-4C     | 0607...  instance-00000273   |      0%    |
+---------------------------------+------------------------------+------------+
|   1  Tesla T4                   | 00000000:C1:00.0             |   0%       |
|      3251635394  GRID T4-4C     | aa32...  instance-00000264   |      0%    |
|      3251643331  GRID T4-4C     | d5b3...  instance-0000026e   |      0%    |
|      3251652536  GRID T4-4C     | 6107...  instance-00000275   |      0%    |
+---------------------------------+------------------------------+------------+
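
To make the expected count concrete, here is a minimal sketch that tallies the vGPU rows in the table above under each physical GPU. It only illustrates the arithmetic behind "four leased versus two expected"; it is not how nvidia-gridd or the NLS actually counts licenses. The row patterns and the names `GPU_ROW`, `VGPU_ROW`, and `vgpus_per_gpu` are assumptions made for this example, based solely on the output pasted here.

```python
import re

# Rows copied from the `nvidia-smi vgpu` output above (border lines omitted).
SAMPLE = """\
|   0  Tesla T4                   | 00000000:85:00.0             |   0%       |
|      3251649543  GRID T4-4C     | 0607...  instance-00000273   |      0%    |
|   1  Tesla T4                   | 00000000:C1:00.0             |   0%       |
|      3251635394  GRID T4-4C     | aa32...  instance-00000264   |      0%    |
|      3251643331  GRID T4-4C     | d5b3...  instance-0000026e   |      0%    |
|      3251652536  GRID T4-4C     | 6107...  instance-00000275   |      0%    |
"""

# Assumption: a physical GPU row carries a PCI bus ID in its second column.
GPU_ROW = re.compile(
    r"^\|\s+\d+\s+.+?\s+\|\s+"
    r"([0-9A-Fa-f]{8}:[0-9A-Fa-f]{2}:[0-9A-Fa-f]{2}\.[0-9A-Fa-f])\s+\|"
)
# Assumption: a vGPU row has a numeric vGPU ID followed by a "GRID <profile>" name.
VGPU_ROW = re.compile(r"^\|\s+\d+\s+GRID\s+\S+\s+\|")


def vgpus_per_gpu(table: str) -> dict:
    """Map each physical GPU's bus ID to the number of vGPU rows listed under it."""
    counts = {}
    current = None
    for line in table.splitlines():
        gpu = GPU_ROW.match(line)
        if gpu:
            current = gpu.group(1)
            counts[current] = 0
        elif current is not None and VGPU_ROW.match(line):
            counts[current] += 1
    return counts


counts = vgpus_per_gpu(SAMPLE)
per_vgpu = sum(counts.values())                  # one license per vGPU / VM
per_gpu = sum(1 for n in counts.values() if n)   # one license per physical GPU in use

print(counts)
print("licenses if counted per vGPU:", per_vgpu)
print("licenses if counted per physical GPU:", per_gpu)
```

With the rows above, the per-vGPU tally matches the four leases observed, while the per-physical-GPU tally matches the two expected.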

