GPU not detected for Realtime API #2381

### Version

`0.39.1`

### Description

I tried to run a GPU-backed _Realtime API_ on Cortex 0.39.1, but the container does not detect the GPU. Running the same image on a local machine with Docker works fine.

### Configuration

```yaml
- name: scratch
  kind: RealtimeAPI
  pod:
    port: 8000
    containers:
    - name: api
      image: nricklin/ubuntu-gpu-test
      compute:
        gpu: 1
```

### Expected behavior

```bash
[I] ➜ docker run --gpus all --rm nricklin/ubuntu-gpu-test

Number of CUDA Devices = 1
===========================
Device 0 has name Quadro P2000 with compute capability 6.1 canMapHostMemory=1
                           global memory = 3.9454
HostToDevice PCI Express BW=11.4723 GB/s
DeviceToHost PCI Express BW=11.4917 GB/s
```

### Actual behavior

```bash
[I] ➜ cortex logs --random-pod scratch

waiting for pod to initialize ...
test.cu(29) : cudaSafeCall() Runtime API error : no CUDA-capable device is detected.
Starting admin server on :15000
Starting proxy server on :8888
TCP probe to user-provided container port failed: dial tcp 127.0.0.1:8000: connect: connection refused
TCP probe to user-provided container port failed: dial tcp 127.0.0.1:8000: connect: connection refused
```
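
To narrow this down, one check that could be run inside the `api` container (assuming the image has a shell) is whether any NVIDIA device nodes are exposed at all. On Linux the GPUs normally appear as `/dev/nvidia0`, `/dev/nvidia1`, and so on. This is only a rough sketch: `count_nvidia_devices` is a hypothetical helper, not part of Cortex or the test image.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of Cortex): count NVIDIA GPU device nodes
# under a given directory. Defaults to /dev; the argument exists so the
# function can be pointed at another directory.
count_nvidia_devices() {
  local dev_dir="${1:-/dev}"
  local count=0
  local node
  # Matches nvidia0, nvidia1, ... but not nvidiactl or nvidia-uvm.
  for node in "$dev_dir"/nvidia[0-9]*; do
    # When nothing matches, the glob stays literal, so check existence.
    if [ -e "$node" ]; then
      count=$((count + 1))
    fi
  done
  echo "$count"
}

echo "NVIDIA device nodes visible: $(count_nvidia_devices)"
```

A count of zero inside the pod would suggest the GPU is never attached to the container, as opposed to a CUDA or driver mismatch inside the image.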


