Closed
Description
Version
0.39.1
Description
Tried to run a GPU Realtime API on cortex 0.39 but the GPU is not being detected for some reason. I tried to run it on a local machine and it worked fine.
Configuration
- name: scratch
kind: RealtimeAPI
pod:
port: 8000
containers:
- name: api
image: nricklin/ubuntu-gpu-test
compute:
gpu: 1
Expected behavior
[I] ➜ docker run --gpus all --rm nricklin/ubuntu-gpu-test
Number of CUDA Devices = 1
===========================
Device 0 has name Quadro P2000 with compute capability 6.1 canMapHostMemory=1
global memory = 3.9454
HostToDevice PCI Express BW=11.4723 GB/s
DeviceToHost PCI Express BW=11.4917 GB/s
Actual behavior
[I] ➜ cortex logs --random-pod scratch
waiting for pod to initialize ...
test.cu(29) : cudaSafeCall() Runtime API error : no CUDA-capable device is detected.
Starting admin server on :15000
Starting proxy server on :8888
TCP probe to user-provided container port failed: dial tcp 127.0.0.1:8000: connect: connection refused
TCP probe to user-provided container port failed: dial tcp 127.0.0.1:8000: connect: connection refused