Skip to content

devops-ws/gpu-guide

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 

Repository files navigation

gpu-guide

安装 hd 工具:curl https://gitee.com/linuxsuren/tools/raw/master/install-zh.sh|bash

安装 Toolkit

hd i nvidia-container-toolkit

安装 NVIDIA 容器运行时

hd i nvidia-docker2

安装 GPU Operator

helm repo add nvidia https://nvidia.github.io/gpu-operator
helm repo update
helm install gpu-operator nvidia/gpu-operator -n gpu-operator-resources --set toolkit.version=1.6.0-centos7

测试

cat <<EOF | kubectl apply -f -
apiVersion: v1
kind: Pod
metadata:
   name: dcgmproftester
spec:
   restartPolicy: OnFailure
   containers:
   - name: dcgmproftester
     image: nvcr.io/nvidia/k8s/cuda-sample:vectoradd-cuda10.2
     resources:
      limits:
         nvidia.com/gpu: 1
EOF

FAQ

  • version GLIBC_2.27' not found`
    • 部分驱动有问题,可以通过指定驱动版本解决。详情参考这里.

参考