Skip to content

Conversation

@stubbi
Copy link
Collaborator

@stubbi stubbi commented Dec 10, 2025

The nvidia/cuda:11.8.0-base image doesn't include cuBLAS (libcublas.so.11), which KataGo requires. Instead of using the larger 'runtime' image (~1.4GB), we now copy only the required libraries from the builder:

  • libcublas*.so (cuBLAS for matrix operations)
  • libcublasLt*.so (cuBLAS LT)
  • libcudnn*.so (cuDNN for neural network inference)

This reduces image size by ~300-500MB compared to using the runtime image.

Root cause: KataGo crashed with 'libcublas.so.11: cannot open shared object
file: No such file or directory'

The nvidia/cuda:11.8.0-base image doesn't include cuBLAS (libcublas.so.11),
which KataGo requires. Instead of using the larger 'runtime' image (~1.4GB),
we now copy only the required libraries from the builder:
- libcublas*.so (cuBLAS for matrix operations)
- libcublasLt*.so (cuBLAS LT)
- libcudnn*.so (cuDNN for neural network inference)

This reduces image size by ~300-500MB compared to using the runtime image.

Root cause: KataGo crashed with 'libcublas.so.11: cannot open shared object
file: No such file or directory'
@stubbi stubbi force-pushed the fix/missing-cublas branch from 2cad0f2 to e5d6487 Compare December 10, 2025 21:07
@stubbi stubbi merged commit ea465c6 into main Dec 10, 2025
4 checks passed
@stubbi stubbi deleted the fix/missing-cublas branch December 10, 2025 21:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants