
Problem with GPU allocation after updating to CTranslate2 4.0.0 #1628

Closed
carolinaxxxxx opened this issue Feb 22, 2024 · 1 comment

carolinaxxxxx commented Feb 22, 2024

When device_index = 1 (GPU 1) is set, GPU 0 is still allocated a small amount of memory (in my case about 263 MB) and shows signs of activity, although it should not. This is clearly caused by CTranslate2 4.0.0: after reverting to ctranslate2 3.24.0 the problem disappears.

[screenshot attachment: GPU memory usage]

The above example uses a Whisper model, but I also tried LLM models with the same result.

minhthuc2502 (Collaborator) commented

This is not a bug. Starting from CUDA 12, it seems that initializing a GPU takes more memory; the logic is the same as in previous versions. I have a small fix here to prevent initializing unused GPUs. Thank you for reporting it.
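Until such a fix lands, a common workaround is to hide the unused GPU from the process entirely by restricting CUDA device visibility before any CUDA context is created; this is standard CUDA behavior, not CTranslate2-specific. A minimal sketch, where the model path and the `ctranslate2.models.Whisper` call are illustrative assumptions:

```python
import os

# Hide every GPU except physical GPU 1 from this process. This must run
# before the first CUDA initialization, so GPU 0 is never touched at all.
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

# CUDA renumbers the visible devices, so the remaining GPU is now index 0.
# Hypothetical usage (requires ctranslate2 and a converted model on disk):
# import ctranslate2
# model = ctranslate2.models.Whisper("whisper-ct2", device="cuda", device_index=0)
```

Note that `CUDA_VISIBLE_DEVICES` must be set before the CUDA runtime is loaded; setting it after a model has already been created has no effect.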
