
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 432.00 MiB (GPU 2; 23.65 GiB total capacity; 20.88 GiB already allocated; 259.56 MiB free; #49

Open
wallon-ai opened this issue Mar 16, 2023 · 6 comments
@wallon-ai


@satpalsr

You ran out of GPU memory. Please describe your setup in more detail (which GPUs you are using and what command you ran) so we can help resolve it.

@riatzukiza

It'd be really cool if the minimum requirements of the model (dataset size on disk, VRAM requirements) were listed in the README; that would save a lot of people some time.

@wallon-ai
Author

You ran out of GPU memory. Please describe your setup in more detail (which GPUs you are using and what command you ran) so we can help resolve it.

[screenshot attached]
batch_size=4
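
For reference (a generic PyTorch sketch, not OpenChatKit's actual training code, and the names below are illustrative): if activations are what pushes a run over the limit, one common way to keep the effective batch size at 4 while cutting per-step memory is to split each batch into micro-batches and accumulate gradients. Note that for a 20B-parameter model the weights and optimizer state, not activations, usually dominate, so this alone may not be enough.

```python
# Generic PyTorch sketch (illustrative, not OpenChatKit's training loop):
# split a batch into micro-batches and accumulate gradients so the effective
# batch size stays the same while per-step activation memory shrinks.
import torch

def train_step(model, optimizer, loss_fn, inputs, targets, micro_batch_size=1):
    optimizer.zero_grad()
    n = inputs.shape[0]
    for i in range(0, n, micro_batch_size):
        micro_in = inputs[i:i + micro_batch_size]
        micro_tgt = targets[i:i + micro_batch_size]
        out = model(micro_in)
        # scale so the accumulated gradients match a full-batch average
        loss = loss_fn(out, micro_tgt) * (micro_in.shape[0] / n)
        loss.backward()  # gradients accumulate across micro-batches
    optimizer.step()
```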

@amaliako assigned amaliako and LorrinWWW and unassigned amaliako Mar 17, 2023
@csris
Contributor

csris commented Mar 18, 2023

It'd be really cool if the minimum requirements of the model (dataset size on disk, VRAM requirements) were listed in the README; that would save a lot of people some time.

That's a great idea. I'll put up a PR soon to document this.
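
As a rough back-of-envelope until that's documented (my own estimate, not official numbers): the weights alone of a 20B-parameter model take roughly params × bytes-per-parameter, before any activations, KV cache, or optimizer state.

```python
# Rough estimate (not official requirements): memory for the weights alone of a
# 20B-parameter model at common precisions; activations, KV cache, and optimizer
# state come on top of this.
params = 20_000_000_000
for precision, bytes_per_param in [("fp32", 4), ("fp16/bf16", 2), ("int8", 1)]:
    print(f"{precision}: ~{params * bytes_per_param / 1e9:.0f} GB for weights")
# fp32: ~80 GB, fp16/bf16: ~40 GB, int8: ~20 GB -- which is why a single 16 GB
# or 24 GB card OOMs when the full model is moved onto one GPU.
```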

@musicfish1973

(OpenChatKit) root@aca2869c8358:~/OpenChatKit-main# python inference/bot.py
Loading /root/OpenChatKit-main/inference/../huggingface_models/GPT-NeoXT-Chat-Base-20B to cuda:0...
Traceback (most recent call last):
File "/root/OpenChatKit-main/inference/bot.py", line 185, in
main()
File "/root/OpenChatKit-main/inference/bot.py", line 181, in main
).cmdloop()
File "/root/anaconda3/envs/OpenChatKit/lib/python3.10/cmd.py", line 105, in cmdloop
self.preloop()
File "/root/OpenChatKit-main/inference/bot.py", line 64, in preloop
self._model = ChatModel(self._model_name_or_path, self._gpu_id)
File "/root/OpenChatKit-main/inference/bot.py", line 24, in init
self._model.to(device)
File "/root/anaconda3/envs/OpenChatKit/lib/python3.10/site-packages/torch/nn/modules/module.py", line 989, in to
return self._apply(convert)
File "/root/anaconda3/envs/OpenChatKit/lib/python3.10/site-packages/torch/nn/modules/module.py", line 641, in _apply
module._apply(fn)
File "/root/anaconda3/envs/OpenChatKit/lib/python3.10/site-packages/torch/nn/modules/module.py", line 641, in _apply
module._apply(fn)
File "/root/anaconda3/envs/OpenChatKit/lib/python3.10/site-packages/torch/nn/modules/module.py", line 641, in _apply
module._apply(fn)
[Previous line repeated 2 more times]
File "/root/anaconda3/envs/OpenChatKit/lib/python3.10/site-packages/torch/nn/modules/module.py", line 664, in _apply
param_applied = fn(param)
File "/root/anaconda3/envs/OpenChatKit/lib/python3.10/site-packages/torch/nn/modules/module.py", line 987, in convert
return t.to(device, dtype if t.is_floating_point() or t.is_complex() else None, non_blocking)
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 288.00 MiB (GPU 0; 14.56 GiB total capacity; 13.86 GiB already allocated; 90.44 MiB free; 13.88 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
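
For what it's worth, the allocator hint mentioned at the end of that error can be set through an environment variable; it only helps with fragmentation, though, and with only ~90 MiB free it likely won't make a 20B model fit on a 14.56 GiB card on its own. A sketch (the 128 MiB value is just an example, not a recommendation):

```python
# Sketch: set the PYTORCH_CUDA_ALLOC_CONF hint from the error message. It must
# be set before the first CUDA allocation (e.g. before importing torch, or
# exported in the shell before running bot.py).
import os
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

import torch  # imported after setting the variable so the caching allocator sees it

print(torch.cuda.is_available())
```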

@bohell

bohell commented Mar 20, 2023

Same problem. Any idea how much memory it needs, or any way to reduce the memory use? Thanks.
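
One common way to get the memory down (a sketch, not the stock inference/bot.py behavior; the Hugging Face model id and options below are my assumptions): load the checkpoint in fp16 and let accelerate place layers across the available GPUs and CPU instead of moving the whole model to a single cuda:0.

```python
# Sketch (not the stock inference/bot.py): load in fp16 and let `accelerate`
# shard/offload layers instead of calling .to("cuda:0") on the full 20B model.
# Requires `pip install accelerate`; the model id is assumed to be the Hub
# checkpoint the project publishes.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "togethercomputer/GPT-NeoXT-Chat-Base-20B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # ~40 GB of weights instead of ~80 GB in fp32
    device_map="auto",          # spread layers over GPUs, offload the rest to CPU
)
```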
