
What can I do about this problem when training the model?? It looks like the GPU doesn't have enough memory. CUDA out of memory. Tried to allocate 122.00 MiB (GPU 0; 4.00 GiB total capacity; 3.15 GiB already allocated; 0 bytes free; 3.45 GiB reserved in total by PyTorch #669

Closed
pzhyyd opened this issue Jul 21, 2022 · 9 comments

Comments

pzhyyd commented Jul 21, 2022

Summary [one-sentence description of the problem]
What can I do about this problem when training the model?? It looks like the GPU doesn't have enough memory.
CUDA out of memory. Tried to allocate 122.00 MiB (GPU 0; 4.00 GiB total capacity; 3.15 GiB already allocated; 0 bytes free; 3.45 GiB reserved in total by PyTorch

Env & To Reproduce [environment and reproduction steps]
Python 3.9, NVIDIA GeForce GTX 1050 Ti (4 GB)

Screenshots [if any]
[screenshot]
[screenshot]

pzhyyd (Author) commented Jul 21, 2022

The GPU's status while training is running.
[screenshot]

pzhyyd (Author) commented Jul 21, 2022

[screenshot]

@VERT2022

Not enough GPU memory for training the model.
When training the synthesizer: reduce the batch_size parameter in synthesizer/hparams.py.

# Before the change
tts_schedule = [(2, 1e-3, 20_000, 12),   # Progressive training schedule
                (2, 5e-4, 40_000, 12),   # (r, lr, step, batch_size)
                (2, 2e-4, 80_000, 12),   #
                (2, 1e-4, 160_000, 12),  # r = reduction factor (# of mel frames
                (2, 3e-5, 320_000, 12),  # synthesized for each decoder iteration)
                (2, 1e-5, 640_000, 12)], # lr = learning rate

# After the change
tts_schedule = [(2, 1e-3, 20_000, 8),   # Progressive training schedule
                (2, 5e-4, 40_000, 8),   # (r, lr, step, batch_size)
                (2, 2e-4, 80_000, 8),   #
                (2, 1e-4, 160_000, 8),  # r = reduction factor (# of mel frames
                (2, 3e-5, 320_000, 8),  # synthesized for each decoder iteration)
                (2, 1e-5, 640_000, 8)], # lr = learning rate
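
A quick way to confirm that the smaller batch actually fits is to look at PyTorch's own memory counters. The snippet below is only an illustrative sketch, not code from this repository; it assumes a CUDA-enabled PyTorch build and the training process running on GPU 0.

import torch

# Compare these numbers with the "allocated" / "reserved" figures in the OOM message.
if torch.cuda.is_available():
    dev = torch.device("cuda:0")
    total = torch.cuda.get_device_properties(dev).total_memory
    allocated = torch.cuda.memory_allocated(dev)
    reserved = torch.cuda.memory_reserved(dev)
    print(f"total:     {total / 1024**3:.2f} GiB")
    print(f"allocated: {allocated / 1024**3:.2f} GiB")
    print(f"reserved:  {reserved / 1024**3:.2f} GiB")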

pzhyyd (Author) commented Jul 22, 2022

Is this the place I should change?

[screenshot]

Or is it here?
[screenshot]

pzhyyd (Author) commented Jul 22, 2022

[screenshot]
Is it okay to change it like this?

babysor (Owner) commented Jul 22, 2022

More or less, yes.

pzhyyd (Author) commented Jul 22, 2022

(quoting VERT2022's advice above: not enough GPU memory for training; reduce the batch_size parameter in the tts_schedule in synthesizer/hparams.py from 12 to 8)

Thanks for the guidance, training has started successfully.

pzhyyd closed this as completed Jul 22, 2022
@z1139147357

I've already reduced batch_size to 4 and I'm still getting the same problem, sigh...
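
If lowering batch_size alone is not enough, one thing worth trying (an assumption on my part, not something confirmed in this thread) is the allocator setting that PyTorch's own OOM message suggests when reserved memory is much larger than allocated memory: max_split_size_mb, which reduces fragmentation. A minimal sketch; the value 128 is only an example and should be tuned for the card:

import os

# Must be set before PyTorch initializes CUDA, so set it before importing torch.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

import torch  # the allocator reads the variable when CUDA is first used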

@by517840374

Did you maybe not install CUDA? I had the same problem before I installed it. Alternatively, try switching to a different PyTorch version.
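
To check whether a CUDA-enabled PyTorch build is installed at all, here is a short sketch (assuming it is run inside the same Python 3.9 environment used for training):

import torch

print(torch.__version__)          # PyTorch version
print(torch.version.cuda)         # CUDA version the wheel was built against; None means a CPU-only build
print(torch.cuda.is_available())  # True only if a usable GPU and driver are visible

If torch.cuda.is_available() returns False, the training scripts cannot use the GPU at all, and installing a CUDA-enabled PyTorch build that matches your driver is the thing to fix before touching batch_size.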
