
Question about an OOM problem #52

Closed
beeper00 opened this issue Oct 15, 2021 · 2 comments

Comments

@beeper00

When training the model on my own data, some sentences are quite long and OOM errors occur easily, so I added a limit in the code capping sentence length at 15 (on an 11 GB GPU) before training would run normally. The default batch size is 2048; when I change this number, the GPU memory actually used does not change, so it seems to have no effect. If I don't want to limit sentence length, which parameters or code should I change to resolve the OOM?
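As a point of reference, the length cap described above can be done either by truncating long sentences or by dropping them. The sketch below is a hypothetical, framework-agnostic helper (`cap_sentences` is not part of this repo); truncation keeps every example at the cost of cutting off part of the sentence, while filtering discards long sentences entirely.

```python
# Hypothetical sketch: cap sentence length to reduce peak GPU memory.
MAX_LEN = 15  # the cap the issue author used on an 11 GB GPU


def cap_sentences(sentences, max_len=MAX_LEN, truncate=True):
    """Return sentences no longer than max_len tokens.

    With truncate=True, long sentences are cut down instead of dropped,
    so no training example is lost (at the cost of a partial sentence);
    with truncate=False, long sentences are discarded entirely.
    """
    capped = []
    for tokens in sentences:
        if len(tokens) <= max_len:
            capped.append(tokens)
        elif truncate:
            capped.append(tokens[:max_len])
        # else: drop the over-long sentence
    return capped


data = [["a"] * 10, ["b"] * 20, ["c"] * 15]
print([len(s) for s in cap_sentences(data)])                   # [10, 15, 15]
print([len(s) for s in cap_sentences(data, truncate=False)])   # [10, 15]
```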

@zhangyimi
Collaborator

Which model are you using? I'd suggest using the ernie-lstm or transformer model, and trying batch_size=300 first.
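One possible reason changing batch_size appeared to have no effect (this is an assumption, not confirmed by the thread): many dependency parsers count batch_size in *tokens* rather than sentences, so the padded memory per batch stays roughly constant and peak memory is instead driven by the single longest sentence. The hypothetical sketch below (`token_batches` is an illustrative helper, not an API of this repo) shows token-budget batching, where each batch's padded size (sentences × longest sentence) is kept under a budget:

```python
# Hypothetical sketch: token-budget batching. Sentences are sorted by
# length so each batch pads to a similar length, and a batch is flushed
# once its padded size (count * longest sentence) would exceed the budget.


def token_batches(sentences, token_budget=300):
    """Group sentences so each batch's padded size stays <= token_budget.

    Padded size = number of sentences in the batch * length of the
    longest sentence in it, i.e. what actually occupies GPU memory.
    """
    batches, batch, max_len = [], [], 0
    for tokens in sorted(sentences, key=len):  # sorting reduces padding waste
        new_max = max(max_len, len(tokens))
        if batch and (len(batch) + 1) * new_max > token_budget:
            batches.append(batch)            # flush the current batch
            batch, max_len = [tokens], len(tokens)
        else:
            batch.append(tokens)
            max_len = new_max
    if batch:
        batches.append(batch)
    return batches


data = [["w"] * n for n in (5, 40, 12, 90, 7, 33)]
for b in token_batches(data, token_budget=100):
    print(len(b), max(len(s) for s in b))  # batch size, padded length
```

Under this scheme, lowering the token budget (here, the suggested 300 instead of 2048) directly shrinks every batch's padded footprint, which is why it can fix the OOM even without capping sentence length.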

@beeper00
Author

beeper00 commented Oct 15, 2021 via email
