Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

萌新请教,为何报错,需要13EB内存? #9005

Closed
1 task done
kiddkyd2 opened this issue May 26, 2024 · 5 comments
Closed
1 task done

萌新请教,为何报错,需要13EB内存? #9005

kiddkyd2 opened this issue May 26, 2024 · 5 comments
Assignees

Comments

@kiddkyd2
Copy link

kiddkyd2 commented May 26, 2024

问题确认 Search before asking

  • 我已经搜索过问题,但是没有找到解答。I have searched the question and found no related answer.

请提出你的问题 Please ask your question

数据集是我自己拉框的40几张图片。已经导出coco格式的json。

按照网上教程,执行了这个命令: python tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_coco.yml --eval

报错:
Out of memory error on GPU 0. Cannot allocate 13.906031EB memory on GPU 0, 5.367256GB memory has been allocated and available memory is only 647.804688MB.

这个内存要求13EB也太高了吧?我显卡是GTX 1060,6G显存。支持CUDA 12.2,安装的CUDA Tookit 12.2.0。

有什么办法可以简单的跑一下图像分类吗?想用yolo的,但是yolov3,yolov8,都不行,要么说内存不够,要么说CUDA没有符合的内核。

@lyuwenyu
Copy link
Collaborator

你卡的显存多大,可以把data readerconfig里的batch size改小点

@kiddkyd2
Copy link
Author

你卡的显存多大,可以把data readerconfig里的batch size改小点

把worker_num: 1 和batch_size: 1后,还是提示要196.187500TB memory

@lyuwenyu
Copy link
Collaborator

lyuwenyu commented May 27, 2024

改啥了没? 除了上面的其他的是默认的配置嘛

@kiddkyd2
Copy link
Author

改啥了没? 除了上面的其他的是默认的配置嘛

萌新哪敢乱改啊。。。。就是改了指向训练集的yml配置路径而已。。。话说,你们什么配置,都没遇到这个问题吗?我跑ultralytics框架,用batch=1,workers=0都可以成功跑完训练。

@lyuwenyu
Copy link
Collaborator

没遇到过这个问题,刚测了下 可以跑的..

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants