Qwen2-VL-7B的微调out of memory #1860

KirbytroNic0528 · 2024-08-30T02:32:38Z

CUDA_VISIBLE_DEVICES=3,4 NPROC_PER_NODE=2 swift sft
--model_type qwen2-vl-7b-instruct
--model_id_or_path qwen/Qwen2-VL-7B-Instruct
--sft_type lora
--dataset dataset.json
torch版本2.4.0
cuda12.2
设备8*A40
使用的是这个命令，其中一张显卡的显存会无限增加直到out of memory
/

Jintao-Huang · 2024-08-31T08:53:04Z

参考这里：https://swift.readthedocs.io/zh-cn/latest/Multi-Modal/qwen2-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.html#ocr

Jintao-Huang · 2024-08-31T08:53:51Z

You can save memory by reducing SIZE_FACTOR=8 and MAX_PIXELS=602112.

Jade0321 · 2024-09-03T07:12:45Z

用8张A100跑，batch_size=1，也会out of memory，没找到SIZE_FACTOR=8 and MAX_PIXELS=602112.

Jintao-Huang · 2024-09-03T07:24:45Z

#1859

This was referenced Aug 31, 2024

CUDA error: too many resources requested for launch (V100, qwen2-vl) #1867

Open

🎉Support for finetuning of Qwen2-VL-Chat series models #1857

Open

tastelikefeet closed this as completed Sep 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen2-VL-7B的微调out of memory #1860

Qwen2-VL-7B的微调out of memory #1860

KirbytroNic0528 commented Aug 30, 2024

Jintao-Huang commented Aug 31, 2024

Jintao-Huang commented Aug 31, 2024

Jade0321 commented Sep 3, 2024

Jintao-Huang commented Sep 3, 2024

Qwen2-VL-7B的微调out of memory #1860

Qwen2-VL-7B的微调out of memory #1860

Comments

KirbytroNic0528 commented Aug 30, 2024

Jintao-Huang commented Aug 31, 2024

Jintao-Huang commented Aug 31, 2024

Jade0321 commented Sep 3, 2024

Jintao-Huang commented Sep 3, 2024