Training command for train_sft.py:

```shell
CUDA_VISIBLE_DEVICES=0 python src/train_sft.py \
    --model_name_or_path /data1/projects/baichuan-7B/ \
    --do_train \
    --dataset alpaca_gpt4_zh \
    --finetuning_type lora \
    --output_dir output \
    --overwrite_cache \
    --per_device_train_batch_size 4 \
    --gradient_accumulation_steps 4 \
    --lr_scheduler_type cosine \
    --logging_steps 10 \
    --save_steps 1000 \
    --learning_rate 5e-5 \
    --num_train_epochs 3.0 \
    --plot_loss \
    --fp16
```

Training fails with:

```
ValueError: Target modules ['q_proj', 'v_proj'] not found in the base model. Please check the target modules and try again.
```

Does anyone know how to fix this? Thanks!
Add the argument: `--lora_target W_pack`
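With that flag added, the command from the issue would look like the following (a sketch; all other arguments and paths are kept exactly as in the original report):

```shell
CUDA_VISIBLE_DEVICES=0 python src/train_sft.py \
    --model_name_or_path /data1/projects/baichuan-7B/ \
    --do_train \
    --dataset alpaca_gpt4_zh \
    --finetuning_type lora \
    --lora_target W_pack \
    --output_dir output \
    --overwrite_cache \
    --per_device_train_batch_size 4 \
    --gradient_accumulation_steps 4 \
    --lr_scheduler_type cosine \
    --logging_steps 10 \
    --save_steps 1000 \
    --learning_rate 5e-5 \
    --num_train_epochs 3.0 \
    --plot_loss \
    --fp16
```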
Baichuan's attention concatenates (Wq, Wk, Wv) into a single `W_pack` projection, so the default LoRA targets `q_proj`/`v_proj` don't exist in the model.
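In general, you can discover valid `--lora_target` names by listing the `nn.Linear` submodules of the base model. A minimal sketch below uses a toy module that mimics Baichuan's packed attention (the `ToyAttention` class and hidden size are made up for illustration); with the real model, you would iterate over `AutoModelForCausalLM.from_pretrained(...).named_modules()` instead:

```python
import torch.nn as nn

class ToyAttention(nn.Module):
    """Toy stand-in for Baichuan's attention: q/k/v packed into one W_pack."""
    def __init__(self, hidden: int = 32):
        super().__init__()
        self.W_pack = nn.Linear(hidden, 3 * hidden, bias=False)  # packed (Wq, Wk, Wv)
        self.o_proj = nn.Linear(hidden, hidden, bias=False)       # output projection

# Stand-in for the loaded base model.
model = nn.ModuleDict({"layers": nn.ModuleList([ToyAttention() for _ in range(2)])})

# Collect the leaf names of all Linear layers -- these are the strings
# that LoRA target_modules are matched against.
linear_names = sorted({name.split(".")[-1]
                       for name, mod in model.named_modules()
                       if isinstance(mod, nn.Linear)})
print(linear_names)  # ['W_pack', 'o_proj']
```

Here `q_proj` and `v_proj` never appear, which is exactly why the error is raised; `W_pack` does, so it is the name to pass to `--lora_target`.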