Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

推理阶段,预测文件中label显示不全问题 #3775

Open
1 task done
jy-101361-1810897 opened this issue May 16, 2024 · 1 comment
Open
1 task done

推理阶段,预测文件中label显示不全问题 #3775

jy-101361-1810897 opened this issue May 16, 2024 · 1 comment
Labels
bug Something isn't working pending This problem is yet to be addressed.

Comments

@jy-101361-1810897
Copy link

Reminder

  • I have read the README and searched the existing issues.

Reproduction

CUDA_VISIBLE_DEVICES=0 python src/train.py
--stage sft
--model_name_or_path $base_model_dir
--do_train
--dataset juben_train
--dataset_dir $dataset_dir
--template baichuan2
--cutoff_len 512
--finetuning_type lora
--lora_target W_pack
--output_dir $output_dir
--overwrite_output_dir
--overwrite_cache
--per_device_train_batch_size 16
--gradient_accumulation_steps 1
--lr_scheduler_type cosine
--bf16
--double_quant
--quantization_bit 4
--quantization_type nf4
--warmup_ratio 0.05
--logging_steps 10
--save_steps 50
--learning_rate 1e-4
--num_train_epochs 10.0
--plot_loss

CUDA_VISIBLE_DEVICES=0 python src/train.py
--stage sft
--do_predict
--model_name_or_path $base_model_dir
--adapter_name_or_path $output_dir
--dataset_dir $dataset_dir
--dataset juben_test
--template baichuan2
--finetuning_type lora
--output_dir $predict_dir
--per_device_eval_batch_size 1
--max_samples 1000
--predict_with_generate
--max_length 1024
--generation_max_length 1024
--fp16

Expected behavior

模型微调后用于推理,发现生成的generated_predictions_0.jsonl文件中的label这个键的值不全问题

System Info

No response

Others

No response

@jy-101361-1810897 jy-101361-1810897 changed the title 推理阶段,生成文本长度限制问题 推理阶段,预测文件中label显示不全问题 May 16, 2024
@maiqingqiang
Copy link

我也遇到了,我临时用datasets的output代替label了

@hiyouga hiyouga added bug Something isn't working pending This problem is yet to be addressed. labels May 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working pending This problem is yet to be addressed.
Projects
None yet
Development

No branches or pull requests

3 participants