We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
您好,请问在配置正常的情况下,跑一遍readme.md里的训练示例大概用多长时间? 希望有一个参照确认是否配置正确了
单卡 RTX3060 cuda 11.7
示例: python3 train_qlora.py --train_args_json chatGLM_6B_QLoRA.json --model_name_or_path THUDM/chatglm-6b --train_data_path data/train.jsonl --eval_data_path data/dev.jsonl --lora_rank 4 --lora_dropout 0.05 --compute_dtype fp32
The text was updated successfully, but these errors were encountered:
@hbj52152 3060的话29小时左右,3090在7~8小时左右
Sorry, something went wrong.
No branches or pull requests
您好,请问在配置正常的情况下,跑一遍readme.md里的训练示例大概用多长时间?
希望有一个参照确认是否配置正确了
单卡 RTX3060 cuda 11.7
示例:
python3 train_qlora.py
--train_args_json chatGLM_6B_QLoRA.json
--model_name_or_path THUDM/chatglm-6b
--train_data_path data/train.jsonl
--eval_data_path data/dev.jsonl
--lora_rank 4
--lora_dropout 0.05
--compute_dtype fp32
The text was updated successfully, but these errors were encountered: