
Error when pretraining the llama-7b model with the seed_prompts_en.jsonl dataset from InstructionWild #4387

The command I ran was: torchrun --standalone --nproc_per_node=1 train_sft.py --pretrain "/home/llm-deploy/ColossalAI/llama-7b" --model 'llama' --strategy colossalai_zero2 --log_interval 10 --save_path /home/llm-deploy/ColossalAI/output/llama-7B --dataset /home/llm-deploy/ColossalAI/InstructionWild/data/seed_prompts_en.jsonl --batch_size 1 --lr 2e-5 --max_epochs 1

The error message is as follows:


The training data is contained in these two files.
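
Since the error output is not reproduced in this post, one way to rule out a data-format problem before rerunning the command is to check that every line of the `.jsonl` file parses as JSON and to see which keys the records carry. The snippet below is only a minimal sketch: `summarize_jsonl` is a hypothetical helper written for illustration, not part of ColossalAI or InstructionWild, and it makes no assumption about which field names `train_sft.py` expects.

```python
import json
from collections import Counter


def summarize_jsonl(path):
    """Parse a JSON Lines file; return unparsable lines and the key sets seen.

    Hypothetical helper for illustration only (not a ColossalAI API).
    """
    bad_lines = []        # (line number, parser message) for lines that fail
    key_sets = Counter()  # how many records share each set of keys
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # ignore blank lines
            try:
                record = json.loads(line)
            except json.JSONDecodeError as exc:
                bad_lines.append((lineno, str(exc)))
                continue
            if isinstance(record, dict):
                key_sets[tuple(sorted(record))] += 1
            else:
                # Non-object records are counted under their Python type name.
                key_sets[(type(record).__name__,)] += 1
    return bad_lines, key_sets
```

Calling `summarize_jsonl("/home/llm-deploy/ColossalAI/InstructionWild/data/seed_prompts_en.jsonl")` and printing both return values would show whether any line is malformed and whether all records share the same keys, which can then be compared against what the training script's dataset loader reads.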

Replies: 3 comments

Answer selected by 13416157913