This repository has been archived by the owner on Oct 12, 2023. It is now read-only.
RuntimeError: Tensors must be contiguous when per_device_eval_batch_size > 1 and launched with deepspeed
#385
Labels: solved (This problem has already been solved.)
RuntimeError: Tensors must be contiguous occurs when per_device_eval_batch_size > 1.

Command:
deepspeed --include localhost:0,1,2,3,4,5,6,7 --master_port $MASTER_PORT src/train_bash.py \
    --stage sft \
    --model_name_or_path THUDM/chatglm2-6b \
    --checkpoint_dir ${CHECKPOINT} \
    --do_predict \
    --dataset dev_data \
    --overwrite_cache \
    --finetuning_type lora \
    --output_dir ${CHECKPOINT}/predict \
    --overwrite_cache \
    --per_device_eval_batch_size 4 \
    --max_source_length 1024 \
    --max_target_length 128 \
    --max_samples 1000 \
    --predict_with_generate \
    --plot_loss \
    --fp16
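
For readers unfamiliar with the error message, here is a minimal sketch of what "contiguous" means in PyTorch. It does not reproduce this issue or show the fix that was applied in the repository. It only assumes that the error comes from a distributed collective (such as a gather across ranks during prediction) receiving a tensor that is a strided view rather than a densely laid-out buffer. The tensor names are made up for illustration.

```python
import torch

# A freshly created tensor is stored row by row, so it is contiguous.
x = torch.arange(12).reshape(3, 4)

# Transposing returns a view with swapped strides. No data is moved,
# so the view is no longer laid out contiguously in memory.
y = x.t()

# .contiguous() copies the data into a new, densely laid-out buffer
# (it returns the tensor itself if it is already contiguous).
z = y.contiguous()

print(x.is_contiguous(), y.is_contiguous(), z.is_contiguous())
```

If a view like `y` is what ends up being passed to the collective, calling `.contiguous()` on it first is the usual way to satisfy the check. Whether that is the right place to patch here depends on where the tensor is produced, which the report does not show.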