llama3-8b-base produces repetitive output after fine-tuning #3813

xxll88 · 2024-05-19T23:42:39Z

Reminder

I have read the README and searched the existing issues.

Reproduction

### model
model_name_or_path: /home/ubuntu/Meta-Llama-3-8B

### method
stage: sft
do_train: true
finetuning_type: freeze

template: llama3

### ddp
ddp_timeout: 180000000
deepspeed: examples/deepspeed/ds_z2_config.json

### dataset
dataset: alpaca_gpt4_zh ,alpaca_zh,firefly
cutoff_len: 1024

overwrite_cache: true
preprocessing_num_workers: 16

### output
output_dir: ../saves/llama3-8b/sft
save_total_limit: 2
logging_steps: 20
save_steps: 500
plot_loss: true
overwrite_output_dir: false

### train
per_device_train_batch_size: 16
gradient_accumulation_steps: 1
learning_rate: 0.00005
num_train_epochs: 2.0
lr_scheduler_type: cosine
warmup_steps: 0.1
bf16: true

### eval
val_size: 0.001
per_device_eval_batch_size: 1
evaluation_strategy: steps
eval_steps: 500

#!/bin/bash

NPROC_PER_NODE=2
NNODES=1
RANK=0
MASTER_ADDR=127.0.0.1
MASTER_PORT=29500

CUDA_VISIBLE_DEVICES=0,1 torchrun \
    --nproc_per_node $NPROC_PER_NODE \
    --nnodes $NNODES \
    --node_rank $RANK \
    --master_addr $MASTER_ADDR \
    --master_port $MASTER_PORT \
    src/train.py llama3_sft_freeze.yaml

Expected behavior

llamafactory-cli chat --model_name ~/saves/llama3-8b/sft --template llama3

System Info

No response

Others

No response

hiyouga · 2024-05-20T10:01:46Z

Do not use the llama3 template with a base model. Use:
template: default
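
Applied to the config in the report, this comes down to a one-line change. A minimal sketch, assuming every other key in llama3_sft_freeze.yaml stays as posted:

```yaml
# llama3_sft_freeze.yaml (only the changed key is shown)
# Meta-Llama-3-8B is a base (non-instruct) checkpoint, so use the
# generic template instead of the llama3 chat template.
template: default
```

The chat command would presumably need the same change (`--template default` in place of `--template llama3`), so that inference uses the template the model was trained with.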

hiyouga added the "solved" label (This problem has been already solved.) May 20, 2024

hiyouga closed this as completed May 20, 2024
