A model and tokenizer converted to the Hugging Face format are required.
pip install -r requirements.txt
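As a quick sanity check, the converted checkpoint and tokenizer should load with the standard transformers API. A minimal sketch, where the path is a placeholder and not part of this repo:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder path; point this at your converted Hugging Face checkpoint.
model_path = "/path/to/converted_hf_model"

# If both calls succeed, the checkpoint and tokenizer are in a format train.py can consume.
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path)

print(f"loaded {type(model).__name__} with {model.num_parameters():,} parameters")
```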
#!/bin/bash
python train.py \
--model-name-or-path [YOUR_PATH_TO_MODEL] \
--batch-size 1024 \
--lr 1e-4 \
--use-pipeline \
--split-layers 15 \
--num-micro-batches 32 \
--bfloat16 \
--block-size 2048 \
--distribute-parameter
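In the script above, --split-layers 15 cuts the transformer into two pipeline stages at layer 15. A minimal sketch of how such split points map layer indices to stages; the boundary convention (whether a split index ends a stage or starts the next one) and the 32-layer count are assumptions for illustration, not taken from train.py:

```python
def partition_layers(num_layers, split_layers):
    """Group layer indices into pipeline stages.

    Assumption: each value in split_layers is the last layer of its stage;
    train.py's actual boundary convention may differ.
    """
    stages, start = [], 0
    for split in split_layers:
        stages.append(list(range(start, split + 1)))
        start = split + 1
    stages.append(list(range(start, num_layers)))
    return stages

# --split-layers 15 on an illustrative 32-layer model -> two stages of 16 layers.
for rank, layers in enumerate(partition_layers(32, [15])):
    print(f"stage {rank}: layers {layers[0]}..{layers[-1]} ({len(layers)} layers)")
```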
#!/bin/bash
python train.py \
--model-name-or-path [YOUR_PATH_TO_MODEL] \
--batch-size 1024 \
--lr 1e-4 \
--use-pipeline \
--split-layers 9 19 29 \
--num-micro-batches 64 \
--bfloat16 \
--block-size 4096 \
--distribute-parameter \
--enable-activation-recomputation
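--enable-activation-recomputation trades extra forward compute for memory: instead of keeping every block's activations for the backward pass, they are recomputed on the fly, which is what lets the larger --block-size 4096 fit. A minimal sketch of the general technique using PyTorch's built-in checkpoint utility; how train.py wires this into the pipeline stages is not shown here:

```python
import torch
from torch.utils.checkpoint import checkpoint

class Block(torch.nn.Module):
    # Stand-in for a transformer block; the real blocks come from the HF model.
    def __init__(self, dim=16):
        super().__init__()
        self.ff = torch.nn.Linear(dim, dim)

    def forward(self, x):
        return torch.relu(self.ff(x))

blocks = torch.nn.ModuleList(Block() for _ in range(4))
x = torch.randn(2, 16, requires_grad=True)

# Activations inside each block are recomputed during backward instead of being stored.
for block in blocks:
    x = checkpoint(block, x, use_reentrant=False)

x.sum().backward()
```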
#!/bin/bash
python train.py \
--model-name-or-path [YOUR_PATH_TO_MODEL] \
--batch-size 1024 \
--lr 1e-4 \
--use-pipeline \
--split-layers 9 19 29 39 49 \
--num-micro-batches 32 \
--bfloat16 \
--block-size 2048 \
--distribute-parameter \
--enable-activation-recomputation
#!/bin/bash
python train.py \
--model-name-or-path [YOUR_PATH_TO_MODEL] \
--batch-size 1024 \
--lr 1e-4 \
--use-pipeline \
--split-layers 6 13 20 27 34 41 48 55 61 67 73 \
--num-micro-batches 32 \
--bfloat16 \
--block-size 2048 \
--distribute-parameter \
--enable-activation-recomputation
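In the script above, --batch-size 1024 with --num-micro-batches 32 means each optimizer step consumes 32 micro-batches of 32 examples each (1024 / 32), which the pipeline schedule streams through the stages. A minimal sketch of the splitting and accumulation arithmetic, shown as plain gradient accumulation on a toy model and ignoring the pipeline schedule:

```python
import torch

batch_size, num_micro_batches = 1024, 32
micro_batch_size = batch_size // num_micro_batches  # 32 examples per micro-batch

batch = torch.randn(batch_size, 8)            # toy batch; real inputs are token ids
micro_batches = batch.split(micro_batch_size)

model = torch.nn.Linear(8, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-4)

optimizer.zero_grad()
for mb in micro_batches:
    loss = model(mb).mean() / num_micro_batches  # scale so gradients match the full batch
    loss.backward()                              # gradients accumulate across micro-batches
optimizer.step()                                 # one optimizer step per full batch
```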
| Parameter | Type | Default | Description |
|---|---|---|---|
| --model-name-or-path | str | | model name or path |
| --batch-size | int | 8 | number of examples for each training iteration |
| --lr | float | 0.00001 | learning rate |
| --bfloat16 | bool | false | whether to use bfloat16 |
| --distribute-parameter | bool | false | whether to distribute fp32 master parameters |
| --num-micro-batches | int | 1 | split each batch into N micro-batches |
| --log-interval | int | 10 | logging interval |
| --save-model-dir | str | ./ | path to save the model at the end of training |
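The table above implies roughly the following command-line surface; a minimal argparse sketch consistent with the listed types and defaults (the real parser lives in train.py and may differ, for example in how the boolean flags are implemented):

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--model-name-or-path", type=str, help="model name or path")
parser.add_argument("--batch-size", type=int, default=8,
                    help="number of examples for each training iteration")
parser.add_argument("--lr", type=float, default=1e-5, help="learning rate")
parser.add_argument("--bfloat16", action="store_true", help="use bfloat16")
parser.add_argument("--distribute-parameter", action="store_true",
                    help="distribute fp32 master parameters")
parser.add_argument("--num-micro-batches", type=int, default=1,
                    help="split each batch into N micro-batches")
parser.add_argument("--log-interval", type=int, default=10, help="logging interval")
parser.add_argument("--save-model-dir", type=str, default="./",
                    help="path to save the model at the end of training")

args = parser.parse_args()
```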