# Pathways Language Model (PaLM) based on PyTorch

A Colossal-AI implementation of Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance. We reproduced the model architecture and applied multiple optimization strategies, e.g. data parallelism, tensor parallelism and ZeRO, to scale the training to multiple GPUs with the help of Colossal-AI.

You are very welcome to contribute in any way that helps us enhance the usability of this project.

## Preparation

1. Install Colossal-AI, a PyTorch-based large-scale model training system that provides various efficient parallelization techniques.

   ```bash
   pip install colossalai
   ```

2. Use HuggingFace `datasets` to download the Wikitext-2 dataset (a sketch of the underlying download call appears after this list). The placeholder `/PATH/TO/DATA` is optional and defaults to `./wiki_dataset`.

   ```bash
   python ./tools/download_wiki.py -o </PATH/TO/DATA>
   ```

3. Download the tokenizer files with the following command. The placeholder `/PATH/TO/TOKENIZER/` is optional and defaults to `./token`.

   ```bash
   bash ./tools/download_token.py </PATH/TO/TOKENIZER/>
   ```
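
For reference, here is a minimal sketch of what the dataset download step presumably does with HuggingFace `datasets`. The dataset configuration name, the output path handling, and the function name are assumptions for illustration, not the actual contents of `./tools/download_wiki.py`.

```python
# Minimal sketch (assumption): fetch Wikitext-2 with HuggingFace datasets and
# save it to disk, roughly what ./tools/download_wiki.py is expected to do.
from datasets import load_dataset


def download_wiki(output_dir: str = "./wiki_dataset") -> None:
    # "wikitext-2-raw-v1" is the raw Wikitext-2 configuration on the Hub.
    dataset = load_dataset("wikitext", "wikitext-2-raw-v1")
    # Persist the train/validation/test splits so the training script can load them later.
    dataset.save_to_disk(output_dir)


if __name__ == "__main__":
    download_wiki()
```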

## Usage

1. Configure your settings in `CONFIG_FILE.py`, for example:

   ```python
   SEQ_LENGTH = 2048
   BATCH_SIZE = 8
   NUM_EPOCHS = 10

   parallel = dict(
       tensor=dict(mode='1d', size=2),
   )

   model = "palm_small"
   ```

   We have provided some sample configurations in `./configs`.

2. Run the training script:

   ```bash
   DATA=/PATH/TO/DATA/ TOKENIZER=/PATH/TO/TOKENIZER/ torchrun --nproc_per_node=NUM_GPUS train.py --from_torch --config CONFIG_FILE.py
   ```
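
The `--from_torch` flag indicates that the script should pick up the distributed environment created by `torchrun`. Below is a minimal sketch of how such an entry point typically bootstraps itself with Colossal-AI; the argument parsing and training-loop details are assumptions for illustration, not the actual contents of `train.py`.

```python
# Minimal sketch (assumption): a Colossal-AI training entry point launched via torchrun.
import argparse

import colossalai


def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--from_torch", action="store_true",
                        help="read rank/world size from the torchrun environment")
    parser.add_argument("--config", type=str, required=True,
                        help="path to the Colossal-AI config file, e.g. CONFIG_FILE.py")
    args = parser.parse_args()

    if args.from_torch:
        # Reads RANK, WORLD_SIZE, MASTER_ADDR and MASTER_PORT set by torchrun
        # and builds the parallel context from the config file.
        colossalai.launch_from_torch(config=args.config)

    # ... build the PaLM model, dataloaders and optimizer here, hand them to
    # colossalai.initialize(...), and run the training loop.


if __name__ == "__main__":
    main()
```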
