large language model training-3-stages+deployment

pzc163/llm3s-conatiner

 
 

Complete docs

Install envs

First install PyTorch 2.0 by following https://pytorch.org/get-started/locally/, then install the remaining dependencies:

pip install -r requirements.txt

deploy necessary settings

Train the SFT model

bash run.sh

Train the reward model

bash run-reward.sh

Train the RLHF model

bash run-rlhf.sh
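
The three scripts above can also be chained so that a failed stage stops the run. The following is a minimal sketch, not part of the repository: it assumes the stages are meant to run in the order listed (SFT, then reward, then RLHF) and that all three scripts live in the same directory. The function name `run_pipeline` is hypothetical.

```shell
#!/usr/bin/env bash
# Run the three training stages in order, stopping at the first failure.
# Assumption: run.sh, run-reward.sh and run-rlhf.sh sit in the directory
# passed as the first argument (default: the current directory).
run_pipeline() {
    local dir="${1:-.}"
    local script
    for script in run.sh run-reward.sh run-rlhf.sh; do
        # Fail early if a stage script is missing.
        if [ ! -f "$dir/$script" ]; then
            echo "missing script: $dir/$script" >&2
            return 1
        fi
        echo "==> $script"
        # Use a subshell so the caller's working directory is unchanged.
        ( cd "$dir" && bash "$script" ) || {
            echo "stage failed: $script" >&2
            return 1
        }
    done
    echo "all stages finished"
}
```

After sourcing or pasting the function, calling `run_pipeline` from the repository root runs all three stages. Calling `run_pipeline /path/to/repo` runs them from another location.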

Prepare data

SFT data

Refer to sft-data-construction.

Reward data and RLHF data

Refer to rlhf-ppo.
