From 99e6c3de2140c42bdb49d87a2f722e141e58e19e Mon Sep 17 00:00:00 2001
From: Ammar Ahmad Awan
Date: Tue, 11 Apr 2023 10:35:04 -0700
Subject: [PATCH] add run command for 13b

---
 blogs/deepspeed-chat/README.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/blogs/deepspeed-chat/README.md b/blogs/deepspeed-chat/README.md
index 6ea52227d1c4..bfa7672d4030 100644
--- a/blogs/deepspeed-chat/README.md
+++ b/blogs/deepspeed-chat/README.md
@@ -89,6 +89,8 @@ pip install .
 git clone https://github.com/microsoft/DeepSpeedExamples.git
 cd DeepSpeedExamples/applications/DeepSpeed-Chat/
 pip install -r requirements.txt
+
+python train.py --actor-model facebook/opt-13b --reward-model facebook/opt-350m --num-gpus 8
 ```
 
 In about half a day, your 13B model would be fully trained with its checkpoints ready. The following table demonstrates a breakdown of the training time for each of the three steps: