
Support DDP #81

Merged: 7 commits merged into main from ddp-support on Apr 2, 2023
Conversation

awaelchli (Contributor)

Fixes #80

Note: while we can enable DDP here, training/finetuning the model with DDP won't work on most systems, as the model, optimizer, and gradients simply can't fit in memory. This PR is just for correctness and because of the question that popped up in #80.
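
For readers coming from #80, here is a minimal sketch of what enabling DDP through Lightning Fabric's strategy argument looks like. This is not the PR's diff: the stand-in model, device count, learning rate, and precision below are placeholders for illustration only.

```python
# Minimal sketch (not this PR's diff): enabling DDP via Lightning Fabric.
# The stand-in model, device count, and hyperparameters are placeholders.
import torch
import lightning as L


def main(devices: int = 4):
    # strategy="ddp" launches one process per GPU and keeps a full replica of
    # the model, gradients, and optimizer state on every device -- which is why
    # finetuning LLaMA this way runs out of memory on most systems.
    fabric = L.Fabric(accelerator="cuda", devices=devices, strategy="ddp", precision="bf16-mixed")
    fabric.launch()

    model = torch.nn.Linear(4096, 4096)  # stand-in for the LLaMA model
    optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
    model, optimizer = fabric.setup(model, optimizer)

    x = torch.randn(8, 4096, device=fabric.device)
    loss = model(x).sum()
    fabric.backward(loss)  # all-reduces gradients across ranks
    optimizer.step()
    optimizer.zero_grad()


if __name__ == "__main__":
    main()
```

Because DDP replicates everything per device, this only makes the multi-GPU code path correct; it does not reduce per-device memory.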

Review threads: lit_llama/model.py (outdated, resolved), finetune.py (resolved)
lantiga (Collaborator) commented on Apr 2, 2023:

I think the original request was referring to multi-GPU, be it DDP, FSDP, or DeepSpeed.

I'm in favor of merging this after the RoPE fix, but I'd probably avoid showing DDP since it can't possibly work, and rather make sure FSDP works instead.

WDYT?
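
For reference, a rough sketch of the FSDP alternative suggested above, assuming Fabric's FSDPStrategy and PyTorch's transformer auto-wrap policy; the device count, precision, and the choice to wrap at lit_llama's Block are illustrative assumptions, not code from this PR.

```python
# Rough sketch of the FSDP alternative (assumptions, not this PR's code):
# shard parameters, gradients, and optimizer state across GPUs instead of
# replicating them as DDP does.
from functools import partial

import lightning as L
from lightning.fabric.strategies import FSDPStrategy
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy

from lit_llama.model import Block  # wrap/shard at the transformer-block level

auto_wrap_policy = partial(transformer_auto_wrap_policy, transformer_layer_cls={Block})
fabric = L.Fabric(
    accelerator="cuda",
    devices=4,
    strategy=FSDPStrategy(auto_wrap_policy=auto_wrap_policy),
    precision="bf16-mixed",
)
fabric.launch()
# model/optimizer setup then proceeds as usual via fabric.setup(...)
```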

lantiga merged commit 2aef01d into main on Apr 2, 2023.
lantiga deleted the ddp-support branch on Apr 2, 2023 at 19:19.
timothylimyl referenced this pull request in timothylimyl/lit-llama-qa May 21, 2023
gkroiz added a commit to gkroiz/lit-llama that referenced this pull request May 22, 2023
Development
Successfully merging this pull request may close the issue: How to finetune with the multi-GPU

2 participants