Finetuning with weights in bfloat16 #100
Conversation
A few suggested changes
Co-authored-by: Luca Antiga <luca@lightning.ai>
Does the memory fit on a 3090 even in the LoRA case?
oh ahah, that's tight!
better check IMO
Attempted to test it but got held up by this problem: #101
The finetuning fits on the 3090: I had to install PyTorch nightly to get around issue #101, though. Perhaps we should hold off merging this? Or we could say this requires PyTorch nightly. Or we could investigate whether changing the implementation can avoid the error.
Great, I would merge and say that the 3090 requires nightly, then we'll investigate the complex issue again.
Amazing, let's merge!
Memory consumption is now ~20 GB, down from ~38 GB before.
The training iteration speed-up is roughly 1.5x-2x.
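For reference, a minimal sketch of the kind of setup these numbers describe: casting the model weights to bfloat16 so only the small LoRA adapter parameters remain trainable. The model class and the `lora_` parameter-name prefix below are illustrative assumptions, not the repository's actual API.

```python
# Sketch only: bf16 base weights + trainable LoRA adapters.
import torch

def prepare_for_bf16_lora(model: torch.nn.Module) -> torch.nn.Module:
    # Base weights in bfloat16: roughly 2 bytes per parameter instead of 4,
    # which is where most of the ~38 GB -> ~20 GB saving comes from.
    model = model.to(dtype=torch.bfloat16)
    for name, param in model.named_parameters():
        # Keep only the LoRA adapter weights trainable; freeze everything else.
        # "lora_" is an assumed naming convention for the adapter parameters.
        param.requires_grad = "lora_" in name
    return model

# Hypothetical usage:
# model = MyModel()                       # placeholder for the actual model class
# model = prepare_for_bf16_lora(model).cuda()
# optimizer = torch.optim.AdamW(
#     [p for p in model.parameters() if p.requires_grad], lr=3e-4
# )
```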