Add fsdp+qlora support #160

deep-diver · 2024-04-24T14:10:58Z

This PR adds FSDP+QLoRA support with the following changes:

add recipes/accelerate_configs/fsdp+qlora.yaml
update the dependency versions to peft>=0.9.0 and bitsandbytes>=0.43.0
add a bnb_4bit_quant_storage field to ModelArguments
set bnb_4bit_quant_storage in BitsAndBytesConfig
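
For readers who want to see how the last two changes fit together, here is a minimal sketch. It is not the code from this PR: the `ModelArguments` class below is a cut-down stand-in, the `"uint8"` default and the helper name `quantization_kwargs` are assumptions made for illustration, and only the field name `bnb_4bit_quant_storage` comes from the change list above.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class ModelArguments:
    """Illustrative subset of model arguments; not the handbook's full class."""

    load_in_4bit: bool = False
    # Storage dtype for the packed 4-bit weights. The default is an assumption.
    bnb_4bit_quant_storage: Optional[str] = "uint8"


def quantization_kwargs(args: ModelArguments) -> Optional[Dict[str, Any]]:
    """Return keyword arguments for a 4-bit quantization config.

    Returns None when 4-bit loading is disabled (hypothetical helper).
    """
    if not args.load_in_4bit:
        return None
    kwargs: Dict[str, Any] = {"load_in_4bit": True}
    if args.bnb_4bit_quant_storage is not None:
        # Forward the storage dtype so the quantized weights are stored in it.
        kwargs["bnb_4bit_quant_storage"] = args.bnb_4bit_quant_storage
    return kwargs
```

With transformers installed, the result could be passed on as `BitsAndBytesConfig(**quantization_kwargs(args))`, since that class accepts `bnb_4bit_quant_storage` as a string or a torch dtype. For FSDP the storage dtype is usually chosen to match the model's training dtype (for example `"bfloat16"`), but treat that as a setup-dependent choice rather than a rule stated in this PR.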

With these changes, I have confirmed FSDP+QLoRA works within my local setup (2 x A6000).
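
To reproduce a setup like this, it can help to confirm that the installed packages meet the version floors listed in the changes above. The sketch below is one way to do that with the standard library only. The helper names are hypothetical, and the version parsing is deliberately simple: it compares numeric release components and ignores pre-release suffixes.

```python
from importlib import metadata
from typing import Dict, Optional, Tuple

# Minimum versions taken from the PR description.
MIN_VERSIONS: Dict[str, str] = {"peft": "0.9.0", "bitsandbytes": "0.43.0"}


def parse_version(version: str) -> Tuple[int, ...]:
    """Parse leading numeric components, e.g. '0.43.1' -> (0, 43, 1)."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component such as 'dev0'
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed: str, minimum: str) -> bool:
    """Compare two versions after padding them to the same length with zeros."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def unmet_requirements() -> Dict[str, Optional[str]]:
    """Map each package that fails its floor to its installed version (None if missing)."""
    unmet: Dict[str, Optional[str]] = {}
    for name, minimum in MIN_VERSIONS.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            unmet[name] = None
            continue
        if not meets_minimum(installed, minimum):
            unmet[name] = installed
    return unmet
```

Calling `unmet_requirements()` returns an empty dict when both packages satisfy the floors.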

lewtun

Thanks a lot for the nice addition, @deep-diver! Would you like to add a small example on how to run FSDP + QLoRA here? https://github.com/huggingface/alignment-handbook/tree/main/scripts#fine-tuning

Let's also rename the accelerate config and then we can merge

lewtun · 2024-04-25T08:37:24Z

recipes/accelerate_configs/fsdp+qlora.yaml

Let's call this just fsdp.yaml since I think it will also work with full training

deep-diver · 2024-04-25

For FSDP+QLoRA, we need to set fsdp_cpu_ram_efficient_loading=true, fsdp_use_orig_params=false, and fsdp_offload_params=true (CPU offloading) in the Accelerate config, according to this doc.

Since this is a recipe, I think it would be handy to provide a separate YAML file for anyone who wants to use FSDP+QLoRA out of the box. WDYT?
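
As a rough illustration of the three settings named in this comment, the sketch below checks an already-loaded FSDP settings mapping against them. The function name and the assumption that the settings arrive as a plain dict with boolean values (for example, after parsing the YAML file's FSDP section) are illustrative and not part of this PR.

```python
from typing import Dict, List

# The three values quoted in the comment above.
REQUIRED_FSDP_SETTINGS: Dict[str, bool] = {
    "fsdp_cpu_ram_efficient_loading": True,
    "fsdp_use_orig_params": False,
    "fsdp_offload_params": True,
}


def find_mismatches(fsdp_config: Dict[str, object]) -> List[str]:
    """List every required setting that is missing or has a different value."""
    problems = []
    for key, expected in REQUIRED_FSDP_SETTINGS.items():
        actual = fsdp_config.get(key)
        # Identity check keeps this strict: the string "true" or the int 1 do not pass.
        if actual is not expected:
            problems.append(f"{key}: expected {expected}, found {actual}")
    return problems
```

An empty list means the mapping matches all three settings. A check like this is only a convenience for catching a config that was copied from a full fine-tuning recipe without adjustment.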

lewtun · 2024-05-01

Sounds great, can we then rename the file to fsdp_qlora.yaml please?

deep-diver · 2024-05-02

@lewtun

done! :)

deep-diver · 2024-04-25T13:52:50Z

@lewtun

Apart from the discussion about keeping or removing fsdp+qlora.yaml, I made an additional commit that adds an example to https://github.com/huggingface/alignment-handbook/tree/main/scripts#fine-tuning. Please take a look!

HuggingFaceDocBuilderDev · 2024-04-30T18:52:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

deep-diver · 2024-05-01T02:59:26Z

@lewtun

reminder

deep-diver · 2024-05-07T07:23:18Z

@lewtun

reminder. I addressed your comments :)

Add fsdp+qlora support (huggingface#160)

deep-diver added 4 commits April 24, 2024 22:51

update dependencies

de3bfd7

accelerate recipe for fsdp+qlora

52e23d1

add bnb_4bit_quant_storage arg

3fc44a8

update BitsAndBytesConfig to support bnb_4bit_quant_storage field

55aabbf

lewtun approved these changes Apr 25, 2024

add example on fsdp+qlora

f27bb78

lewtun and others added 2 commits May 1, 2024 15:59

Merge branch 'main' into add-fsdp+qlora

dd83399

rename accelerate recipe to fsdp_qlora.yaml

44c6ab9

lewtun merged commit 606d2e9 into huggingface:main May 8, 2024

Ritvik19 added a commit to Ritvik19/alignment-handbook that referenced this pull request May 19, 2024

Merge pull request #1 from huggingface/main

2892a54

Add fsdp+qlora support (huggingface#160)

