[Accelerator] Fix issue with 8bit models #1155
Conversation
The documentation is not available anymore as the PR was closed or merged.
I think this PR is only necessary in case people want to design systems that use 8-bit models in their training loop without backpropagating on the 8-bit model (for example in RLHF), as using adapters already works out of the box right now.
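For illustration, a minimal sketch of the kind of setup described above; the model names, optimizer, and training details are assumptions made for the example, not taken from this PR:

```python
import torch
from accelerate import Accelerator
from transformers import AutoModelForCausalLM

accelerator = Accelerator()

# Frozen 8-bit model that is only queried (e.g. a reference model in RLHF);
# no gradients ever flow through it.
ref_model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-neo-125m", device_map={"": 0}, load_in_8bit=True
)
ref_model.eval()

# Regular (non-8-bit) trainable model that does receive gradients.
model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-neo-125m")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

# Preparing the frozen 8-bit model alongside the trainable one is the kind
# of setup this PR is meant to support.
model, ref_model, optimizer = accelerator.prepare(model, ref_model, optimizer)

# Inside the training loop, the 8-bit model would only be used under no_grad:
# with torch.no_grad():
#     ref_logits = ref_model(input_ids).logits
```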
Thanks for the fix!
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-neo-125m", device_map=device_map, load_in_8bit=True, llm_int8_enable_fp32_cpu_offload=True
)
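For context, `device_map` is defined elsewhere in the test; a plausible value for this snippet (an assumption, not the map actually used in the PR) would offload part of the model to CPU, which is why `llm_int8_enable_fp32_cpu_offload=True` is passed:

```python
# Hypothetical device_map: keep the transformer blocks on GPU 0 in 8-bit
# and offload the lm_head to CPU, where it stays in fp32.
device_map = {
    "transformer": 0,
    "lm_head": "cpu",
}
```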
@sgugger I think this will fail as the Docker image is not using the main branch of transformers, no?
happy to skip it until the next release of transformers
Yes, we are not installing from source.
Thanks again!
What does this PR do?
Fixes #1147
In theory it is not possible to fine-tune 8-bit models, except if you use adapters, which can only be used if a `PeftModel` is used for training (I also need to test the snippet below with a `PeftModel` to make sure this is relevant or not). EDIT: passing an 8-bit `PeftModel` through `accelerator.prepare` seems to work fine.

But in some systems you can use the accelerator to load an 8-bit model and use it outside the training scope (e.g. get the model's logits and use them in another model).

I am not sure if we should support 8-bit models using `Accelerator`, but if so, I propose the following changes in this PR. Happy also to revert the tests / `bnb` dependency.

To reproduce:
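(The original reproduction snippet is not included in this excerpt; the following is a rough sketch of what it might look like, with all names and values my own assumptions.)

```python
import torch
from accelerate import Accelerator
from transformers import AutoModelForCausalLM

accelerator = Accelerator()

# 8-bit model that transformers has already dispatched onto devices.
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-neo-125m", device_map={"": 0}, load_in_8bit=True
)

# Passing the already-quantized, already-dispatched model through prepare is
# the step that the linked issue reports as failing.
model = accelerator.prepare(model)

# Use the model purely for inference, outside of any training step.
inputs = torch.tensor([[0, 1, 2, 3]]).to(accelerator.device)
with torch.no_grad():
    logits = model(inputs).logits
```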
cc @sgugger
Ran all the slow tests and got errors on the DeepSpeed and FSDP tests, but I am not sure whether those failures are related to my PR.