
ImportError: Using bitsandbytes 8-bit quantization requires Accelerate: pip install accelerate and the latest version of bitsandbytes: pip install -i https://pypi.org/simple/ bitsandbytes #30887

AnandUgale opened this issue May 18, 2024 · 5 comments

Comments

@AnandUgale

System Info

Packages installed with CUDA 11.8:

torch - 2.3.0+cu118
llama-index - 0.10.37
llama-index-llms-huggingface - 0.2.0
transformers - 4.39.0
accelerate - 0.27.0
bitsandbytes - 0.43.1

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

import torch
from llama_index.llms.huggingface import HuggingFaceLLM

# Optional quantization to 4bit
from transformers import BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
)

# hf_token and stopping_ids are assumed to be defined earlier in the script
llm = HuggingFaceLLM(
    model_name="meta-llama/Meta-Llama-3-8B-Instruct",
    model_kwargs={
        "token": hf_token,
        "torch_dtype": torch.bfloat16,  # comment this line and uncomment below to use 4bit
        # "quantization_config": quantization_config,
    },
    generate_kwargs={
        "do_sample": True,
        "temperature": 0.6,
        "top_p": 0.9,
    },
    tokenizer_name="meta-llama/Meta-Llama-3-8B-Instruct",
    tokenizer_kwargs={"token": hf_token},
    stopping_ids=stopping_ids,
)

Expected behavior

Able to load and run the LLM model without the bitsandbytes/accelerate ImportError.

@amyeroberts
Collaborator

@AnandUgale Have you tried installing accelerate as per the error message?

@RuABraun

I have the same issue. Accelerate is installed. This happens while trying to run inference after successfully training with bitsandbytes.

@RuABraun

The issue seems to be that is_bitsandbytes_available() in import_utils.py returns False when no CUDA device is available. So one should simply not use the 4/8-bit options at all when the device is CPU, which to be fair makes sense.
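A minimal workaround sketch (my own assumption of how to guard this, using the standard torch and transformers APIs; the config values mirror the reproduction above) that only builds the 4-bit config when a CUDA device is present:

import torch
from transformers import BitsAndBytesConfig

# Only request bitsandbytes quantization when a CUDA device is available;
# on CPU-only machines fall back to an unquantized dtype instead.
if torch.cuda.is_available():
    model_kwargs = {
        "quantization_config": BitsAndBytesConfig(
            load_in_4bit=True,
            bnb_4bit_compute_dtype=torch.float16,
            bnb_4bit_quant_type="nf4",
            bnb_4bit_use_double_quant=True,
        )
    }
else:
    model_kwargs = {"torch_dtype": torch.float32}

Passing this model_kwargs dict to HuggingFaceLLM avoids touching bitsandbytes at all on a CPU-only machine.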

@amyeroberts
Collaborator

@RuABraun Yes, CUDA is required for using bitsandbytes.

cc @younesbelkada - maybe we can update the warning to make things clearer?

@younesbelkada
Contributor

Hi!
1872bde should be included in the latest transformers, so whenever you don't have access to a GPU it should error out with a clearer error message (I see you are using transformers==4.39.0).
I will also enhance the error message to point users to install bitsandbytes with the simpler command pip install -U bitsandbytes.
