AssertionError #486
TheBlokeAI/Mixtral-tiny-GPTQ is meant only for testing AutoGPTQ and does not have the modules set in the config that transformers needs to load it. If you want to load it, you need to use AutoGPTQ directly. You also need the latest transformers and optimum if you want to load the full model with transformers.
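For reference, loading the test model with AutoGPTQ directly looks roughly like this (a minimal sketch, assuming an installed auto-gptq and a CUDA device; the model ID is the one from this thread):

```python
from auto_gptq import AutoGPTQForCausalLM

# Load the quantized test model through AutoGPTQ itself, bypassing the
# transformers/optimum loading path that this issue is about.
model = AutoGPTQForCausalLM.from_quantized(
    "TheBlokeAI/Mixtral-tiny-GPTQ",
    device="cuda:0",
)
```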
Code used to load the model:
Full list of packages:
Error:
Make sure you have the updated config: https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ/blob/main/config.json
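One quick way to see which config transformers is actually picking up is to print the quantization section and compare it against that file (a minimal sketch using the standard transformers API):

```python
from transformers import AutoConfig

# Loads config.json from the local cache (or the Hub) exactly as
# from_pretrained would see it.
config = AutoConfig.from_pretrained("TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ")

# Compare this against the quantization_config in the file linked above.
print(config.quantization_config)
```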
Yes, the config file downloaded on my machine (see below) matches the details of the config you've mentioned:
Updated the transformers library; it works now... Thanks for your input on this @LaaZa 👍, really appreciate it.
Oh, I thought that was in 4.36.1, but it seems it just barely missed that version.
@virentakia Can you share the fix? I got the same error and can't make it work.
@luisfrentzen-cc - I downloaded and installed the latest dev version of transformers (4.37? Not released yet, I guess).
4.36.* is the latest release (and that does not seem to work):
pip install -U git+https://github.com/huggingface/transformers.git
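After installing, it's worth confirming that the dev build is the one actually being imported (a small check, assuming a standard Python environment):

```python
# Verify the installed versions; the dev build should report 4.37.0.dev0 or later.
import importlib.metadata as metadata

print(metadata.version("transformers"))
print(metadata.version("optimum"))  # optimum is also required, as noted earlier in the thread
```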
Well, I got the same error and updated transformers to 4.37.0.dev0, but it still fails.
@MarseusFu I'm not sure how you would get the same error, but you need optimum.
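If optimum is missing, installing or upgrading it is a one-liner (assuming a standard pip setup):

pip install -U optimum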
@MarseusFu Try to run:
@cckuailong Still got the AssertionError :(
@MarseusFu
@cckuailong Yes, I installed both of them and still got the AssertionError.
@MarseusFu Any solution to the problem? I'm facing the same issue with transformers and optimum from pip, and auto-gptq compiled from source: https://github.com/AutoGPTQ/AutoGPTQ
@paolovic Sadly, no. I gave up.
This should not be closed. Quantizing Mixtral with AutoGPTQ writes out a config that AutoGPTQ itself is not compatible with. I can confirm that it does work if you manually add
AutoGPTQ does not use that config; it's for transformers and optimum. For AutoGPTQ, the modules are defined in code, not in a config. This is likely not an AutoGPTQ issue but an issue with optimum.
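If the fix being referred to is the `modules_in_block_to_quantize` entry that appears in the updated config linked earlier (an assumption on my part), patching a locally saved config would look roughly like this; the actual module list must be copied from that config, not from this sketch:

```python
import json

# A sketch, not the verified fix: add the per-block module list to a local
# config.json so the transformers/optimum loading path knows which sublayers
# of each decoder block were quantized.
with open("config.json") as f:
    cfg = json.load(f)

cfg["quantization_config"]["modules_in_block_to_quantize"] = [
    ["self_attn.q_proj", "self_attn.k_proj", "self_attn.v_proj"],  # illustrative
    ["self_attn.o_proj"],                                          # illustrative
    # ...remaining groups exactly as listed in the updated config...
]

with open("config.json", "w") as f:
    json.dump(cfg, f, indent=2)
```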
It seems that if you use AutoGPTQ/AutoAWQ directly, you can get something working.
Source:
The code below throws an assertion error:
Error: