
KeyError when loading Mistral 7b via Transformers #713

Closed
lachlancahill opened this issue Mar 22, 2024 · 11 comments · Fixed by #740
@lachlancahill

The bug
Attempting to load the model yields the following output:

Loading checkpoint shards: 100%|██████████| 3/3 [00:06<00:00,  2.09s/it]
Traceback (most recent call last):
  File "C:\Users\lachl\PycharmProjects\llm_feedback_analytics\guidance_error.py", line 8, in <module>
    mistral = models.Transformers(model_id, echo=False, device_map=device, torch_dtype=torch.bfloat16)
  File "C:\Users\lachl\PycharmProjects\llm_feedback_analytics\.venv\lib\site-packages\guidance\models\transformers\_transformers.py", line 196, in __init__
    TransformersEngine(model, tokenizer, compute_log_probs, **kwargs),
  File "C:\Users\lachl\PycharmProjects\llm_feedback_analytics\.venv\lib\site-packages\guidance\models\transformers\_transformers.py", line 101, in __init__
    TransformersTokenizer(model, tokenizer),
  File "C:\Users\lachl\PycharmProjects\llm_feedback_analytics\.venv\lib\site-packages\guidance\models\transformers\_transformers.py", line 36, in __init__
    reconstructed += bytes([byte_decoder[c] for c in t.convert_ids_to_tokens(id)])
  File "C:\Users\lachl\PycharmProjects\llm_feedback_analytics\.venv\lib\site-packages\guidance\models\transformers\_transformers.py", line 36, in <listcomp>
    reconstructed += bytes([byte_decoder[c] for c in t.convert_ids_to_tokens(id)])
KeyError: '▁'

To Reproduce

import torch
from guidance import models

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

model_id = "mistralai/Mistral-7B-Instruct-v0.2"

mistral = models.Transformers(model_id, echo=False, device_map=device, torch_dtype=torch.bfloat16)

System info (please complete the following information):

  • OS: Windows 11
  • Guidance Version (guidance.__version__): 0.1.13
@riedgar-ms
Collaborator

I can reproduce this. @slundberg is this another aspect of the Unicode error you pushed a fix for earlier this week?

@riedgar-ms
Collaborator

I believe that this is at least part of what is going wrong in #716.

@JosephGatto

Any update on this? I'm also getting the same error, on Google Colab.

@riedgar-ms
Collaborator

I've not heard from @slundberg, who knows the most about this. I've just started prodding it again.

@riedgar-ms
Collaborator

riedgar-ms commented Mar 28, 2024

As a workaround, @JosephGatto and @lachlancahill: it does appear that if you load Mistral via llama-cpp (there is code for this in #716), it works.
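For reference, a minimal sketch of that llama-cpp route, assuming a locally downloaded GGUF file (the path and the llama-cpp parameters below are placeholders, not the exact code from #716):

from guidance import models

# Hypothetical local path to a quantized Mistral 7B Instruct GGUF file;
# point this at wherever your copy lives.
gguf_path = "./mistral-7b-instruct-v0.2.Q8_0.gguf"

# Load via llama-cpp instead of transformers. n_gpu_layers=-1 offloads all
# layers to the GPU (if llama-cpp-python was built with GPU support) and
# n_ctx sets the context window; both are forwarded to llama_cpp.Llama.
mistral = models.LlamaCpp(gguf_path, echo=False, n_gpu_layers=-1, n_ctx=4096)

Prompting should then work the same way as with models.Transformers.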

riedgar-ms added a commit that referenced this issue Mar 30, 2024
Since our notebooks feature Mistral, add an example to our test matrix. The Mistral model itself is loaded via llama-cpp. However:

- Due to #713, have to skip loading Mistral via `transformers`
- To avoid running out of disk space on the GitHub runner machines, we have to narrow the testing in `test_transformers.py`
@MikoAL

MikoAL commented Mar 31, 2024

I get this issue no matter what model I'm using, even if it's not based on Mistral. Any fixes?

@MikoAL

MikoAL commented Mar 31, 2024

After a bit of testing: I switched to the commit where the version number was 0.1.12 and got the same error, then switched to the commit where the version number was 0.1.11, and it started working again.

@talglobus

Running into this issue as well, on a model fine-tuned from Mistral, loaded directly from PyTorch.

@lachlancahill
Author

lachlancahill commented Apr 12, 2024

This issue has not been resolved, I'm afraid. I have force-reinstalled from git and still get the same error.

@amirabdullah19852020

Yes, this should still be open. I'm running into this error too on the latest version.

@akshaymg99

Running into the same error with the latest version.
