lm_head.weight
missing from convert_mistral_weights_to_hf.STATE_DICT_MAPPING
#36908
Open
2 of 4 tasks
Labels
System Info
transformers
version: 4.49.0Who can help?
@younesbelkada @Cyrilvallez
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
I am working with https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501, and seem to be unable to pass it into
convert_mistral_weights_to_hf.py
(keep reading).Here's the
model.safetensors.index.json
:Then running:
Expected behavior
The
lm_head.weight
to work withconvert_mistral_weights_to_hf.py
The text was updated successfully, but these errors were encountered: