
Support the Mistral AI official huggingface weights for Mixtral-8x7B-v0.1 #81

Closed
lostmygithubaccount opened this issue Dec 12, 2023 · 7 comments

@lostmygithubaccount

Per the current example:

Download the models from HuggingFace:
git clone https://huggingface.co/someone13574/mixtral-8x7b-32kseqlen

I'd really rather not re-download many GBs of weight files, and less so from someone13574 (no offense) when the weights are posted by Mistral AI itself: https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1

Would this work already? Could the example be updated to use these weights?
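
For reference, pulling the official repo with huggingface_hub rather than git-cloning a mirror would look something like this (a minimal sketch; assumes huggingface_hub is installed and you have access to the repo on the Hub; the local_dir path is arbitrary):

# Minimal sketch: download the official Mixtral weights from the Hub.
# Assumes `pip install huggingface_hub`.
from huggingface_hub import snapshot_download

path = snapshot_download(
    repo_id="mistralai/Mixtral-8x7B-Instruct-v0.1",
    local_dir="Mixtral-8x7B-Instruct-v0.1",
)
print(f"weights downloaded to {path}")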

@awni awni added the enhancement New feature or request label Dec 12, 2023
@awni awni self-assigned this Dec 12, 2023
@awni
Member

awni commented Dec 12, 2023

Yeah, it's a good point. I'll look into updating the example to use the official weights instead of the ones from MixtralKit.

@thegodone

thegodone commented Dec 13, 2023

One major issue: how do you run it with 64 GB of memory? This is a huge model; we need to think about reducing the precision. Is it also possible to think about a delta in the weights between the chat model (the instruct version, Mixtral-8x7B-Instruct-v0.1) and the base generator (mixtral-8x7b-32kseqlen)? I don't see an easy way so far to run it on an M1 with 64 GB. Running the convert script gives me a killed terminal message:

(tf) tgg@gvalmu00008 mixtral % python convert.py --model_path mixtral-8x7b-32kseqlen/
zsh: killed     python convert.py --model_path mixtral-8x7b-32kseqlen/
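
For what it's worth, here is a rough sketch of a more memory-frugal pass over the checkpoint, assuming a single consolidated PyTorch file like the one in the example (the actual convert.py may be structured differently):

# Rough sketch: convert one tensor at a time instead of materializing
# the whole ~90 GB state dict at once. torch.load(..., mmap=True)
# (PyTorch >= 2.1, zipfile-format checkpoints only) maps the file
# instead of reading it all into RAM; each tensor is saved to its own
# .npy file and released before the next one is touched.
from pathlib import Path
import numpy as np
import torch

src = "mixtral-8x7b-32kseqlen/consolidated.00.pth"
dst = Path("mixtral-8x7b-32kseqlen/npy")
dst.mkdir(exist_ok=True)

state = torch.load(src, map_location="cpu", mmap=True)
for name in list(state.keys()):
    tensor = state.pop(name)  # drop the reference as we go
    np.save(dst / f"{name}.npy", tensor.to(torch.float16).numpy())

Note that even at 16-bit the model is roughly 90 GB, so fitting it in 64 GB would need actual quantization regardless; the above only avoids holding two full copies at once during conversion (and the loading side would have to be adapted to read per-tensor files).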

@lostmygithubaccount
Author

I was able to use the llama.cpp convert script to get it to q4_0 and run it (M2, 96 GB of RAM); I cannot with f16.

I'm seeing very weird output from the model, though. I'll try to look into this more later; looking forward to learning and playing around with MLX on some of these models.
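
For reference, with a llama.cpp checkout from around that time the workflow looked roughly like this (file names and flags vary by llama.cpp version, so treat these as a sketch):

python convert.py Mixtral-8x7B-Instruct-v0.1 --outtype f16
./quantize Mixtral-8x7B-Instruct-v0.1/ggml-model-f16.gguf mixtral-8x7b-instruct-q4_0.gguf q4_0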

@caseybasichis

I'm able to get the someone13574 weights running.

I attempted the cat/convert process on the official instruct model, but no dice.

Is there a way to modify the convert script to get the instruct version going?
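
Something like the following is what I had in mind: the official repo ships sharded .safetensors files with HF-style parameter names, so the conversion is mostly loading every shard and renaming keys. This is a hypothetical sketch; the rename rules below are illustrative only, and the real target names are whatever the example's model code expects.

# Hypothetical sketch: convert the HF-format instruct repo by renaming keys.
# The RULES list is illustrative, not the actual mapping.
import glob
import re

import numpy as np
import torch
from safetensors.torch import load_file

RULES = [
    (r"^model\.layers\.(\d+)\.self_attn\.q_proj\.", r"layers.\1.attention.wq."),
    (r"^model\.layers\.(\d+)\.self_attn\.k_proj\.", r"layers.\1.attention.wk."),
    (r"^model\.embed_tokens\.", "tok_embeddings."),
]

def rename(key: str) -> str:
    for pattern, repl in RULES:
        key, n = re.subn(pattern, repl, key)
        if n:
            return key
    return key

weights = {}
for shard in sorted(glob.glob("Mixtral-8x7B-Instruct-v0.1/*.safetensors")):
    for key, tensor in load_file(shard).items():
        # cast bf16 -> f16 so numpy can hold it; keeps the full model in RAM
        weights[rename(key)] = tensor.to(torch.float16).numpy()

np.savez("mixtral-instruct-weights.npz", **weights)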

@awni
Member

awni commented Dec 13, 2023

This one, right? https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1

I'll test it after getting the official Mistral one working.

@caseybasichis

That's the one.

@awni
Member

awni commented Dec 14, 2023

Addressed in #107.
