[Mistral] Mistral-7B-v0.1 support #1196
Conversation
import torch
from torch import nn
from transformers import MistralConfig
This does not work because MistralConfig is not yet part of the released HF transformers (v4.33.3). Could you define this config class locally, like this?
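For instance, a minimal sketch of a standalone definition along these lines, following the usual PretrainedConfig pattern in transformers; the default values are assumptions taken from the Mistral-7B-v0.1 checkpoint and should be cross-checked against its config.json:

# Rough sketch of a locally defined MistralConfig, to be removed once the
# class ships in a released version of transformers. Defaults are assumed
# from Mistral-7B-v0.1 and may not match the eventual upstream class.
from transformers import PretrainedConfig


class MistralConfig(PretrainedConfig):
    model_type = "mistral"

    def __init__(
        self,
        vocab_size=32000,
        hidden_size=4096,
        intermediate_size=14336,
        num_hidden_layers=32,
        num_attention_heads=32,
        num_key_value_heads=8,
        hidden_act="silu",
        max_position_embeddings=4096 * 32,
        initializer_range=0.02,
        rms_norm_eps=1e-5,
        use_cache=True,
        pad_token_id=None,
        bos_token_id=1,
        eos_token_id=2,
        tie_word_embeddings=False,
        rope_theta=10000.0,
        sliding_window=4096,
        **kwargs,
    ):
        self.vocab_size = vocab_size
        self.hidden_size = hidden_size
        self.intermediate_size = intermediate_size
        self.num_hidden_layers = num_hidden_layers
        self.num_attention_heads = num_attention_heads
        self.num_key_value_heads = num_key_value_heads
        self.hidden_act = hidden_act
        self.max_position_embeddings = max_position_embeddings
        self.initializer_range = initializer_range
        self.rms_norm_eps = rms_norm_eps
        self.use_cache = use_cache
        self.rope_theta = rope_theta
        self.sliding_window = sliding_window
        super().__init__(
            pad_token_id=pad_token_id,
            bos_token_id=bos_token_id,
            eos_token_id=eos_token_id,
            tie_word_embeddings=tie_word_embeddings,
            **kwargs,
        )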
@timlacroix Besides this, it seems everything works fine!
OK, addressed. Will we need to change this back after the next release?
@timlacroix Yes. Once a new version of HF transformers that includes MistralConfig is released, we will remove the local definition.
The Mistral model is almost equivalent to Llama in terms of quantizing the model, so it would be super easy to extend support; I have already added Mistral to AutoAWQ. If you can modify that part of the model (sketched below), you will enable AWQ quantized models:
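For illustration, a rough, hypothetical sketch of the kind of change meant here, mirroring how the Llama implementation threads an optional quant_config through its constructor; the QuantizationConfig type, its import path, and the rest of the plumbing are assumptions, not the exact code in this PR:

# Hypothetical sketch only: accept an optional quant_config, as llama.py does,
# and forward it to the inner model so its linear layers can load AWQ weights.
from typing import Optional

from torch import nn


class MistralForCausalLM(nn.Module):

    def __init__(
        self,
        config,
        # "QuantizationConfig" is a forward reference; the concrete type and
        # its import path in vLLM are assumed, mirroring the Llama model.
        quant_config: Optional["QuantizationConfig"] = None,
    ) -> None:
        super().__init__()
        self.config = config
        self.quant_config = quant_config
        # The real constructor would build MistralModel(config, quant_config)
        # and the LM head, passing quant_config down to each parallel linear
        # layer exactly as the Llama implementation does for AWQ.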
After that, you should be able to run inference with the quantized model that is already available: https://huggingface.co/casperhansen/mistral-7b-instruct-v0.1-awq

from vllm import LLM, SamplingParams

prompts = [
    "The future of AI is",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)
llm = LLM(model="casperhansen/mistral-7b-instruct-v0.1-awq", quantization="awq", dtype="half")
outputs = llm.generate(prompts, sampling_params)

# Print the outputs.
for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")
LGTM. Since this PR is not modifiable, I will fix some miscellaneous issues right after merging it.
Co-authored-by: timlacroix <t@mistral.ai>
No description provided.