
[fix] lm-sys/FastChat/issues/2295 #2328

Merged
5 commits merged into lm-sys:main on Aug 29, 2023

Conversation

vaxilicaihouxian (Contributor)

Why are these changes needed?

First, I ran python -m fastchat.serve.cli --model-path /my/mac/path/llm_models/chatglm2-6b --load-8bit --device mps
to run chatglm2-6b, but I got this error: "Trying to convert BFloat16 to the MPS backend but it does not have support for that dtype."
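
For context, the failure can be reproduced outside FastChat with a few lines of PyTorch. This is an illustrative sketch, not code from this PR, and whether the error actually appears depends on the PyTorch and macOS versions, since newer builds add bfloat16 support on MPS:

```python
import torch

# Illustrative sketch (not FastChat code): on older PyTorch/macOS builds,
# moving a bfloat16 tensor to the MPS backend fails.
x = torch.randn(4, 4, dtype=torch.bfloat16)

if torch.backends.mps.is_available():
    try:
        x.to("mps")  # may raise: "Trying to convert BFloat16 to the MPS
                     # backend but it does not have support for that dtype."
    except (TypeError, RuntimeError) as e:
        print(e)

    # Casting to a dtype MPS does support *before* the transfer avoids the error.
    y = x.to(torch.float16).to("mps")
    print(y.dtype, y.device)
```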

Related issue number (if applicable)

#2295

Checks

  • I've run format.sh to lint the changes in this PR.
  • I've included any doc changes needed.
  • I've made sure the relevant tests are passing (if applicable).

@merrymercy (Member)

It seems your modification is not related to the picture you posted in your issue.
In your issue, the picture shows "int32 vs. int64". However, in your code, you changed float and half.
Moreover, in the two "if" branches you added, you added ".half()" in one "if" branch and one "else" branch. Why is this?

@vaxilicaihouxian (Contributor, Author)

> It seems your modification is not related to the picture you posted in your issue. In your issue, the picture shows "int32 vs. int64". However, in your code, you changed float and half. Moreover, in the two "if" branches you added, you added ".half()" in one "if" branch and one "else" branch. Why is this?

On macOS (usually device mps), bfloat16 is not supported. Maybe that is not the exact reason, but with this change, python -m fastchat.serve.cli --model-path /my/mac/path/llm_models/chatglm2-6b --load-8bit --device mps works on my MacBook (M2).
If I don't change these two lines, it shows the error "Trying to convert BFloat16 to the MPS backend but it does not have support for that dtype."
BTW, I'm not very experienced with this, just reporting my experience. :)
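
If it helps, support can also be probed at runtime. The helper below is a hypothetical sketch (mps_supports_bfloat16 is not a FastChat function); it just tries to allocate a bfloat16 tensor on MPS and reports whether the current build accepts it:

```python
import torch

def mps_supports_bfloat16() -> bool:
    """Return True if this PyTorch/macOS build can place bfloat16 tensors on MPS."""
    if not torch.backends.mps.is_available():
        return False
    try:
        torch.zeros(1, dtype=torch.bfloat16, device="mps")
        return True
    except (TypeError, RuntimeError):
        return False

print(mps_supports_bfloat16())
```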

@merrymercy (Member)

In the two "if" branches you added, you added ".half()" in one "if" branch and one "else" branch. Why is this?

@vaxilicaihouxian (Contributor, Author)

vaxilicaihouxian commented Aug 28, 2023

> In the two "if" branches you added, you added ".half()" in one "if" branch and one "else" branch. Why is this?

Oh, that's my mistake. I had added .half() to both of those "if" branches under the mps device in my local code base. Sorry about that. It is fixed now.

@@ -167,12 +167,18 @@ def load_compress_model(model_path, device, torch_dtype, use_fast, revision="main"):
    tmp_state_dict = torch.load(filename, map_location=lambda storage, loc: storage)
    for name in tmp_state_dict:
        if name in linear_weights:
            tensor = tmp_state_dict[name].to(device).data.to(torch_dtype)
merrymercy (Member)

Could you try something like this instead of if/else?
tensor = tmp_state_dict[name].to(torch_dtype).to(device).data

If it works, apply the same change to L178.
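
To spell out why the ordering matters (my reading of the suggestion, not wording from this PR): casting on the source device first means the MPS backend never has to hold a bfloat16 tensor, whereas transferring first asks MPS to store a dtype it may not support. A runnable sketch with toy stand-ins for the variables in load_compress_model:

```python
import torch

# Hypothetical stand-ins for the variables used in load_compress_model:
tmp_state_dict = {"layer.weight": torch.randn(4, 4, dtype=torch.bfloat16)}
torch_dtype = torch.float16
device = "mps" if torch.backends.mps.is_available() else "cpu"
name = "layer.weight"

# 1) Transfer first, cast second: the device must briefly hold the
#    checkpoint's original (possibly bfloat16) dtype, which can fail on
#    older PyTorch/macOS MPS builds.
# tensor = tmp_state_dict[name].to(device).data.to(torch_dtype)

# 2) Cast first, transfer second (the reviewer's suggestion): the tensor is
#    already a supported dtype (float16 here) before it reaches the device.
tensor = tmp_state_dict[name].to(torch_dtype).to(device).data
print(tensor.dtype, tensor.device)
```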

vaxilicaihouxian (Contributor, Author)

I just fixed it by inlining the .to() call and removing the mps condition. It works for chatglm2-6b:
[screenshot]

There is no need to check the device type (mps); just use the dtype argument.
It works for the raw chatglm2-6b model from Hugging Face.
merrymercy merged commit 42be87e into lm-sys:main on Aug 29, 2023
1 check passed