
fix baichuan-7b tp #598

Merged. 1 commit merged into vllm-project:main on Aug 1, 2023.

Conversation

@Sanster (Contributor) commented on Jul 27, 2023

The main modifications are in the "load_weights" function.

Before: [image]

After: [image]
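
The vocab-related part of that change pads the checkpoint's embed_tokens / lm_head weight so its row count matches the padded vocab size used by the tensor-parallel shards (see the diff excerpt later in this conversation). A minimal, self-contained sketch of the same idea; all sizes below are made up for illustration and are not Baichuan's real configuration:

import torch

# Illustrative sizes only (not Baichuan's real configuration).
vocab_size = 10        # rows in the checkpoint's embedding weight
hidden_size = 4
tp_world_size = 4      # number of tensor-parallel ranks
per_rank_rows = 3      # rows held by each rank; 3 * 4 = 12 >= vocab_size

loaded_weight = torch.randn(vocab_size, hidden_size)

# Pad the full weight up to per_rank_rows * tp_world_size rows so it can be
# split evenly across ranks, mirroring the load_weights change in this PR.
padded_vocab_size = per_rank_rows * tp_world_size
num_extra_rows = padded_vocab_size - vocab_size
extra_rows = torch.empty(num_extra_rows, loaded_weight.shape[1]).to(loaded_weight)
loaded_weight = torch.cat([loaded_weight, extra_rows], dim=0)

assert loaded_weight.shape == (padded_vocab_size, hidden_size)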

@LiVincent-Zhang commented

Is it the same reason for baichuan-13b? #530

@Sanster (Contributor, Author) commented on Jul 27, 2023

> Is it the same reason for baichuan-13b? #530

Yes. I have tested both baichuan-13b and baichuan-7b, and both produce normal output under tensor parallelism (tp).
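
For reference, a typical way to exercise tensor parallelism in vLLM looks like the sketch below; the model path, prompt, and tensor_parallel_size value are illustrative rather than taken from this PR, and the exact flags may vary by vLLM version.

from vllm import LLM, SamplingParams

# Illustrative settings; adjust the model path and TP degree to your setup.
llm = LLM(
    model="baichuan-inc/Baichuan-7B",  # assumed HF model id
    trust_remote_code=True,            # Baichuan's tokenizer ships custom code
    tensor_parallel_size=2,            # shard the model across 2 GPUs
)

outputs = llm.generate(["The quick brown fox"],
                       SamplingParams(temperature=0.0, max_tokens=64))
print(outputs[0].outputs[0].text)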

@LiVincent-Zhang commented

> Yes. I have tested both baichuan-13b and baichuan-7b, and both produce normal output under tensor parallelism (tp).

Can I use this PR directly on 13B?

@zhuohan123 (Collaborator) left a comment

Thank you for your contribution! Can you use our official formatting script and remove the other, format-only changes?

Comment on lines 282 to 294
if "embed_tokens" in name or "lm_head" in name:
# Consider padding in the vocab size.
param = state_dict[name]
padded_vocab_size = param.shape[0] * tp_world_size
num_extra_rows = padded_vocab_size - self.config.vocab_size
extra_rows = torch.empty(num_extra_rows, loaded_weight.shape[1])
extra_rows = extra_rows.to(loaded_weight)
loaded_weight = torch.cat([loaded_weight, extra_rows], dim=0)

if "W_pack" in name:
# W_pack.weight.shape [3*hidden_size, hidden_size] [3*4096, 4096] = [12,288, 4096]
total_num_heads = self.config.num_attention_heads
hidden_size = self.config.hidden_size
head_size = hidden_size // total_num_heads
num_heads = total_num_heads // tp_world_size
head_start = tp_rank * num_heads
head_end = (tp_rank + 1) * num_heads

loaded_weight = loaded_weight.view(
3, total_num_heads, head_size, hidden_size
)
loaded_weight = loaded_weight[:, head_start:head_end, :, :]
loaded_weight = loaded_weight.reshape(-1, hidden_size)
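
The W_pack branch above shards Baichuan's packed QKV projection by attention head before a given tensor-parallel rank loads it. A minimal, self-contained sketch of the same reshape-and-slice; the sizes and the tp_rank / tp_world_size values are illustrative (the PR's own comment only notes hidden_size = 4096 for Baichuan-7B):

import torch

# Illustrative sizes only.
hidden_size = 8
total_num_heads = 4
head_size = hidden_size // total_num_heads
tp_world_size = 2      # number of tensor-parallel ranks
tp_rank = 1            # index of this rank

# Packed QKV weight of shape [3 * hidden_size, hidden_size].
w_pack = torch.randn(3 * hidden_size, hidden_size)

# Split heads evenly across ranks and keep only this rank's heads,
# preserving the Q/K/V packing along the leading dimension.
num_heads = total_num_heads // tp_world_size
head_start = tp_rank * num_heads
head_end = (tp_rank + 1) * num_heads

shard = (w_pack.view(3, total_num_heads, head_size, hidden_size)
               [:, head_start:head_end, :, :]
               .reshape(-1, hidden_size))

assert shard.shape == (3 * num_heads * head_size, hidden_size)

Each rank ends up with only its own heads' rows of W_pack, which is the per-rank slicing this PR adds to load_weights.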

Collaborator commented on the diff

Is this part the only part that actually changes the code logic? Can you remove the other format-only modifications and use the format.sh script provided by us to re-format the code? Thanks!

Contributor (Author) replied

Hi, I have updated the PR and removed the format-only changes.

@zhuohan123 (Collaborator) left a comment

LGTM! Thank you for your contribution!

@zhuohan123 merged commit d4c7755 into vllm-project:main on Aug 1, 2023. 2 checks passed.
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request on Feb 13, 2024 (Co-authored-by: wq.chu <wq.chu@tianrang-inc.com>)
sjchoi1 pushed a commit to casys-kaist-internal/vllm that referenced this pull request on May 7, 2024 (Co-authored-by: wq.chu <wq.chu@tianrang-inc.com>)