Refactor loading weights #1603
Conversation
Force-pushed from 53fe863 to d7138f1.
@zhulinJulia24 Hi, could you start a full-scope test of all PyTorch engine models using …
```python
            rank=rank,
            world_size=world_size,
            prefix='query_key_value')
        rowwise_parallelize_linear(self.dense,
```
Much better than the previous version.
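
For reference, a minimal sketch of what row-wise vs. column-wise sharding of a linear weight means under tensor parallelism. Only `rowwise_parallelize_linear` and the `query_key_value` prefix appear in the diff; the slicing helpers below are an illustration under that assumption, not lmdeploy's implementation.

```python
# Minimal sketch (not lmdeploy's implementation) of row-wise vs. column-wise
# tensor-parallel sharding of a linear layer's weight.
import torch
import torch.nn as nn


def rowwise_shard(weight: torch.Tensor, rank: int, world_size: int) -> torch.Tensor:
    """Row-wise parallelism splits along the input dimension.

    For y = x @ W.T, each rank holds a slice of W's columns and the partial
    outputs are summed across ranks with an all-reduce.
    """
    in_features = weight.shape[1]
    assert in_features % world_size == 0
    chunk = in_features // world_size
    return weight[:, rank * chunk:(rank + 1) * chunk].contiguous()


def colwise_shard(weight: torch.Tensor, rank: int, world_size: int) -> torch.Tensor:
    """Column-wise parallelism splits along the output dimension."""
    out_features = weight.shape[0]
    assert out_features % world_size == 0
    chunk = out_features // world_size
    return weight[rank * chunk:(rank + 1) * chunk, :].contiguous()


if __name__ == '__main__':
    full = nn.Linear(8, 4, bias=False)
    print(rowwise_shard(full.weight.data, rank=0, world_size=2).shape)  # [4, 4]
    print(colwise_shard(full.weight.data, rank=1, world_size=2).shape)  # [2, 8]
```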
```python
logger = get_logger('lmdeploy')


def _get_weight_type(model_path: str, use_safetensors: bool = None):
```
`use_safetensors` can be {True, False, None}. Why not `True` or `False`?
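
Presumably `None` is the auto-detect case. As a hypothetical sketch (not the PR's actual `_get_weight_type` body), a tri-state flag could be resolved like this:

```python
# Hypothetical sketch: None = auto-detect, True/False = force a format.
import glob
import os


def _get_weight_type(model_path: str, use_safetensors: bool = None) -> str:
    has_safetensors = bool(glob.glob(os.path.join(model_path, '*.safetensors')))
    has_bin = bool(glob.glob(os.path.join(model_path, '*.bin')))

    if use_safetensors is None:
        # auto mode: prefer safetensors when such files exist
        use_safetensors = has_safetensors
    if use_safetensors:
        if not has_safetensors:
            raise FileNotFoundError(f'no *.safetensors files in {model_path}')
        return 'safetensors'
    if not has_bin:
        raise FileNotFoundError(f'no *.bin files in {model_path}')
    return 'pytorch'
```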
```python
    for name, param in mod.named_parameters(recurse=False):
        dtype = param.dtype
        if not loader.has(name):
            logger.debug(f'rank [{rank}]'
```
How can this condition be triggered?
Some models share the token-embedding weight with other modules, so they do not save the redundant weight in the checkpoint.
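
A rough sketch of that case, assuming tied `embed_tokens`/`lm_head` weights; the parameter names and fallback below are illustrative, not lmdeploy's loader:

```python
# Illustrative only: a parameter can be absent from the checkpoint because it
# is tied to another tensor (e.g. lm_head shares the token-embedding weight).
import torch


def load_with_tied_fallback(state_dict: dict, model: torch.nn.Module):
    for name, param in model.named_parameters():
        if name in state_dict:
            param.data.copy_(state_dict[name])
        elif name == 'lm_head.weight' and 'embed_tokens.weight' in state_dict:
            # weight tying: reuse the embedding tensor instead of failing
            param.data.copy_(state_dict['embed_tokens.weight'])
        else:
            print(f'checkpoint has no tensor for {name}')  # mirrors the debug log
```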
```
@@ -160,204 +157,3 @@ def sync_qparam_to_context(context: Any, layer_id: str, qparams: dict):
        context.set_output(layer_id, last_qparam)
    else:
        context.set_output(layer_id, qparams)


@torch.no_grad()
```
Was this used anywhere before?
Almost never.
LGTM
Optimize TP model loading.
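
A rough sketch of the overall idea as I read the diff: each rank walks the modules and pulls only the tensors it needs from a checkpoint loader, instead of materializing the full state dict on every rank. Only `loader.has` appears in the diff; the `CheckpointLoader` class and `load_module` helper below are assumptions for illustration.

```python
# Assumed sketch, not the PR's code: per-module, per-rank weight loading.
import torch


class CheckpointLoader:
    """Toy loader over an in-memory state dict; a real one would read lazily."""

    def __init__(self, state_dict: dict):
        self._sd = state_dict

    def has(self, name: str) -> bool:
        return name in self._sd

    def pop(self, name: str) -> torch.Tensor:
        return self._sd.pop(name)


@torch.no_grad()
def load_module(mod: torch.nn.Module, loader: CheckpointLoader, prefix: str = ''):
    for name, param in mod.named_parameters(recurse=False):
        full_name = prefix + name
        if not loader.has(full_name):
            continue  # e.g. tied weights not stored in the checkpoint
        param.data.copy_(loader.pop(full_name).to(param.dtype))
    for child_name, child in mod.named_children():
        load_module(child, loader, prefix=f'{prefix}{child_name}.')
```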