reduce torchengine prefill mem usage #1240

grimoire · 2024-03-04T13:46:02Z

No description provided.

lmdeploy/pytorch/engine/engine.py

AllentDan

LGTM

lvhan028 · 2024-03-05T03:44:33Z

lmdeploy/pytorch/engine/engine.py

@@ -673,13 +673,13 @@ async def __long_context_forward(inputs):
                if token_count == 0 and slen > max_prefill_token_num:
                    tmp_out = await __long_context_single_forward(inputs, idx)
                    logits_gather.gather(tmp_out)
-                    del tmp_out
+                    tmp_out.pop('logits', None)


del doesn't work?

reduce mem usage

5983223

grimoire added the Bug:P2 label Mar 4, 2024

lvhan028 reviewed Mar 4, 2024

View reviewed changes

lmdeploy/pytorch/engine/engine.py Outdated Show resolved Hide resolved

remove pdb

8476f5b

lvhan028 approved these changes Mar 5, 2024

View reviewed changes

lvhan028 requested a review from AllentDan March 5, 2024 03:12

del to pop

9f1ce63

AllentDan approved these changes Mar 5, 2024

View reviewed changes

lvhan028 reviewed Mar 5, 2024

View reviewed changes

lvhan028 merged commit 4bec832 into InternLM:main Mar 5, 2024
4 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reduce torchengine prefill mem usage #1240

reduce torchengine prefill mem usage #1240

grimoire commented Mar 4, 2024

AllentDan left a comment

lvhan028 Mar 5, 2024

reduce torchengine prefill mem usage #1240

reduce torchengine prefill mem usage #1240

Conversation

grimoire commented Mar 4, 2024

AllentDan left a comment

Choose a reason for hiding this comment

lvhan028 Mar 5, 2024

Choose a reason for hiding this comment