remove owned_session #1097
Conversation
I've tested concurrency for both the pytorch and turbomind backends, and both work OK. However, there is a VRAM increase while benchmarking the RESTful API with the pytorch backend.
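For reference, a minimal sketch of how one might track the VRAM growth during the benchmark; `report_vram` is a hypothetical helper, not part of lmdeploy:

```python
import torch

# Hypothetical helper: sample CUDA memory before and after a benchmark
# run to quantify the VRAM increase on the pytorch backend.
def report_vram(tag: str, device: int = 0) -> None:
    allocated = torch.cuda.memory_allocated(device) / 2**30
    reserved = torch.cuda.memory_reserved(device) / 2**30
    print(f'[{tag}] allocated={allocated:.2f} GiB, reserved={reserved:.2f} GiB')

# e.g. call report_vram('before') and report_vram('after') around the
# RESTful API benchmark to see whether allocated memory keeps growing.
```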
Besides, the performance gap between …
I found that for api_server, if I send requests at 64 concurrency, the actual concurrency on the pytorch backend is only 10+. Do you have any clue about this?
A blocking queue get stalls the CPU inside a coroutine. I have updated a version with async recv in 6b6dcae. But since all tokenization is processed on the same CPU as the engine, it still cannot reach the performance of profile throughput.
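To illustrate the blocking-get issue, here is a minimal sketch, assuming a `multiprocessing.Queue` between the engine process and the server; the names are illustrative, and this is not the actual 6b6dcae change:

```python
import asyncio
import multiprocessing as mp

def recv_blocking(queue: mp.Queue):
    # queue.get() blocks the whole thread, so the event loop cannot
    # schedule any other request coroutine while we wait here. This is
    # how 64 client-side requests can collapse to ~10 effective ones.
    return queue.get()

async def recv_async(queue: mp.Queue):
    # Pushing the blocking get onto a worker thread keeps the event
    # loop free, so concurrent requests can actually run concurrently.
    loop = asyncio.get_running_loop()
    return await loop.run_in_executor(None, queue.get)
```

This only removes the event-loop stall; tokenization still competes with the engine for the same CPU, which matches the remaining gap described above.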
LGTM
Conflicts:
  lmdeploy/serve/async_engine.py
  lmdeploy/tokenizer.py
Sessions are no longer bound to EngineInstance.
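A rough sketch of what this decoupling can look like; the class and method names below are hypothetical, not lmdeploy's actual API:

```python
from dataclasses import dataclass, field

# Hypothetical registry: sessions are keyed by id and owned by the
# server rather than by a particular EngineInstance, so any engine
# instance can serve any session.
@dataclass
class Session:
    session_id: int
    history: list = field(default_factory=list)

class SessionRegistry:
    def __init__(self):
        self._sessions: dict[int, Session] = {}

    def get_or_create(self, session_id: int) -> Session:
        return self._sessions.setdefault(session_id, Session(session_id))

    def end(self, session_id: int) -> None:
        self._sessions.pop(session_id, None)
```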
Tested on `chat` and `profile_pytorch_benchmark` with llama-7b.