Use new event loop instead of the current loop for pipeline #1352
Conversation
lmdeploy/serve/async_engine.py (outdated)

```diff
@@ -476,8 +475,8 @@ async def gather():
             ])
         outputs.put(None)

-        proc = Thread(
-            target=lambda: self.loop.run_until_complete(gather()))
+        proc = Thread(target=lambda: asyncio.new_event_loop().
```
Why do you use a thread instead of a coroutine here? `asyncio.get_event_loop().create_task` would create a new coroutine task that runs concurrently with the current coroutine, and `asyncio.Queue` is awaitable.
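A minimal sketch of the suggested pattern, with a stand-in producer instead of lmdeploy's real `gather()` coroutine (all names here are illustrative):

```python
import asyncio

async def produce(outputs: asyncio.Queue) -> None:
    # Stand-in for the real batched-inference coroutine:
    # push a few results, then a None sentinel.
    for i in range(3):
        await outputs.put(i)
    await outputs.put(None)

async def consume() -> list:
    outputs = asyncio.Queue()
    # Schedule the producer on the current loop instead of spawning a Thread.
    task = asyncio.get_event_loop().create_task(produce(outputs))
    items = []
    while True:
        out = await outputs.get()  # asyncio.Queue.get() is awaitable
        if out is None:
            break
        items.append(out)
    await task
    return items

print(asyncio.run(consume()))  # [0, 1, 2]
```

The producer and consumer share one event loop, so no cross-thread synchronization is needed.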
I just want to yield items without async. Do you have demo code?
`Thread` -> `asyncio.get_event_loop().create_task`
`Queue` -> `asyncio.Queue`
How can I return a generator for stream infer?
lmdeploy/lmdeploy/serve/async_engine.py, lines 484 to 488 in 3d355b5:

```python
try:
    out = outputs.get(timeout=0.001)
    if out is None:
        break
    yield out
```
```python
try:
    out = await outputs.get()
    if out is None:
        break
    yield out
```
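That `await` only compiles inside an `async def`, i.e. if the stream method becomes an async generator. A self-contained sketch of that shape (illustrative names, not lmdeploy's actual API):

```python
import asyncio

async def stream_infer(outputs: asyncio.Queue):
    """Async generator draining the queue until the None sentinel."""
    while True:
        out = await outputs.get()
        if out is None:
            break
        yield out

async def main() -> list:
    outputs = asyncio.Queue()
    for tok in ("a", "b", None):  # None marks end of stream
        outputs.put_nowait(tok)
    # Consume the async generator with an async comprehension.
    return [out async for out in stream_infer(outputs)]

print(asyncio.run(main()))  # ['a', 'b']
```

Callers would then iterate with `async for` instead of a plain `for`, which changes the public interface.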
But this is not an async function, so `await outputs.get()` cannot be used there.
lmdeploy/lmdeploy/pytorch/engine/engine.py, line 1103 in 3d355b5:

```python
def __call_async():
```
To avoid the performance drop, I used the original `Thread` and `Queue`.
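The retained Thread-plus-Queue pattern from the diff, reduced to a standalone sketch (`gather()` here is a stand-in for the real inference coroutine):

```python
import asyncio
import queue
from threading import Thread

def sync_stream():
    outputs: queue.Queue = queue.Queue()

    async def gather():
        # Stand-in for the real batched-inference coroutine.
        for i in range(3):
            outputs.put(i)
        outputs.put(None)  # sentinel marks end of stream

    # Run the coroutine on a fresh event loop in a background thread,
    # so the caller's (possibly already running) loop is never touched.
    proc = Thread(
        target=lambda: asyncio.new_event_loop().run_until_complete(gather()))
    proc.start()
    while True:
        out = outputs.get()  # blocking get keeps this a plain generator
        if out is None:
            break
        yield out
    proc.join()

print(list(sync_stream()))  # [0, 1, 2]
```

The caller gets an ordinary synchronous generator, so no existing call sites need to switch to `async for`.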
LGTM
@AllentDan what's the performance of llama-7b now?
It does not influence the api_server performance. As for the
Tested the
After recovering the original implementation, performance remained the same.
@grimoire According to @AllentDan's test result, the coroutine version slows down the performance.
rollback |
For some Jupyter notebook users, the current loop cannot be used directly.
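In a notebook, an event loop is already running in the main thread, so calling `run_until_complete` on that loop raises `RuntimeError`; a fresh loop on a worker thread, as in this PR, sidesteps the problem. A small check for that situation (a sketch, not lmdeploy code):

```python
import asyncio

def loop_is_busy() -> bool:
    """Return True if the current thread already has a running event loop
    (as inside a Jupyter notebook cell), in which case
    loop.run_until_complete() would raise RuntimeError."""
    try:
        asyncio.get_running_loop()
        return True
    except RuntimeError:
        return False

# False in a plain script; True when called from inside a running loop.
print(loop_is_busy())
```

This is why the diff creates the loop with `asyncio.new_event_loop()` inside the worker thread instead of reusing `self.loop`.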