
[Fix] Avoid AsyncEngine running the same session id #1219

Merged 6 commits into InternLM:main on Mar 1, 2024

Conversation

@AllentDan (Collaborator) commented on Feb 29, 2024

Keep the server working no matter how the client uses session_id.

from concurrent.futures import ThreadPoolExecutor
from random import randint

from tqdm import tqdm

from lmdeploy.serve.openai.api_client import APIClient

questions = ['你是谁'] * 1000  # "Who are you?"
num_parallel = 256


def process_one(question, url='0.0.0.0', port='23333'):
    client = APIClient('http://{}:{}'.format(url, port))
    model_name = client.available_models[0]

    msg = [dict(role='user', content=question)]

    # Deliberately draw session ids from a small pool so that concurrent
    # requests collide on the same session_id.
    data = client.chat_interactive_v1(msg,
                                      session_id=randint(1, 100),
                                      repetition_penalty=1.02)
    for item in data:
        pass

    data = client.chat_completions_v1(model=model_name,
                                      messages=msg,
                                      repetition_penalty=1.02)
    response = None
    for item in data:
        response = item

    return response


with ThreadPoolExecutor(max_workers=num_parallel) as executor:
    for response in tqdm(executor.map(process_one, questions)):
        print(response)
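The stress test draws session ids from randint(1, 100) while 256 workers run in parallel, so by the pigeonhole principle duplicate session ids are guaranteed on every run. This hypothetical standalone sketch (no server needed) shows how heavily the ids collide:

```python
from collections import Counter
from random import randint

# Mimic the stress test: 256 concurrent requests each pick a session_id
# uniformly from 1..100. With only 100 distinct ids available, at least
# 156 requests must share an id with another request.
ids = [randint(1, 100) for _ in range(256)]
counts = Counter(ids)
duplicated = [sid for sid, n in counts.items() if n > 1]
print(f'{len(duplicated)} of {len(counts)} distinct ids are reused')
```

This is exactly the situation the fix targets: the server must stay healthy even when many in-flight requests claim the same session id.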

@lvhan028 lvhan028 requested a review from irexyc March 1, 2024 04:44
@lvhan028 lvhan028 merged commit c1b135d into InternLM:main Mar 1, 2024
3 of 4 checks passed
@lvhan028 lvhan028 added the Bug:P1 label Mar 1, 2024
@AllentDan AllentDan mentioned this pull request Mar 1, 2024