OpenAI refactoring #2360
Conversation
I think this looks ok.
two minor points
return StreamingResponse(fake_stream_generator(),
generator = await openai_serving_completion.create_completion(
    request, raw_request)
logger.info("TYPE COMPLETION : %s" % str(type(generator)))
engine_model_config.tokenizer,
tokenizer_mode=engine_model_config.tokenizer_mode,
trust_remote_code=engine_model_config.trust_remote_code)
self._load_chat_template(self.chat_template)
The chat template is the responsibility of ChatCompletion only.
Will fix and merge this once #2355 is in.
Fine. I still made the changes ^^
@FlorianJoncour, merged! Thank you for the contribution, looking forward to the tool calling PR!
@FlorianJoncour Is there a new PR for function_call?
I'm working on it; it shouldn't take too long.
This was introduced with vllm-project#2360; it was not here before.
This is a reset of #2210.
The final goal is to implement function calls using the OpenAI API.
But since that was likely too much at once, it will be done in two parts.
This pull request is only a refactoring/relocation of code to separate the Uvicorn server, the chat, and the completions.
Chat and completions are now handled by separate classes.
The goal is to make the entire codebase clearer and more easily modifiable in the future, since the completions API should now be considered legacy.
The chat part has been split into several methods, while the completion code remains largely unchanged apart from being encapsulated in a class.
Chat and completions were tested with and without streaming mode.