
Conversation

nicobasile (Contributor) commented

Hi, I noticed that the create_chat_completion() function in the OpenAI entrypoint uses the check_length(prompt) function, which verifies that the prompt's token length doesn't exceed the model's maximum token length. If it does, the endpoint returns an HTTP error explaining so.

However, check_length(prompt) isn't called in create_completion(), so a request to that endpoint that exceeds the max token length doesn't get a relevant error message.

I added check_length(prompt) to create_completion(), and also modified the function to return the (already computed) token_ids so that we can reuse them later for engine.generate(), cutting out duplicate token encoding. In theory this gives a minor efficiency gain, but I haven't benchmarked it.
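For reference, here is a minimal sketch of the shape of the change, not the exact diff: check_length() tokenizes the prompt once, rejects over-long requests with an explanatory error, and hands the token ids back so create_completion() can forward them to engine.generate(). The request/tokenizer/engine objects, the error handling, and the caller below are simplified assumptions, not the actual vLLM code.

```python
# Illustrative sketch only -- the real entrypoint wires this into FastAPI and
# returns a proper HTTP error response; signatures here are simplified.
from typing import List, Optional, Tuple


async def check_length(
    request,                  # parsed CompletionRequest (assumed to have .max_tokens)
    prompt: str,
    tokenizer,                # HF-style tokenizer: tokenizer(prompt).input_ids
    max_model_len: int,
) -> Tuple[List[int], Optional[str]]:
    """Tokenize the prompt once and reject requests whose prompt plus
    requested completion would exceed the model's context window."""
    input_ids = tokenizer(prompt).input_ids
    token_num = len(input_ids)
    max_tokens = request.max_tokens or 0

    if token_num + max_tokens > max_model_len:
        error = (
            f"This model's maximum context length is {max_model_len} tokens, "
            f"but you requested {token_num + max_tokens} tokens "
            f"({token_num} in the prompt, {max_tokens} in the completion)."
        )
        return input_ids, error

    # Return the already-computed ids so the caller can skip a second
    # tokenization pass when submitting the request to the engine.
    return input_ids, None


async def create_completion_sketch(request, prompt, tokenizer, engine,
                                   sampling_params, request_id, max_model_len):
    """Illustrative caller: validate the length, then reuse the ids for generation."""
    token_ids, error = await check_length(request, prompt, tokenizer, max_model_len)
    if error is not None:
        return error  # the real endpoint returns an HTTP error response here
    # Passing the precomputed token ids avoids re-encoding the same prompt.
    return engine.generate(prompt, sampling_params, request_id,
                           prompt_token_ids=token_ids)
```

The key design point is that the validation step already has to tokenize the prompt, so returning those ids and passing them through to the engine means the prompt is encoded exactly once per request.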

Thanks!

@zhuohan123 (Member) left a comment

LGTM! Thank you for your contribution!

@zhuohan123 zhuohan123 merged commit 66c54aa into vllm-project:main Aug 9, 2023
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024