feat: add repetition_penalty and top_k to openai #288

huytuong010101 · 2024-02-28T15:34:16Z

Fixes #287.

tgaddair

Thanks for this PR! Just a couple suggestions to better align with OpenAI's format.

tgaddair · 2024-02-28T16:52:29Z

router/src/lib.rs

@@ -455,6 +455,8 @@ struct ChatCompletionRequest {
    // Additional parameters
    // TODO(travis): add other LoRAX params here
    response_format: Option<ResponseFormat>,
+    repetition_penalty: Option<f32>,


Looks like the OpenAI spec defines this as presence_penalty: https://platform.openai.com/docs/api-reference/chat/create

Since there is no top_k equivalent in OpenAI, adding it as a new param is fine.

tgaddair · 2024-02-28T16:53:53Z

router/src/lib.rs

@@ -587,8 +591,8 @@ impl From<CompletionRequest> for CompatGenerateRequest {
                api_token: None,
                best_of: req.best_of.map(|x| x as usize),
                temperature: req.temperature,
-                repetition_penalty: None,
-                top_k: None,
+                repetition_penalty: req.repetition_penalty,


If we map repetition_penalty to the OpenAI presence_penalty, we need to shift the range from (-2, 2) (OpenAI) to (0, 4) (ours), so something like:

repetition_penalty: req.presence_penalty.map(|x| x + 2.0)

Same for line 629.

Hi @tgaddair, as far as I know, presence_penalty and repetition_penalty have the same effect, but their ranges differ:

presence_penalty: between -2.0 and 2.0. A value of 0.0 means no penalty. Defaults to 0.0.

repetition_penalty: between 1.0 and infinity. A value of 1.0 means no penalty. Defaults to 1.0.

So the two ranges do not line up: with the +2.0 shift, presence_penalty=0 (no penalty) would become repetition_penalty=2 (a strong penalty).
Do you think we should keep them separate, or is there a better way to shift the value?

I recommend keeping both params, presence_penalty and repetition_penalty.
Or, if you still want to reuse presence_penalty, I think repetition_penalty: req.presence_penalty.map(|x| x + 1.0) is better.
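To make the difference between the two shifts proposed above concrete, here is a minimal sketch. The helper names `shift_by_two` and `shift_by_one` are hypothetical and exist only for illustration; they are not part of the router code.

```rust
// Hypothetical helpers (not in router/src/lib.rs): they only compare the
// two constant shifts discussed in this thread.

/// Shift suggested in the review: moves (-2, 2) to (0, 4).
fn shift_by_two(presence_penalty: Option<f32>) -> Option<f32> {
    presence_penalty.map(|x| x + 2.0)
}

/// Shift suggested in the reply: maps 0.0 (no penalty) to 1.0 (no penalty).
fn shift_by_one(presence_penalty: Option<f32>) -> Option<f32> {
    presence_penalty.map(|x| x + 1.0)
}
```

With presence_penalty = 0.0, `shift_by_two` yields 2.0 while `shift_by_one` yields 1.0, which is the neutral repetition_penalty value. Note that under `shift_by_one` a negative presence_penalty still lands below 1.0 (for example, -2.0 becomes -1.0), outside the repetition_penalty range described above, so neither constant shift is a complete mapping.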

Hmm, maybe it's fine to keep both for now. We can think about how best to map presence_penalty to repetition_penalty in a follow-up. Thanks for the PR!
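As a rough sketch of what "keep both" could look like, the conversion simply forwards the LoRAX-specific fields and leaves presence_penalty unmapped. The struct names `ChatRequestSketch` and `GenerateParamsSketch` are simplified, hypothetical stand-ins for the router types, and the `i32` type for top_k is an assumption.

```rust
// Simplified, hypothetical stand-ins for the router types; the real structs
// in router/src/lib.rs carry many more fields.
#[derive(Debug, Default, Clone, PartialEq)]
struct ChatRequestSketch {
    presence_penalty: Option<f32>,   // OpenAI-style field, not mapped in this sketch
    repetition_penalty: Option<f32>, // extra parameter outside the OpenAI format
    top_k: Option<i32>,              // extra parameter; the i32 type is assumed
}

#[derive(Debug, Default, Clone, PartialEq)]
struct GenerateParamsSketch {
    repetition_penalty: Option<f32>,
    top_k: Option<i32>,
}

impl From<ChatRequestSketch> for GenerateParamsSketch {
    fn from(req: ChatRequestSketch) -> Self {
        GenerateParamsSketch {
            // Forwarded unchanged; None means the client did not set the field.
            repetition_penalty: req.repetition_penalty,
            top_k: req.top_k,
        }
    }
}
```

Keeping the two penalties as separate fields avoids committing to a range mapping now, which matches the decision to revisit it in a follow-up.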

@tgaddair thank you

feat: add repetition_penalty and top_k to openai

db966ae

huytuong010101 mentioned this pull request Feb 28, 2024

Support repetition_penalty in OpenAI API #287

Closed

tgaddair reviewed Feb 28, 2024

tgaddair approved these changes Feb 29, 2024

tgaddair merged commit f915df7 into predibase:main Feb 29, 2024
1 check passed
