Server: Handle n_keep parameter in the request by jkarthic · Pull Request #6174 · ggml-org/llama.cpp

jkarthic · 2024-03-20T09:59:19Z

No description provided.

phymbert · 2024-03-20T10:13:46Z

    llama_params["repeat_last_n"]     = json_value(body,   "repeat_last_n",     default_sparams.penalty_last_n);
    llama_params["ignore_eos"]        = json_value(body,   "ignore_eos",        false);
    llama_params["tfs_z"]             = json_value(body,   "tfs_z",             default_sparams.tfs_z);
+    llama_params["n_keep"]            = json_value(body,   "n_keep",            0);


Hello, thanks but @ggerganov @ngxson I worry this is actually not OAI compatible ?

we can consider it as an "extension" to OAI, for example tfs_z or mirostat that we're having, they are not available on OAI.

In fact this code is duplicated to the one inside launch_slot_with_task, I planned to refactor all of OAI-related logic to one place, maybe I'll do this during weekend.

ngxson

LGTM. It's quite surprise to know that server does not have --n-keep argument, maybe we need to add that in the future.

Server: Handle n_keep parameter in the request

3e67baa

phymbert reviewed Mar 20, 2024

View reviewed changes

ngxson approved these changes Mar 20, 2024

View reviewed changes

phymbert approved these changes Mar 20, 2024

View reviewed changes

phymbert merged commit 47cc7a7 into ggml-org:master Mar 20, 2024

jkarthic deleted the server_n_keep branch March 20, 2024 13:26

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 3, 2024

Server: Handle n_keep parameter in the request (ggml-org#6174)

786af84

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

Server: Handle n_keep parameter in the request (ggml-org#6174)

c14bd80

phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026

Server: Handle n_keep parameter in the request (ggml-org#6174)

1d1248f

ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026

Server: Handle n_keep parameter in the request (ggml-org#6174)

bb4fdbe

my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026

Server: Handle n_keep parameter in the request (ggml-org#6174)

c8b0966

my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026

Server: Handle n_keep parameter in the request (ggml-org#6174)

4600f51

AlexiAlp pushed a commit to minghaop/llama.cpp that referenced this pull request Jun 2, 2026

Server: Handle n_keep parameter in the request (ggml-org#6174)

22d28ec

AlexiAlp pushed a commit to minghaop/llama.cpp that referenced this pull request Jun 2, 2026

Server: Handle n_keep parameter in the request (ggml-org#6174)

989ce55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Server: Handle n_keep parameter in the request#6174

Server: Handle n_keep parameter in the request#6174
phymbert merged 1 commit into
ggml-org:masterfrom
get-wrecked:server_n_keep

jkarthic commented Mar 20, 2024

Uh oh!

phymbert Mar 20, 2024

Uh oh!

ngxson Mar 20, 2024

Uh oh!

ngxson Mar 20, 2024 •

edited

Loading

Uh oh!

ngxson left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jkarthic commented Mar 20, 2024

Uh oh!

phymbert Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

ngxson Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

ngxson Mar 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ngxson left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ngxson Mar 20, 2024 •

edited

Loading