[BugFix] adjust max_tokens and min_tokens when continue to generate tokens by kxz2002 · Pull Request #5010 · PaddlePaddle/FastDeploy

kxz2002 · 2025-11-13T12:15:53Z

Motivation

To support model continuous generation, the calculation of max_tokens and min_tokens must be considered.

Modifications

Added logic to compute max_tokens and min_tokens in engine_client.py and ernie4_5_vl_processor.py.

Usage or Command

No change in manual command.

Accuracy Tests

No need.

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

paddle-bot · 2025-11-13T12:16:03Z

Thanks for your contribution!

LiqinruiG

LGTM

…okens (PaddlePaddle#5010) * fix max and min tokens initial commit * fix double subtraction * add unit tests

…okens (#5010) (#5013) * fix max and min tokens initial commit * fix double subtraction * add unit tests

…okens (#5010) (#5015) * fix max and min tokens initial commit * fix double subtraction * add unit tests Co-authored-by: gaoziyuan <88373061+gzy19990617@users.noreply.github.com>

…okens (PaddlePaddle#5010) (PaddlePaddle#5015) * fix max and min tokens initial commit * fix double subtraction * add unit tests Co-authored-by: gaoziyuan <88373061+gzy19990617@users.noreply.github.com>

kxz2002 added 3 commits November 13, 2025 15:05

fix max and min tokens initial commit

1608044

fix double subtraction

2ccfa43

add unit tests

2412d3d

paddle-bot bot added the contributor External developers label Nov 13, 2025

LiqinruiG approved these changes Nov 13, 2025

View reviewed changes

LiqinruiG merged commit 9703108 into PaddlePaddle:develop Nov 13, 2025
22 of 24 checks passed

kxz2002 added a commit to kxz2002/FastDeploy that referenced this pull request Nov 14, 2025

[BugFix] adjust max_tokens and min_tokens when continue to generate t…

e775ac7

…okens (PaddlePaddle#5010) * fix max and min tokens initial commit * fix double subtraction * add unit tests

kxz2002 mentioned this pull request Nov 14, 2025

[Cherry-Pick] adjust max_tokens and min_tokens when continue to generate tokens (#5010) #5013

Merged

5 tasks

kxz2002 added a commit to kxz2002/FastDeploy that referenced this pull request Nov 14, 2025

[BugFix] adjust max_tokens and min_tokens when continue to generate t…

162e28a

…okens (PaddlePaddle#5010) * fix max and min tokens initial commit * fix double subtraction * add unit tests

kxz2002 mentioned this pull request Nov 14, 2025

[Cherry-Pick] adjust max_tokens and min_tokens when continue to generate tokens (#5010) #5014

Closed

5 tasks

kxz2002 added a commit to kxz2002/FastDeploy that referenced this pull request Nov 14, 2025

[BugFix] adjust max_tokens and min_tokens when continue to generate t…

5bdee87

…okens (PaddlePaddle#5010) * fix max and min tokens initial commit * fix double subtraction * add unit tests

kxz2002 mentioned this pull request Nov 14, 2025

[Cherry-Pick] adjust max_tokens and min_tokens when continue to generate tokens (#5010) #5015

Merged

5 tasks

Jiang-Jia-Jun pushed a commit that referenced this pull request Nov 14, 2025

[BugFix] adjust max_tokens and min_tokens when continue to generate t…

e92783e

…okens (#5010) (#5013) * fix max and min tokens initial commit * fix double subtraction * add unit tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] adjust max_tokens and min_tokens when continue to generate tokens#5010

[BugFix] adjust max_tokens and min_tokens when continue to generate tokens#5010
LiqinruiG merged 3 commits intoPaddlePaddle:developfrom
kxz2002:bugfix/fix_max_and_min

kxz2002 commented Nov 13, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Nov 13, 2025

Uh oh!

LiqinruiG left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kxz2002 commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Nov 13, 2025

Uh oh!

LiqinruiG left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kxz2002 commented Nov 13, 2025 •

edited

Loading