[Cherry-Pick] adjust max_tokens and min_tokens when continue to generate tokens (#5010) by kxz2002 · Pull Request #5015 · PaddlePaddle/FastDeploy

kxz2002 · 2025-11-14T02:35:42Z

…okens (#5010)

fix max and min tokens initial commit
fix double subtraction
add unit tests

Motivation

To support model continuous generation, the calculation of max_tokens and min_tokens must be considered.

Modifications

Added logic to compute max_tokens and min_tokens in engine_client.py and ernie4_5_vl_processor.py.

Usage or Command

No change in manual command.

Accuracy Tests

No need.

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

…okens (PaddlePaddle#5010) * fix max and min tokens initial commit * fix double subtraction * add unit tests

paddle-bot · 2025-11-14T02:35:48Z

Thanks for your contribution!

LiqinruiG

LGTM

…_max_and_min

…okens (PaddlePaddle#5010) (PaddlePaddle#5015) * fix max and min tokens initial commit * fix double subtraction * add unit tests Co-authored-by: gaoziyuan <88373061+gzy19990617@users.noreply.github.com>

[BugFix] adjust max_tokens and min_tokens when continue to generate t…

5bdee87

…okens (PaddlePaddle#5010) * fix max and min tokens initial commit * fix double subtraction * add unit tests

paddle-bot bot added the contributor External developers label Nov 14, 2025

LiqinruiG approved these changes Nov 14, 2025

View reviewed changes

Merge branch 'feature/experimental_feature_20250908' into cp_0908_fix…

926ae1e

…_max_and_min

Jiang-Jia-Jun self-requested a review November 14, 2025 09:50

gzy19990617 merged commit 936a809 into PaddlePaddle:feature/experimental_feature_20250908 Nov 14, 2025
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Cherry-Pick] adjust max_tokens and min_tokens when continue to generate tokens (#5010)#5015

[Cherry-Pick] adjust max_tokens and min_tokens when continue to generate tokens (#5010)#5015
gzy19990617 merged 2 commits intoPaddlePaddle:feature/experimental_feature_20250908from
kxz2002:cp_0908_fix_max_and_min

kxz2002 commented Nov 14, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Nov 14, 2025

Uh oh!

LiqinruiG left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

kxz2002 commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Nov 14, 2025

Uh oh!

LiqinruiG left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

kxz2002 commented Nov 14, 2025 •

edited

Loading