-
Notifications
You must be signed in to change notification settings - Fork 272
fix:make reasoning effort configurable #21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix:make reasoning effort configurable #21
Conversation
👥 vLLM Semantic Team NotificationThe following members have been identified for the changed files in this PR and have been automatically assigned: 📁
|
✅ Deploy Preview for vllm-semantic-router ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
|
@OneZero-Y can you run |
The merge-base changed after approval.
fd7623d to
7b4eb2c
Compare
|
test failed |
Signed-off-by: OneZero-Y <aukovyps@163.com>
Signed-off-by: OneZero-Y <aukovyps@163.com>
Signed-off-by: OneZero-Y <aukovyps@163.com>
8203687 to
2bdfc9d
Compare
|
The config is currently manual, will have an automated process (to be listed in roadmap). Thanks for contributing! |
What type of PR is this?
Fix - Resolves hardcoded values and improves configurability
What this PR does / why we need it:
This PR makes reasoning effort levels configurable instead of hardcoded, addressing a TODO comment in the codebase and improving system flexibility.
Changes:
Configuration Examples:
Which issue(s) this PR fixes:
Fixes hardcoded reasoning effort levels (addresses TODO in reason_mode_selector.go:94)
Release Notes: Yes/No