-
Notifications
You must be signed in to change notification settings - Fork 695
Pull requests: xorbitsai/inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ENH: Add 4-sample micro-batching to Qwen-3 reranker to reduce GPU memory
enhancement
New feature or request
gpu
ENH: Added the check for reserved model uid like "instances", "prompts" etc.
enhancement
New feature or request
Previous Next
ProTip!
no:milestone will show everything without a milestone.