Closed
Labels
feature-request, panel-chat, verification-needed, verified
Description
As proposed by Logan: "if we think 64k is too much we should come up with some heuristics as to what to reserve for output. Maybe output should be at max 20% input?"
15%-20% are good numbers.
For example, Sonnet 4 has 200k input and 16k output, which is 8%.
Gemini 2.5 Pro has 1M input and 64k output, which is 6.4%.
Thus my recommendation is that we cap max_output at 15% of input.
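To make the proposed heuristic concrete, here is a minimal sketch of such a cap, assuming the reservation is the smaller of the model's advertised output limit and 15% of its input limit. The function name `cap_max_output` and its parameters are hypothetical and are not taken from the actual codebase.

```python
def cap_max_output(input_tokens: int, model_max_output: int, ratio_percent: int = 15) -> int:
    """Return the number of tokens to reserve for output.

    Hypothetical helper: takes the model's advertised output limit and caps it
    at `ratio_percent` percent of the input limit, as proposed in this issue.
    Integer arithmetic is used to avoid floating-point rounding.
    """
    if input_tokens < 0 or model_max_output < 0:
        raise ValueError("token counts must be non-negative")
    if not 0 < ratio_percent <= 100:
        raise ValueError("ratio_percent must be in (0, 100]")

    cap = input_tokens * ratio_percent // 100
    return min(model_max_output, cap)


# The two models mentioned above already sit below the cap, so they are unchanged.
print(cap_max_output(200_000, 16_000))    # 16000
print(cap_max_output(1_000_000, 64_000))  # 64000

# A model with a small input limit and a large output limit gets capped.
print(cap_max_output(128_000, 64_000))    # 19200
```

With this shape, the cap only takes effect when a model's output limit is large relative to its input limit; the 20% variant from the quote is `ratio_percent=20`.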
fyi @roblourens