Add the utility to truncate by tokens #6746
Conversation
@codex review this
💡 Codex Review
codex-rs/core/src/truncate.rs, lines 114 to 117 in d1d0644:

```rust
if omitted_text_items > 0 {
    out.push(FunctionCallOutputContentItem::InputText {
        text: format!("[omitted {omitted_text_items} text items ...]"),
    });
```
The new token budget helper for structured tool output can still exceed the caller-provided limit. In `truncate_function_output_items_to_token_limit`, once we run out of `remaining_tokens` we increment `omitted_text_items`, but after the loop we always append the `[omitted …]` summary string (lines 114-117). That summary itself consumes several tokens, yet we add it even when `remaining_tokens` is zero, so the returned `content_items` can exceed the configured cap.

A concrete regression: if a user sets `context_manager_function_output_max_tokens = 0`, every text item hits the `remaining_tokens == 0` branch, and we still return a single `[omitted …]` span, so the history now stores tokens even though the limit was zero. The same happens whenever the budget is exhausted exactly; the summary pushes us over.

Please gate the summary on having remaining budget (or drop it when the budget is zero) so the function truly enforces the requested token cap.
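For illustration, a minimal sketch of that gating, with stand-in types for the crate-private ones; `approx_token_count`, `push_omitted_summary`, and the 4-bytes-per-token heuristic are assumptions, not names from this PR:

```rust
// Stand-in for FunctionCallOutputContentItem from the PR.
enum Item {
    InputText { text: String },
}

// Hypothetical estimator; the real crate may count tokens differently.
fn approx_token_count(s: &str) -> usize {
    s.len().div_ceil(4)
}

// Append the "[omitted ...]" summary only if it fits the leftover budget,
// so a zero (or exactly exhausted) limit adds no extra tokens.
fn push_omitted_summary(out: &mut Vec<Item>, omitted_text_items: usize, remaining_tokens: usize) {
    if omitted_text_items == 0 {
        return;
    }
    let text = format!("[omitted {omitted_text_items} text items ...]");
    if approx_token_count(&text) <= remaining_tokens {
        out.push(Item::InputText { text });
    }
}

fn main() {
    let mut out = Vec::new();
    push_omitted_summary(&mut out, 3, 0); // zero budget: nothing is appended
    assert!(out.is_empty());
}
```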
@codex review this
```rust
#[test]
fn record_items_truncates_function_call_output_content() {
    let model = "gpt-5.1-codex";
```
Can we remove all dynamic model lookups in tests that don't care about a specific model?
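For illustration, one way to honor that, assuming such tests only need a fixed budget; `TEST_MAX_BYTES` and `truncate_to_limit` are hypothetical names, not from this PR:

```rust
// Hypothetical fixed budget: no model-family lookup in the test.
const TEST_MAX_BYTES: usize = 1_024;

// Illustrative helper operating on the pinned budget. Assumes ASCII input
// for brevity; real code must truncate on a char boundary.
fn truncate_to_limit(s: &str) -> &str {
    &s[..s.len().min(TEST_MAX_BYTES)]
}

#[test]
fn truncates_output_without_naming_a_model() {
    let long_output = "x".repeat(TEST_MAX_BYTES * 2);
    assert_eq!(truncate_to_limit(&long_output).len(), TEST_MAX_BYTES);
}
```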
💡 Codex Review
Here are some automated review suggestions for this pull request.
```rust
TruncationPolicy::Bytes(family_bytes) => {
    if let Some(token_limit) = config_token_limit {
        Self::Bytes(approx_bytes_for_tokens(token_limit))
    } else {
        Self::Bytes(approx_bytes_for_tokens(family_bytes))
    }
```
Stop inflating byte truncation budgets
When we hit the `Bytes` branch we already have a byte budget coming from `model_family.truncation_policy`. For example, the default family for GPT‑5.1 sets `TruncationPolicy::Bytes(10_000)` (see codex-rs/core/src/model_family.rs:95,192), carried over from the previous `MODEL_FORMAT_MAX_BYTES = 10 * 1024` limit. Running that value through `approx_bytes_for_tokens` multiplies it by four (~40 KiB), so byte-based families will now keep roughly 4× more tool output in history before truncating. That is a regression: by default, GPT‑5.1 will no longer truncate until ~40 KiB, re-introducing the context bloat this change was meant to prevent. We should just return `Self::Bytes(family_bytes)` in the `None` case.
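A sketch of the suggested fix, with stand-in definitions for `TruncationPolicy` and `approx_bytes_for_tokens` (the real crate's variants and conversion may differ):

```rust
#[derive(Debug, PartialEq)]
enum TruncationPolicy {
    Bytes(usize),
    Tokens(usize),
}

// Stand-in conversion: ~4 bytes per token.
fn approx_bytes_for_tokens(tokens: usize) -> usize {
    tokens * 4
}

impl TruncationPolicy {
    fn resolve(family: TruncationPolicy, config_token_limit: Option<usize>) -> Self {
        match family {
            TruncationPolicy::Bytes(family_bytes) => match config_token_limit {
                // A user-provided token limit still needs conversion to bytes.
                Some(token_limit) => Self::Bytes(approx_bytes_for_tokens(token_limit)),
                // The family budget is already in bytes: return it unchanged
                // instead of inflating it ~4x via approx_bytes_for_tokens.
                None => Self::Bytes(family_bytes),
            },
            other => other,
        }
    }
}

fn main() {
    // The family byte budget passes through unchanged when there is no override.
    assert_eq!(
        TruncationPolicy::resolve(TruncationPolicy::Bytes(10_000), None),
        TruncationPolicy::Bytes(10_000)
    );
}
```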
@codex review
💡 Codex Review
Here are some automated review suggestions for this pull request.
@codex review this
Use `…calls_output_max_tokens` with tokens as the main budget unit, but truncate based on the model family by introducing `TruncationPolicy`; `truncate_text` acts as a router for truncation based on the mode (a sketch follows below).

In next PRs:
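A hedged sketch of that routing, assuming `Bytes`/`Tokens` variants on `TruncationPolicy` and a flat 4-bytes-per-token stand-in for real token counting:

```rust
enum TruncationPolicy {
    Bytes(usize),
    Tokens(usize),
}

// Router: resolve the policy to a byte budget, then truncate once.
fn truncate_text(text: &str, policy: &TruncationPolicy) -> String {
    let max_bytes = match policy {
        TruncationPolicy::Bytes(b) => *b,
        // Stand-in conversion; the real implementation may use a tokenizer.
        TruncationPolicy::Tokens(t) => t * 4,
    };
    if text.len() <= max_bytes {
        return text.to_string();
    }
    // Back up to a char boundary so we never split a UTF-8 sequence.
    let mut end = max_bytes;
    while !text.is_char_boundary(end) {
        end -= 1;
    }
    format!("{}...", &text[..end])
}

fn main() {
    let policy = TruncationPolicy::Tokens(2); // ~8 bytes under the stand-in
    println!("{}", truncate_text("hello truncation", &policy));
}
```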