Fix tokenization of <|constrain|> content type in rendering by dzhulgakov · Pull Request #47 · openai/harmony

dzhulgakov · 2025-08-07T19:49:09Z

The recommended way to render a constrained JSON clause in Harmony is .with_content_type("<|constrain|>json"). Unfortunately, the current code treats it as user input and doesn't tokenize into special tokens.

Thus, the previous tool calls get rendered wrong into the token space by encoding.render_conversation:

From rendering =======
' <' '|' 'con' 'strain' '|' '>' 'json' '<|message|>' '{}' '<|call|>'

After re-encoding or as produced by model =======
' ' '<|constrain|>' 'json' '<|message|>' '{}' '<|call|>'

This confuses the model and leads to invalid syntax for subsequent tool calling, especially on 20B model. See #27 (comment) for more context

If one renders text and encodes it back with special tokens allowed, the output is fixed. However, it opens an opportunity for prompt injection attacks as user input may now contain special tokens.

Instead, this PR handles the <|constrain|> special case explicitly. Alternative would be to add explicit API like .with_content_type("json", contrain=True) but it feels like a bigger change.

Minimal repro

Script

from openai_harmony import load_harmony_encoding, Conversation, Message, Role

encoding = load_harmony_encoding("HarmonyGptOss")

tool_call_message = (
    Message.from_role_and_content(Role.ASSISTANT, "{}")
    .with_recipient("functions.dummy")
    .with_channel("commentary")
    .with_content_type("<|constrain|>json")
)
conversation = Conversation.from_messages([tool_call_message])
tokens = encoding.render_conversation(conversation=conversation)
text = encoding.decode_utf8(tokens)
reencoded_tokens = encoding.encode(text, allowed_special="all")
text_reencoded = encoding.decode_utf8(reencoded_tokens)

print(f"{tokens == reencoded_tokens=}")
print(f"{text == text_reencoded=}")

print("text =======")
print(text)
print()
print("reencoded text =======")
print(text_reencoded)
print()

print("tokens =======")
print(" ".join([repr(encoding.decode([token])) for token in tokens]))
print()
print("reencoded tokens =======")
print(" ".join([repr(encoding.decode([token])) for token in reencoded_tokens]))
print()

output before this PR

tokens == reencoded_tokens=False
text == text_reencoded=True
text =======
<|start|>assistant to=functions.dummy<|channel|>commentary <|constrain|>json<|message|>{}<|call|>

reencoded text =======
<|start|>assistant to=functions.dummy<|channel|>commentary <|constrain|>json<|message|>{}<|call|>

tokens =======
'<|start|>' 'assistant' ' to' '=' 'functions' '.d' 'ummy' '<|channel|>' 'comment' 'ary' ' <' '|' 'con' 'strain' '|' '>' 'json' '<|message|>' '{}' '<|call|>'

reencoded tokens =======
'<|start|>' 'assistant' ' to' '=' 'functions' '.d' 'ummy' '<|channel|>' 'comment' 'ary' ' ' '<|constrain|>' 'json' '<|message|>' '{}' '<|call|>'

output after this PR

tokens == reencoded_tokens=True
text == text_reencoded=True
text =======
<|start|>assistant to=functions.dummy<|channel|>commentary <|constrain|>json<|message|>{}<|call|>

reencoded text =======
<|start|>assistant to=functions.dummy<|channel|>commentary <|constrain|>json<|message|>{}<|call|>

tokens =======
'<|start|>' 'assistant' ' to' '=' 'functions' '.d' 'ummy' '<|channel|>' 'comment' 'ary' ' ' '<|constrain|>' 'json' '<|message|>' '{}' '<|call|>'

reencoded tokens =======
'<|start|>' 'assistant' ' to' '=' 'functions' '.d' 'ummy' '<|channel|>' 'comment' 'ary' ' ' '<|constrain|>' 'json' '<|message|>' '{}' '<|call|>'

)" This reverts commit 8a4645f.

Fix tokenization of <|constrain|> content type in rendering

7285caf

dkundel-openai approved these changes Aug 9, 2025

View reviewed changes

dkundel-openai merged commit 8a4645f into openai:main Aug 9, 2025

dkundel-openai added a commit that referenced this pull request Aug 9, 2025

Revert "Fix tokenization of <|constrain|> content type in rendering (#47

488c417

)" This reverts commit 8a4645f.

chris-monardo mentioned this pull request Aug 9, 2025

render_conversation does not properly encode .with_content_type('<|constrain|>json') #53

Open

This was referenced Aug 21, 2025

Mismatch between model output and harmony tool call render #27

Open

[Bug]: For GPT OSS 120B: Expected 2 output messages (reasoning and final), but got 7. vllm-project/vllm#22403

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix tokenization of <|constrain|> content type in rendering#47

Fix tokenization of <|constrain|> content type in rendering#47
dkundel-openai merged 1 commit intoopenai:mainfrom
dzhulgakov:fix-contrain-tokenization

dzhulgakov commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants