fix(embed): truncate inputs over 8192-token limit by RomneyDa · Pull Request #4 · openclaw/gitcrawl

RomneyDa · 2026-04-30T09:30:58Z

Summary

Cap embedding input at 24,576 runes (~3 chars/token floor × 8192) so threads with oversized bodies stop blowing the entire batch.
Repro added: unit test on embeddingTextForBasis and e2e test with a fake OpenAI server that returns the real 400 error.

Closes #2. Per-model dynamic detection tracked in #3.

Test plan

go test ./... (full suite green)
Both new tests fail pre-fix with the production error message, pass post-fix

Threads with bodies past OpenAI's embedding cap (e.g. openclaw/openclaw#27137, 10k tokens) failed the entire batch. Cap embedding input to 24576 runes so 3 chars/token-floor content stays under 8192 tokens. Closes openclaw#2. Per-model dynamic detection tracked in openclaw#3.

RomneyDa · 2026-04-30T09:31:19Z

@vincentkoc fixed!

RomneyDa closed this Apr 30, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(embed): truncate inputs over 8192-token limit#4

fix(embed): truncate inputs over 8192-token limit#4
RomneyDa wants to merge 1 commit into
openclaw:mainfrom
RomneyDa:fix/embed-truncate-oversized-input

RomneyDa commented Apr 30, 2026

Uh oh!

RomneyDa commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

RomneyDa commented Apr 30, 2026

Summary

Test plan

Uh oh!

RomneyDa commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant