
increase embedding batch size from 50 to 500#86

Merged
samcm merged 1 commit into master from bump-embed-batch-size
Mar 18, 2026


samcm commented on Mar 18, 2026

Bumps the client-side embedding batch size from 50 to 500 to match the proxy's 500-item hard cap. The proxy already sub-batches to the upstream API at 100 items internally, so this change only cuts the number of client-to-proxy round-trips on cold starts, from ~400 to ~40.

The proxy already accepts up to 500 items per request and sub-batches
to the upstream API at 100 internally. Sending larger batches from the
server reduces proxy round-trips from ~400 to ~40 on a cold EIP embed.
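
The round-trip arithmetic above can be sketched as follows. This is a minimal illustration, not the code changed in this PR: the names `chunk`, `round_trips`, and `ASSUMED_ITEMS` are hypothetical, and the total of roughly 20,000 items is only inferred from "~400 requests at 50 items each".

```python
import math
from typing import Iterator, Sequence, TypeVar

T = TypeVar("T")

OLD_BATCH_SIZE = 50       # client batch size before this change
NEW_BATCH_SIZE = 500      # client batch size after this change (proxy hard cap)
UPSTREAM_SUB_BATCH = 100  # size the proxy uses when calling the upstream API

# Assumption: inferred from ~400 round-trips at 50 items each; not stated in the PR.
ASSUMED_ITEMS = 20_000


def chunk(items: Sequence[T], size: int) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of at most `size` items."""
    if size <= 0:
        raise ValueError("size must be positive")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def round_trips(total_items: int, batch_size: int) -> int:
    """Number of requests needed to send `total_items` in batches."""
    return math.ceil(total_items / batch_size)


if __name__ == "__main__":
    print(round_trips(ASSUMED_ITEMS, OLD_BATCH_SIZE))  # 400
    print(round_trips(ASSUMED_ITEMS, NEW_BATCH_SIZE))  # 40
    # One full 500-item request is split into this many upstream sub-batches.
    print(round_trips(NEW_BATCH_SIZE, UPSTREAM_SUB_BATCH))  # 5
```

A client would loop over `chunk(texts, NEW_BATCH_SIZE)` and send one request per slice; the final slice may be smaller than the batch size.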
samcm merged commit df87d31 into master on Mar 18, 2026
4 checks passed
