feat(server): add /v1/messages/count_tokens endpoint by audreyt · Pull Request #90 · antirez/ds4

audreyt · 2026-05-12T11:13:34Z

Anthropic's count_tokens API takes the same request shape as /v1/messages but only returns the prompt token count without running inference. This short-circuits before enqueueing a job: parse_anthropic_request renders and tokenizes the prompt the same way it would for a real generation, then we serialize {"input_tokens": N} and release the request.

Useful for clients that need to plan context budgets before committing to a generation, e.g. the Anthropic SDK token-counting flow.

Anthropic's count_tokens API takes the same request shape as /v1/messages but only returns the prompt token count without running inference. This short-circuits before enqueueing a job: parse_anthropic_request renders and tokenizes the prompt the same way it would for a real generation, then we serialize {"input_tokens": N} and release the request. Useful for clients that need to plan context budgets before committing to a generation, e.g. the Anthropic SDK token-counting flow. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

audreyt mentioned this pull request May 12, 2026

feat(loader): support stock-recipe (Q8_0/F32) abliterated GGUFs end-to-end on Metal #60

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(server): add /v1/messages/count_tokens endpoint#90

feat(server): add /v1/messages/count_tokens endpoint#90
audreyt wants to merge 1 commit into
antirez:mainfrom
audreyt:feat/count-tokens

audreyt commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

audreyt commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant