
Only require max_tokens when token rate limits apply #3771

Merged

virajmehta merged 15 commits into main from gb/fix-3648 on Oct 1, 2025

Conversation

@GabrielBianconi
Member

@GabrielBianconi commented Oct 1, 2025

Fix #3648


Important

Modify rate limiting to require max_tokens only when token rate limits apply, updating RateLimitedRequest implementations and adding relevant tests.

  • Behavior:
    • The RateLimitedRequest trait's estimated_resource_usage method now takes a resources parameter to determine whether max_tokens is required.
    • The EmbeddingRequest and ModelInferenceRequest implementations were updated to require max_tokens only when RateLimitResource::Token is present.
    • RateLimitingConfig::get_rate_limited_resources method added to determine active rate-limited resources.
  • Tests:
    • Added test_model_provider_infer_max_tokens_check in model.rs to validate behavior when max_tokens is missing or provided.
    • Added test_max_tokens_validation_with_rate_limits in rate_limiting/mod.rs to ensure correct resource inclusion based on rate limits.
  • Misc:
    • Renamed RateLimitResourceUsage to EstimatedRateLimitResourceUsage in several places for clarity.

This description was created by Ellipsis for d7da1ef.
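The conditional validation described in the summary can be sketched as follows. This is a hypothetical minimal version: the type and method names (RateLimitResource, EstimatedRateLimitResourceUsage, estimated_resource_usage) follow the summary above, but the fields, signatures, and error handling are illustrative assumptions, not the actual tensorzero-core code.

```rust
// Hypothetical sketch of resource-aware validation; names follow the PR
// summary, but all fields and signatures are assumptions for illustration.

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum RateLimitResource {
    ModelInference,
    Token,
}

#[derive(Debug, PartialEq)]
struct EstimatedRateLimitResourceUsage {
    model_inferences: u64,
    tokens: Option<u64>,
}

struct ModelInferenceRequest {
    max_tokens: Option<u32>,
    estimated_input_tokens: u64,
}

impl ModelInferenceRequest {
    /// max_tokens is only required when a token rate limit is active;
    /// otherwise token usage is simply not estimated.
    fn estimated_resource_usage(
        &self,
        resources: &[RateLimitResource],
    ) -> Result<EstimatedRateLimitResourceUsage, String> {
        let tokens = if resources.contains(&RateLimitResource::Token) {
            let max = self
                .max_tokens
                .ok_or_else(|| "max_tokens is required when token rate limits apply".to_string())?;
            Some(self.estimated_input_tokens + u64::from(max))
        } else {
            None
        };
        Ok(EstimatedRateLimitResourceUsage {
            model_inferences: 1,
            tokens,
        })
    }
}
```

Under this sketch, a request without max_tokens succeeds against an inference-only rate limit but fails once a token rate limit is configured, which matches the intended behavior of the fix.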

Copilot AI review requested due to automatic review settings October 1, 2025 14:43
Contributor

Copilot AI left a comment


Pull Request Overview

This PR fixes issue #3648 by making the max_tokens parameter only required when token rate limits are actually active. The change introduces conditional validation that checks which rate limit resources are configured before requiring specific parameters.

Key changes:

  • Added resource-aware validation for rate limiting requirements
  • Replaced fixed resource usage calculation with conditional estimation based on active rate limits
  • Updated the trait interface to pass rate-limited resources to estimation methods
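The third bullet implies the config layer must first determine which resources are rate limited before they can be passed to the estimation methods. A minimal sketch of that step, assuming an illustrative rule-based config shape (the RateLimitRule fields here are hypothetical, not the real tensorzero-core types):

```rust
// Hypothetical sketch of RateLimitingConfig::get_rate_limited_resources;
// the rule/config structure is assumed for illustration.

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum RateLimitResource {
    ModelInference,
    Token,
}

struct RateLimitRule {
    resource: RateLimitResource,
    capacity: u64,
}

struct RateLimitingConfig {
    rules: Vec<RateLimitRule>,
}

impl RateLimitingConfig {
    /// Collect the distinct resources that any configured rule limits,
    /// so callers can decide which usage estimates are actually needed.
    fn get_rate_limited_resources(&self) -> Vec<RateLimitResource> {
        let mut out = Vec::new();
        for rule in &self.rules {
            if !out.contains(&rule.resource) {
                out.push(rule.resource);
            }
        }
        out
    }
}
```

The result of this call is what would be threaded through to estimated_resource_usage, making the max_tokens requirement depend on whether RateLimitResource::Token appears in the active set.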

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

Reviewed files:

  • tensorzero-core/src/rate_limiting/mod.rs: adds a method to get the active rate-limited resources and makes the resource usage calculation conditional
  • tensorzero-core/src/model.rs: adds test coverage for max_tokens validation under different rate limiting configurations
  • tensorzero-core/src/inference/types/mod.rs: updates ModelInferenceRequest to estimate token usage only when token rate limits are active
  • tensorzero-core/src/embeddings.rs: updates EmbeddingRequest to estimate resource usage conditionally based on active rate limits


@GabrielBianconi GabrielBianconi marked this pull request as draft October 1, 2025 14:47
Copilot AI review requested due to automatic review settings October 1, 2025 15:14
Contributor

Copilot AI left a comment


Pull Request Overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.



Copilot AI review requested due to automatic review settings October 1, 2025 15:43
Contributor

Copilot AI left a comment


Pull Request Overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.


Copilot AI review requested due to automatic review settings October 1, 2025 15:53
Contributor

Copilot AI left a comment


Pull Request Overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated no new comments.



Contributor

Copilot AI left a comment


Pull Request Overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.



@GabrielBianconi GabrielBianconi marked this pull request as ready for review October 1, 2025 16:20
Contributor

Copilot AI left a comment


Pull Request Overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.



@virajmehta virajmehta self-assigned this Oct 1, 2025
virajmehta
virajmehta previously approved these changes Oct 1, 2025
@virajmehta virajmehta enabled auto-merge October 1, 2025 16:40
@virajmehta virajmehta added this pull request to the merge queue Oct 1, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to a conflict with the base branch Oct 1, 2025
Copilot AI review requested due to automatic review settings October 1, 2025 21:22
Contributor

Copilot AI left a comment


Pull Request Overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.



@virajmehta virajmehta enabled auto-merge October 1, 2025 21:36
@virajmehta virajmehta added this pull request to the merge queue Oct 1, 2025
Merged via the queue into main with commit 0a94257 Oct 1, 2025
30 checks passed
@virajmehta virajmehta deleted the gb/fix-3648 branch October 1, 2025 22:20


Development

Successfully merging this pull request may close these issues.

max_tokens should not be required if there is no relevant rate limit

4 participants