@elaminm2003

Closes #176 ("Deal with message history that exceeds token limit")
This PR prevents the message history from growing indefinitely, which previously led to high operational costs and frequent context-length errors in LLM calls.

We now enforce a configurable token limit, ensuring better stability and cost control.

Key Changes

Token Limit Enforcement:

Added a new setting, `MAX_TOKEN_LIMIT` (default: 100,000 tokens), in `src/ansari/config.py`.
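A minimal sketch of the new setting, assuming the config module uses a pydantic-style settings class (the actual class name and neighboring fields in `src/ansari/config.py` may differ):

```python
from pydantic_settings import BaseSettings


class Settings(BaseSettings):
    # Hard ceiling on tokens in a conversation's message history; requests
    # whose history exceeds this are refused rather than sent to the LLM.
    MAX_TOKEN_LIMIT: int = 100000
```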

Integrated the `tiktoken` library to accurately count tokens in the message history (`src/ansari/agents/ansari.py`).
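Token counting with `tiktoken` might look roughly like this; the encoding name (`cl100k_base`) and the message schema are assumptions, not necessarily what `ansari.py` uses:

```python
import tiktoken


def count_message_history_tokens(messages: list[dict]) -> int:
    """Sum token counts over the text content of each message."""
    encoding = tiktoken.get_encoding("cl100k_base")
    total = 0
    for message in messages:
        content = message.get("content")
        if isinstance(content, str):  # skip None and non-text payloads
            total += len(encoding.encode(content))
    return total
```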

The `process_message_history` function now checks the token count. If the limit is exceeded, the agent gracefully refuses to process the request and prompts the user to start a new conversation.
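Illustratively, the check at the top of `process_message_history` could look like the following; the refusal wording, the `self.settings` attribute, and the helper name are hypothetical:

```python
TOKEN_LIMIT_MESSAGE = (
    "This conversation has become too long for me to process. "
    "Please start a new conversation."
)


def process_message_history(self, messages: list[dict]):
    # Refuse up front rather than sending an oversized request to the LLM.
    if count_message_history_tokens(messages) > self.settings.MAX_TOKEN_LIMIT:
        return TOKEN_LIMIT_MESSAGE
    # ... normal history processing and the LLM call continue here ...
```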

Bug Fix:

Resolved a circular import issue in `src/ansari/app/main_api.py` by reordering imports. This ensures the server starts correctly when `whatsapp_router` is included.
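The fix amounts to reordering imports in `main_api.py`. The sketch below is illustrative only, assuming a FastAPI app; the module path for the router and the surrounding code are assumptions:

```python
from fastapi import FastAPI

app = FastAPI()

# Imported only after `app` exists: if the whatsapp module imports names
# from main_api, importing it at the top of the file creates a cycle.
from ansari.app.whatsapp_router import router as whatsapp_router  # noqa: E402

app.include_router(whatsapp_router)
```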

Testing
Added unit tests in `tests/unit/test_token_limit.py` verifying that the token limit is enforced correctly (long histories are blocked while short ones pass).
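A rough shape of those tests, reusing the hypothetical `count_message_history_tokens` helper sketched above; the real tests in `tests/unit/test_token_limit.py` will differ in detail:

```python
def test_long_history_exceeds_limit():
    # 200,000 repeated words comfortably exceed the 100,000-token default.
    history = [{"role": "user", "content": "word " * 200_000}]
    assert count_message_history_tokens(history) > 100_000


def test_short_history_within_limit():
    history = [{"role": "user", "content": "Hello"}]
    assert count_message_history_tokens(history) <= 100_000
```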
