Incorrect exception type returned for gemini when token limit is exceeded

### Initial Checks

- [x] I confirm that I'm using the latest version of Pydantic AI
- [x] I confirm that I searched for my issue in https://github.com/pydantic/pydantic-ai/issues before opening this issue

### Description

When a `max_tokens` limit passed to the agent through `model_settings` is too restrictive and causes the model run to fail, the exception raised is of the wrong type.


```python
from pydantic_ai import Agent

Agent("google-gla:gemini-2.5-pro-preview-05-06", model_settings=dict(max_tokens=5)).run_sync("write a haiku")
```

yields

```
UnexpectedModelBehavior: Content field missing from Gemini response, body: (...)
```

instead of `UsageLimitExceeded`
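
To make the expected behavior concrete, here is a minimal, library-free sketch of the distinction being asked for. Everything in it is an assumption for illustration: the two exception classes are local stand-ins for the Pydantic AI exceptions of the same name, `classify_response` is a hypothetical helper (not Pydantic AI's actual code), and the response body is an assumed shape based on the Gemini API's `finishReason` field, not the elided body from the error above.

```python
# Local stand-ins for the pydantic_ai exceptions of the same name, so this
# sketch runs without the library, network access, or an API key.
class UnexpectedModelBehavior(Exception):
    pass


class UsageLimitExceeded(Exception):
    pass


def classify_response(body: dict) -> str:
    """Hypothetical helper: return the response text, or raise the
    exception type this issue argues for when output was cut off."""
    candidate = body["candidates"][0]
    parts = candidate.get("content", {}).get("parts")
    if parts:
        # Normal case: concatenate the text of all parts.
        return "".join(part.get("text", "") for part in parts)
    if candidate.get("finishReason") == "MAX_TOKENS":
        # No content because the token limit was hit before any output.
        raise UsageLimitExceeded("max_tokens reached before any content was produced")
    # Content is missing for some other, genuinely unexpected reason.
    raise UnexpectedModelBehavior(
        f"Content field missing from Gemini response, body: {body}"
    )


# Assumed shape of a truncated response (illustrative only).
truncated = {"candidates": [{"content": {"role": "model"}, "finishReason": "MAX_TOKENS"}]}

try:
    classify_response(truncated)
    outcome = "no error"
except UsageLimitExceeded as exc:
    outcome = type(exc).__name__
```

Under these assumptions, a caller could catch the token-limit case separately (for example, to retry with a larger `max_tokens`) without also swallowing unrelated malformed responses.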

### Example Code

```Python
from pydantic_ai import Agent
Agent("google-gla:gemini-2.5-pro-preview-05-06", model_settings=dict(max_tokens=5)).run_sync("write a haiku")
```

### Python, Pydantic AI & LLM client version

```Text
Python 3.12.4
Pydantic AI 0.3.1
google-gla:gemini-2.5-pro-preview-05-06
```

Incorrect exception type returned for gemini when token limit is exceeded #2021

metaember commented on Jun 18, 2025
