The heuristic classifier caps at ~70% accuracy. Messages where `confident=False` are currently classified as `informational` by default, which is wrong. When the heuristic is not confident, the LLM should classify the message directly instead of the code falling back to a guessed category.
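
A minimal sketch of the intended fallback, under stated assumptions: `HeuristicResult`, `triage`, and the `llm_classify` callable are hypothetical names used only for illustration, and the real return type in `triage_heuristics.py` may look different. The only point is the control flow: trust the heuristic when it is confident, otherwise hand the message to the LLM.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class HeuristicResult:
    # Hypothetical shape; the actual heuristic output may carry other fields.
    category: str
    confident: bool


def triage(
    message: str,
    heuristic: Callable[[str], HeuristicResult],
    llm_classify: Callable[[str], str],
) -> str:
    """Return the heuristic category when confident, else defer to the LLM."""
    result = heuristic(message)
    if result.confident:
        return result.category
    # Low confidence: ask the LLM rather than defaulting to "informational".
    return llm_classify(message)
```

Passing the LLM call in as a callable keeps the sketch testable with a stub and avoids assuming anything about the agent's actual LLM client.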
Tasks
Key files: `src/gaia/agents/email/tools/triage_heuristics.py`, `src/gaia/agents/email/tools/read_tools.py`