When an agent produces a bad answer, today's option is to stare at logs. A proper debugger would let the user step through the agent's execution — see the LLM call, the tool invocation, the intermediate state, the next LLM call — and pause at any step.
What needs to ship:
User impact: debugging agents today is "cross your fingers and re-run"; a real debugger would turn agent development from vibes-based to engineering.
When an agent produces a bad answer, today's option is to stare at logs. A proper debugger would let the user step through the agent's execution — see the LLM call, the tool invocation, the intermediate state, the next LLM call — and pause at any step.
What needs to ship:
User impact: debugging agents today is "cross your fingers and re-run"; a real debugger would turn agent development from vibes-based to engineering.