Agent mode stopping short of fully implementing #170088

matanox · 2025-08-17T10:45:26Z

Topic Area: Question

Copilot Feature Area: Copilot Agent Mode

I very often find that agent mode stops short of really implementing a class or small module, leaving comments in the code like:

"this is just a simulation of x,y,z"
"For now, we'll return None as placeholder"
"TODO: Create x,y,z when foo is available"

This happens on modest objectives, like converting a Python class to Rust and writing a parity test for it. That is not entirely trivial, but since converting code should be fairly benign work for a code LLM, you'd expect it to manage.

What are your thoughts on this?

A wild guess (a human hallucination, if you like): maybe if it didn't try to stop its work so early, it would be able to push through within the same session. Or maybe it loses a lot of context between sessions on the same PR (or blows up its own context with the details of its own work) and, like a human, loses focus and forgets its originally sound implementation plan.

This would be manageable, except that after stopping at about 20% of the work in the first session, it needs to be reminded in a PR comment what to do next, and even for the smaller chunk of work given in that follow-up comment, it still produces those "TODO" placeholders rather than a full implementation.

BTW, it often, though not always, claims it has implemented everything, but then you see those placeholders and TODOs in the code.

Its adherence to the repo's copilot-instructions.md guidelines also degrades as the session progresses.

Answered by Git-Kapish

Git-Kapish · 2025-08-17T11:00:16Z

@matanox You’re not alone in noticing this. Agent mode sometimes leaves behind “TODO” placeholders or partial implementations instead of fully finishing the task. This usually happens for a few reasons:

Context window limitations:
The model can “lose track” of the broader plan if the code or discussion is long. It tries to stay safe by leaving TODOs instead of making incorrect assumptions.

Guardrails by design:
Copilot is tuned to prefer being cautious rather than generating incorrect/unsafe code. A placeholder signals “this part still needs human input” instead of silently pushing something broken.

Session continuity:
Since agent mode doesn’t truly remember across sessions, it may stop mid-way. Each session is like starting fresh with partial context, so follow-ups often repeat the same “TODO” style behavior.

👉 How to manage it:

Break larger tasks into smaller steps (“first convert the class”, then “add the parity test”) instead of asking for everything at once.

Remind it explicitly of earlier goals in the same PR thread (this helps it “regain focus”).

If you see TODOs, you can prompt it directly like: “Please now replace the TODO with a full implementation.” This often makes it push past its safe stopping point.

So in short: it’s not so much hallucination as a mix of context limits and cautious defaults. With tighter prompts and follow-ups, you can usually get it to finish the work more reliably.
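Since the agent may claim it implemented everything while placeholders remain, one way to spot TODOs before writing a follow-up is a small scan of the changed files. The following is only a minimal sketch: the marker list is an assumption seeded from the comment styles quoted in this thread, and the names `PLACEHOLDER_PATTERNS`, `find_placeholders` and `scan_files` are hypothetical, not part of Copilot or any GitHub tooling.

```python
import re
from pathlib import Path

# Hypothetical marker list, based on the placeholder comments quoted in this thread.
# Adjust it to whatever wording the agent tends to leave in your repo.
PLACEHOLDER_PATTERNS = [
    r"\bTODO\b",
    r"\bplaceholder\b",
    r"\bjust a simulation\b",
    r"\bfor now\b",
]
_MARKER = re.compile("|".join(PLACEHOLDER_PATTERNS), re.IGNORECASE)


def find_placeholders(text):
    """Return (line_number, stripped_line) pairs for lines containing a marker."""
    return [
        (number, line.strip())
        for number, line in enumerate(text.splitlines(), start=1)
        if _MARKER.search(line)
    ]


def scan_files(paths):
    """Map each file path to its placeholder hits, leaving out clean files."""
    report = {}
    for path in paths:
        text = Path(path).read_text(encoding="utf-8", errors="replace")
        hits = find_placeholders(text)
        if hits:
            report[str(path)] = hits
    return report
```

Calling `scan_files` on the files touched by the PR gives a concrete list of locations to quote back to the agent. The patterns are deliberately loose, so expect some false positives (for example, a legitimate comment containing "for now").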

1 reply

matanox Aug 17, 2025
Author

Good points!

Although I guess that “Please now replace the TODO with a full implementation.” should be augmented with a reminder of the overall task or, rather, a careful re-framing of the smaller task at hand (which gets a little tedious). I assume much of this can be prevented by right-sizing the tasks in the first place.

My only comment is that, under the guardrails approach, it would be better if the agent said something more explicit about why and where it stopped short of implementing, to reduce friction and make the sessions more useful.
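To make that re-framing less tedious, the follow-up comment could be templated. This is just a sketch under assumptions: `build_follow_up` and its parameters are hypothetical names, and the wording of the prompt is illustrative rather than anything Copilot is known to treat specially.

```python
def build_follow_up(overall_goal, current_step, placeholders):
    """Compose a follow-up PR comment for the agent.

    overall_goal: one-sentence reminder of the whole task.
    current_step: the right-sized chunk to do in this session.
    placeholders: list of (location, comment) pairs to be replaced.
    """
    lines = [
        f"Overall task: {overall_goal}",
        f"Current step (do only this): {current_step}",
    ]
    if placeholders:
        lines.append("Replace each of these placeholders with a full implementation:")
        lines.extend(f"- {location}: {comment}" for location, comment in placeholders)
    # Ask for an explicit account of any remaining gaps instead of silent TODOs.
    lines.append(
        "If anything cannot be fully implemented, state where you stopped and why."
    )
    return "\n".join(lines)
```

For example, `build_follow_up("Convert the Python class to Rust and add a parity test", "Convert the class only", [("src/lib.rs:10", "TODO: Create x,y,z when foo is available")])` yields a comment that restates the goal, scopes the step, and names the exact placeholder (the file path here is made up for illustration).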

matanox · 2025-08-17T11:12:53Z

Does it really forget its initial implementation plan while it rewrites its PR comments, and even the PR title, as the trail of comments goes on and narrows toward solving issues with the implementation? Or does it somehow give its initial plan some kind of special context status across new sessions?

0 replies
