fix: prevent false success reporting on unresolved blocking error by sauravpanda · Pull Request #4324 · browser-use/browser-use

sauravpanda · 2026-03-11T03:33:09Z

Summary by cubic

Prevents false success by adding a blocking‑error check and tightens data grounding to verbatim copy across all browser agent prompts. Still forbids fabricated data; derived values are allowed with screenshot-based verification. Also reverts google-genai to 1.60.0 for stability.

Bug Fixes
- Blocking error policy: payment declined, login without creds, email/verification wall, required paywall, access denied → set success=false. Temporary obstacles (CAPTCHA, popups) don’t count.
- Data grounding: every URL/price/name/value must be observed verbatim in tool outputs, browser_state, or screenshot — copy exactly; don’t paraphrase or normalize URLs. Derived values (counts/totals) from observed data are allowed. Never construct; say not found.
- Prompt cleanup: removed hard‑constraint enforcement, explicit scroll exceptions/“pages below” rules, and extract dedup guidance (already_collected/results file). Applied across all prompt variants.
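
The blocking-error policy above lives in prompt text, not in code. Purely as an illustration, the decision rule it describes could be sketched in Python as below. `Obstacle`, `BLOCKING`, and `final_success` are hypothetical names and are not part of browser-use.

```python
from enum import Enum


class Obstacle(Enum):
    """Obstacle kinds named in the summary (hypothetical enum)."""
    # Blocking errors: the task cannot honestly be reported as done.
    PAYMENT_DECLINED = "payment_declined"
    LOGIN_WITHOUT_CREDENTIALS = "login_without_credentials"
    VERIFICATION_WALL = "verification_wall"
    REQUIRED_PAYWALL = "required_paywall"
    ACCESS_DENIED = "access_denied"
    # Temporary obstacles: these do not count as blocking.
    CAPTCHA = "captcha"
    POPUP = "popup"


BLOCKING = frozenset({
    Obstacle.PAYMENT_DECLINED,
    Obstacle.LOGIN_WITHOUT_CREDENTIALS,
    Obstacle.VERIFICATION_WALL,
    Obstacle.REQUIRED_PAYWALL,
    Obstacle.ACCESS_DENIED,
})


def final_success(steps_completed: bool, unresolved: list[Obstacle]) -> bool:
    """Return the success flag: any unresolved blocking error forces False."""
    if any(obstacle in BLOCKING for obstacle in unresolved):
        return False
    return steps_completed
```

Under this sketch, a run that finished its steps but ended on a declined payment reports `success=false`, while one that only met a CAPTCHA is judged on whether its steps completed.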

Dependencies
- Reverted google-genai from 1.65.0 to 1.60.0.

Written for commit b43c7dd. Summary will update on new commits.

cubic-dev-ai

2 issues found across 5 files

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="browser_use/agent/system_prompts/system_prompt_anthropic_flash.md">

<violation number="1" location="browser_use/agent/system_prompts/system_prompt_anthropic_flash.md:96">
P2: This grounding check omits `browser_vision`, so screenshot-only facts can no longer be treated as verified even though the prompt defines screenshots as the ground truth.</violation>
</file>

<file name="browser_use/agent/system_prompts/system_prompt.md">

<violation number="1" location="browser_use/agent/system_prompts/system_prompt.md:148">
P2: This wording forbids grounded derived results like counts, so tasks that require counting or simple computation can be incorrectly reported as `not found` or unsuccessful.</violation>
</file>
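
Both findings concern what counts as grounded. The following is a minimal sketch, assuming each observation source is available as plain text. The `browser_vision` key stands in for text read off the screenshot, and every name and sample value here is hypothetical rather than taken from browser-use.

```python
from typing import Optional

# Maps a source label to the text observed from it (assumed representation).
Sources = dict[str, str]


def is_grounded(value: str, sources: Sources) -> bool:
    """True if a non-empty value appears verbatim in any observed source."""
    return bool(value) and any(value in text for text in sources.values())


def grounded_count(items: list[str], sources: Sources) -> Optional[int]:
    """Derived value: a count is allowed only when every counted item is grounded."""
    if all(is_grounded(item, sources) for item in items):
        return len(items)
    return None  # caller should report "not found" instead of guessing


observed: Sources = {
    "tool_output": "",
    "browser_state": "Laptop A $999\nLaptop B $1,249",
    "browser_vision": "Laptop C $799",  # visible only in the screenshot
}
```

Because `browser_vision` is one of the checked sources, a fact seen only in the screenshot still verifies, and a count over grounded items is returned rather than rejected.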



github-actions · 2026-03-11T18:00:13Z

Agent Task Evaluation Results: 2/2 (100%)


| Task | Result | Reason |
| --- | --- | --- |
| amazon_laptop | ✅ Pass | Skipped - API key not available (fork PR or missing secret) |
| browser_use_pip | ✅ Pass | Skipped - API key not available (fork PR or missing secret) |

Check the evaluate-tasks job for detailed task execution logs.

cubic-dev-ai

1 issue found across 3 files (changes from recent commits).

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="browser_use/agent/system_prompts/system_prompt.md">

<violation number="1" location="browser_use/agent/system_prompts/system_prompt.md:148">
P2: Keep a verbatim requirement for direct fields; `observed` alone is loose enough to allow paraphrased names or normalized URLs back into `success=true` responses.</violation>
</file>
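
To illustrate why "observed" alone may be too loose, the sketch below contrasts a lenient host-only match with a verbatim match. Both functions and the sample URLs are hypothetical and are not how browser-use checks anything; the PR enforces this through prompt wording only.

```python
from urllib.parse import urlsplit


def loosely_observed(url: str, page_text: str) -> bool:
    """Lenient check: passes as soon as the URL's host appears on the page."""
    return urlsplit(url).netloc in page_text


def verbatim_observed(url: str, page_text: str) -> bool:
    """Strict check: the URL must appear exactly as written on the page."""
    return url in page_text


page_text = "See https://example.com/Products?id=42&ref=nav for details"
exact_url = "https://example.com/Products?id=42&ref=nav"
normalized_url = "https://example.com/products?id=42"  # lowercased, query trimmed
```

Here the normalized URL passes the lenient check but fails the verbatim one, which is the gap the reviewer is pointing at.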


fix: prevent false success reporting on unresolved blocking error

bda29e6

cubic-dev-ai Bot reviewed Mar 11, 2026


Comment thread browser_use/agent/system_prompts/system_prompt_anthropic_flash.md Outdated

Comment thread browser_use/agent/system_prompts/system_prompt.md Outdated

Merge branch 'main' of https://github.com/browser-use/browser-use int…

bbd5bf5

…o add-better-data-grounding

made prompt bit more linient

00748ba

cubic-dev-ai Bot reviewed Mar 11, 2026


Comment thread browser_use/agent/system_prompts/system_prompt.md Outdated

sauravpanda added 2 commits March 11, 2026 11:34

tighten data grounding to verbatim cop

3e76743

reverted to old gemini sdk

b43c7dd

sauravpanda merged commit d2272ab into main Mar 11, 2026
80 checks passed

sauravpanda deleted the add-better-data-grounding branch March 11, 2026 22:33

sauravpanda merged 5 commits into main from add-better-data-grounding


1 participant
