Background
Phase 1 of the agentic mobile testing PoC (#87030) ships the agent-device skill at .claude/skills/agent-device/ via #87662. The skill drives iOS/Android by feeding accessibility tree snapshots to the LLM, which reasons about every tap, scroll, and assertion. In practice, most sessions share a setup prefix - signing in, clearing onboarding, navigating to a specific surface - before the agent reaches the screen the task actually concerns.
Problem
When the agent drives a recurring setup prefix entirely from LLM reasoning, each run traverses the UI via a different path and timing, which prevents the session from producing a deterministic, quickly-reached starting state for the screen under test.
Solution
Introduce an automated-flow mechanism in the agent-device skill: each flow is a named, deterministic unit that drives the UI to a known surface without LLM reasoning. Concrete authoring format, storage layout, drift handling, and the initial flow set are decided in the implementation follow-up.
Issue Owner
Current Issue Owner: @kacper-mikolajczak
Upwork Automation - Do Not Edit
- Upwork Job URL: https://www.upwork.com/jobs/~022046528898951391896
- Upwork Job ID: 2046528898951391896
- Last Price Increase: 2026-04-21
Background
Phase 1 of the agentic mobile testing PoC (#87030) ships the
agent-deviceskill at.claude/skills/agent-device/via #87662. The skill drives iOS/Android by feeding accessibility tree snapshots to the LLM, which reasons about every tap, scroll, and assertion. In practice, most sessions share a setup prefix - signing in, clearing onboarding, navigating to a specific surface - before the agent reaches the screen the task actually concerns.Problem
When the agent drives a recurring setup prefix entirely from LLM reasoning, each run traverses the UI via a different path and timing, which prevents the session from producing a deterministic, quickly-reached starting state for the screen under test.
Solution
Introduce an automated-flow mechanism in the
agent-deviceskill: each flow is a named, deterministic unit that drives the UI to a known surface without LLM reasoning. Concrete authoring format, storage layout, drift handling, and the initial flow set are decided in the implementation follow-up.Issue Owner
Current Issue Owner: @kacper-mikolajczakUpwork Automation - Do Not Edit