-
Notifications
You must be signed in to change notification settings - Fork 5
Closed
Labels
phase-2.5Phase 2.5: Agent Execution Redesign (ReAct)Phase 2.5: Agent Execution Redesign (ReAct)phase-2.5-FPhase F: Validation & Default SwitchPhase F: Validation & Default Switchtesting
Description
Summary
Validate the ReAct engine end-to-end by running it against the cf-test project (task tracker CLI from requirements.md). This is the primary acceptance test for the entire Phase 2.5 redesign.
Acceptance Criteria
- Clean cf-test directory: remove previous .codeframe workspace and generated code
- Run full Golden Path with ReAct engine:
- `cf init ~/projects/cf-test --detect`
- `cf prd add requirements.md`
- `cf tasks generate`
- Mark tasks READY
- `cf work start --execute --engine react` for each task
- Success criteria (from unified plan):
- Build working task tracker CLI with tests, on first attempt
- 0 ruff lint errors
- pyproject.toml preserved (not overwritten)
- No cross-file naming mismatches
- Lint errors caught per-edit and fixed in next iteration
- Each task completes within 30 iterations
- Document results: success rate, iteration counts, token usage, any failures
- Compare with Plan-and-Execute engine on same tasks (cost, success rate, quality)
- All existing codeframe tests still pass
Implementation Notes
- Use the existing cf-test project at ~/projects/cf-test with requirements.md
- Source ANTHROPIC_API_KEY from /home/frankbria/projects/codeframe/.env
- Run with --verbose for detailed output
- Document any issues found and file follow-up issues if needed
Dependencies
- All of Phases A-E must be complete ([Phase 2.5-A] Implement SearchReplaceEditor with fuzzy matching #343-[Phase 2.5-E] Implement token budget tracking and message compaction #352)
Reference
docs/AGENT_V3_UNIFIED_PLAN.md(Success Criteria)- ~/projects/cf-test/requirements.md (test project PRD)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
phase-2.5Phase 2.5: Agent Execution Redesign (ReAct)Phase 2.5: Agent Execution Redesign (ReAct)phase-2.5-FPhase F: Validation & Default SwitchPhase F: Validation & Default Switchtesting