Skip to content

[Phase 2.5-F] End-to-end CLI validation with cf-test project #353

@frankbria

Description

@frankbria

Summary

Validate the ReAct engine end-to-end by running it against the cf-test project (task tracker CLI from requirements.md). This is the primary acceptance test for the entire Phase 2.5 redesign.

Acceptance Criteria

  • Clean cf-test directory: remove previous .codeframe workspace and generated code
  • Run full Golden Path with ReAct engine:
    1. `cf init ~/projects/cf-test --detect`
    2. `cf prd add requirements.md`
    3. `cf tasks generate`
    4. Mark tasks READY
    5. `cf work start --execute --engine react` for each task
  • Success criteria (from unified plan):
    • Build working task tracker CLI with tests, on first attempt
    • 0 ruff lint errors
    • pyproject.toml preserved (not overwritten)
    • No cross-file naming mismatches
    • Lint errors caught per-edit and fixed in next iteration
    • Each task completes within 30 iterations
  • Document results: success rate, iteration counts, token usage, any failures
  • Compare with Plan-and-Execute engine on same tasks (cost, success rate, quality)
  • All existing codeframe tests still pass

Implementation Notes

  • Use the existing cf-test project at ~/projects/cf-test with requirements.md
  • Source ANTHROPIC_API_KEY from /home/frankbria/projects/codeframe/.env
  • Run with --verbose for detailed output
  • Document any issues found and file follow-up issues if needed

Dependencies

Reference

  • docs/AGENT_V3_UNIFIED_PLAN.md (Success Criteria)
  • ~/projects/cf-test/requirements.md (test project PRD)

Metadata

Metadata

Assignees

No one assigned

    Labels

    phase-2.5Phase 2.5: Agent Execution Redesign (ReAct)phase-2.5-FPhase F: Validation & Default Switchtesting

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions