🤖 Coding Agent Comparison Lab

A systematic comparison of AI coding assistants by implementing identical project prompts across different models

📊 Overview

This repository contains mini-projects implemented by different AI coding agents using identical prompts. The goal is to provide a transparent, side-by-side comparison of how different AI models approach the same coding challenges.

Participating Agents

Claude (Anthropic) - Claude 3.5 Sonnet via Cursor/Windsurf
OpenAI Codex - GPT-4 via GitHub Copilot/OpenAI API

🏗️ Repository Structure

coder-comps/
│
├── projects/               # Main project implementations
│   ├── project-01/        # First comparison project
│   │   ├── prompt.md      # Original project prompt
│   │   ├── claude/        # Claude's implementation
│   │   └── codex/         # Codex's implementation
│   │
│   ├── project-02/        # Second comparison project
│   │   ├── prompt.md
│   │   ├── claude/
│   │   └── codex/
│   │
│   └── .../               # Additional projects
│
├── evaluations/           # Comparison metrics and analysis
│   └── criteria.md        # Evaluation criteria
│
└── docs/                  # Documentation
    └── methodology.md     # Testing methodology

🌳 Branch Strategy

Each project follows this branching pattern:

main
 ├── project-01-prompt     # Contains only the prompt
 │   ├── project-01-claude # Claude's implementation
 │   └── project-01-codex  # Codex's implementation
 │
 ├── project-02-prompt
 │   ├── project-02-claude
 │   └── project-02-codex
 └── ...

After implementation, branches are merged into the main branch's folder structure for easy comparison.

📝 Project List

Project	Description	Status	Claude	Codex
Project 01	TBD	🔜 Planned	-	-
Project 02	TBD	🔜 Planned	-	-
Project 03	TBD	🔜 Planned	-	-
Project 04	TBD	🔜 Planned	-	-
Project 05	TBD	🔜 Planned	-	-

Status Legend

🔜 Planned - Prompt not yet created
📝 Prompt Ready - Prompt created, awaiting implementation
🚧 In Progress - Currently being implemented
✅ Complete - Both implementations finished
📊 Evaluated - Comparison analysis complete

🎯 Evaluation Criteria

Each implementation will be assessed on:

Correctness - Does it meet the requirements?
Code Quality - Readability, structure, best practices
Performance - Efficiency and optimization
Documentation - Comments, README, setup instructions
Testing - Test coverage and quality
Innovation - Creative solutions and extra features
Error Handling - Robustness and edge cases

🚀 Getting Started

For Contributors

Creating a New Project Prompt

git checkout -b project-XX-prompt
# Add prompt.md to projects/project-XX/
git commit -m "Add prompt for project XX"

Implementing with an Agent

git checkout -b project-XX-[agent-name] project-XX-prompt
# Let the agent implement the solution
git commit -m "Project XX: [Agent] implementation"

Merging to Main

git checkout main
# Copy implementation to projects/project-XX/[agent-name]/
git commit -m "Add [Agent] implementation for project XX"

For Viewers

Browse the projects/ directory to see side-by-side comparisons of how different AI agents approach the same problems.

📈 Insights & Observations

Key findings and patterns will be documented here as projects are completed.

🤝 Contributing

This is an experimental project to understand AI coding capabilities. If you'd like to suggest project ideas or evaluation criteria, please open an issue.

📄 License

MIT License - See LICENSE file for details.

🔗 Resources

This repository is maintained for educational and research purposes to understand the capabilities and differences between AI coding assistants.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
docs		docs
evaluations		evaluations
projects		projects
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
WORKFLOW.md		WORKFLOW.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🤖 Coding Agent Comparison Lab

📊 Overview

Participating Agents

🏗️ Repository Structure

🌳 Branch Strategy

📝 Project List

Status Legend

🎯 Evaluation Criteria

🚀 Getting Started

For Contributors

For Viewers

📈 Insights & Observations

🤝 Contributing

📄 License

🔗 Resources

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

jakenuts/coder-comps

Folders and files

Latest commit

History

Repository files navigation

🤖 Coding Agent Comparison Lab

📊 Overview

Participating Agents

🏗️ Repository Structure

🌳 Branch Strategy

📝 Project List

Status Legend

🎯 Evaluation Criteria

🚀 Getting Started

For Contributors

For Viewers

📈 Insights & Observations

🤝 Contributing

📄 License

🔗 Resources

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages