Skip to content

CrewBench/CrewBench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Contributors Forks Stargazers Issues project_license


Logo

CrewBench

An open-source, enhanced version of Claude Cowork built specifically for programmers.
Explore the docs »

Getting Started · View Demo · Report Bug · Request Feature · Contact Us

Table of Contents
  1. Overview
  2. Getting Started
  3. CrewBench Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contact
  8. Acknowledgments

Overview

CrewBench Home

CrewBench is a desktop UI and control layer for AI coding agents such as Claude Code, Gemini CLI, and Codex.

It provides visibility, control, and rollback over how AI agents read and modify a real codebase.

Note

CrewBench is built on top of AionUi (Apache-2.0) and inherits all upstream AionUi capabilities, while adding developer-focused observability and guardrails.

Core Features

Feature Description
Universal Model Support Work with cloud and local models through a unified interface, including OpenAI-compatible APIs and local runtimes.
Agent Coworking Environment Multiple agents and tasks can run in parallel with isolated context, enabling real cowork-style workflows.
Direct File System Access Agents operate directly on real project files via a file-tree interface, not abstract chat outputs.
File Timeline Every file change is recorded in a chronological timeline, making agent activity inspectable and reviewable.
Action Logs & Audit Trail Full logs of both AI and human actions provide observability, debugging capability, and reproducibility.
Live Preview & Editing Generated artifacts are surfaced as editable, previewable files rather than hidden chat responses.
Extensible & Customizable UI The interface is developer-controlled, supporting layout flexibility and custom CSS theming.

Special Features:

File Level Version Tracking and CrewBench Logging

File Timeline

CrewBench adds AI-aware file version tracking, recording every agent-initiated change:

  • Full content snapshots per change
  • Line-based diffs between versions
  • Operation type (create, write, delete)
  • Agent attribution
  • Timestamped history per workspace

This complements Git by tracking agent behavior, not commits.

Action Logs

Action Logs

Action logs provide a comprehensive audit trail of all actions performed by agents and humans in the workspace. Each log entry includes:

  • Timestamp
  • Action type (create, write, delete)
  • Agent attribution
  • File path
  • Content snapshots
  • Line-based diffs

Technical Focus Areas

Focus Area What It Covers Why It Matters
Observability • File timelines and diffs
• Clear agent vs human attribution
• Action-level history
Lets developers see exactly what happened, who did it (human or AI), and in what order
Control / Guardrails • Path-based read/write restrictions
• Operation-level permissions
• Confirmation gates for risky actions
Prevents agents from touching the wrong files or making unsafe changes
Reproducibility • Inspectable agent action sequences
• Debugging and diagnosis of AI behavior
Makes AI behavior debuggable, auditable, and repeatable
Human + Agent Workflows • Continuous history across human and agent edits
• No context loss between steps
Enables seamless collaboration without losing state or intent

Our Relationship to AionUi

AionUi provides the base platform, including:

  • Multi-agent orchestration
  • Desktop UI infrastructure
  • File system access
  • Local-first execution
  • Cross-platform support

CrewBench builds on top of this foundation and focuses specifically on software engineering workflows.

Note

What CrewBench Does Not Do

  • Does not replace Git
  • Does not replace an IDE
  • Does not introduce new AI models
  • Does not focus on prompt presets

CrewBench operates at the control and observability layer, not the editor or model layer.

Getting Started

Download Latest Release

System Requirements

  • macOS: 10.15 or higher
  • Windows: Windows 10 or higher
  • Linux: Ubuntu 18.04+ / Debian 10+ / Fedora 32+
  • Memory: Recommended 4GB or more
  • Storage: At least 500MB available space

Prerequisites

  • CrewBench is a desktop application and does not require any prerequisites.

Installation

  1. Download the latest release from here.
  2. Install the downloaded file.
  3. Now you can run the CrewBench as an Application file.

(back to top)

CrewBench Usage

Cowork With CLI AI Agents (Unified Developer Interface)

CrewBench acts as a graphical cowork layer on top of existing command-line AI coding tools like Gemini CLI, Claude Code, Codex, Qwen Code, Goose, and more.

Capability Description
Multi-Agent Sessions Run multiple AI coding agents simultaneously
Unified Interface One UI instead of multiple terminals
Context Isolation Each agent and task runs with independent memory
Local Execution Agents run locally on your machine

Cowork Directly on Your Codebase

Agents operate on real project files — not abstract chat outputs.

Developer Need How CrewBench Supports It
Code Exploration File-tree based navigation
File Modification Agents read and write actual project files
Batch Operations Rename, merge, and reorganize files
Safe Changes All changes are tracked and reviewable

Inspect Agent Output, Not Just Results

CrewBench surfaces agent outputs as first-class artifacts you can inspect and modify.

Visibility Feature Description
Multi-Format Preview Code, Markdown, HTML, documents, images, diffs
Live File Sync Previews update automatically when files change
Inline Editing Edit or refine agent output directly
Change Awareness See what changed, not just the final state

File Timeline and Action Logs

Engineering-grade observability for AI coding agents.

Observability Tool Purpose
File Timeline Chronological history of every file change
Action Logs Full audit trail of agent and human actions
Change Attribution Clear mapping of who (or which agent) did what
Reproducibility Inspect and debug agent behavior

Parallel Task Execution Without Context Collision

Run multiple tasks simultaneously without cross-contamination.

Workflow Pattern Support
Parallel Sessions Multiple active tasks at the same time
Isolated Memory No cross-task confusion
Long-Running Work Persistent sessions for ongoing work

Remote Coworking and Team Workflows

Control and observe agents remotely while keeping execution local.

Capability Description
WebUI Access Control CrewBench from other devices on the network
Local Agent Control Agents continue running on the host machine
Server Deployment Suitable for self-hosted setups
CrewBench Teams Collaborative multi-developer workflows (coming soon)

Developer-Controlled Interface

CrewBench adapts to your workflow — not the other way around.

Customization Description
CSS Customization Full control over UI appearance
Layout Flexibility Shape the interface around your workflow
Personal Setup Make CrewBench feel like your own IDE

(back to top)

Roadmap

  • [Completed] File Timeline
  • [In Progress] Action Logs
  • [Not Started] GuardRails

(back to top)

Contributing to CrewBench

To contribute to CrewBench, please read the CONTRIBUTING.md file.

Top contributors:

contrib.rocks image

License

Distributed under the project_license. See LICENSE.txt for more information.

(back to top)

Contact

Abhinav Sharma - x/abhinavsharma_x

Project Link: https://github.com/CrewBench/CrewBench

(back to top)

Acknowledgments

  • Electron — for enabling cross-platform desktop application development
  • Node.js — for the underlying runtime and tooling ecosystem
  • SQLite — for reliable, local-first data storage
  • OpenAI — for OpenAI-compatible APIs and ecosystem standards
  • Google Gemini — for multimodal and large-context model support
  • Anthropic Claude — for advancing safe and capable coding agents

(back to top)

About

An open-source, enhanced version of Claude Cowork built specifically for programmers. PLEASE STAR IF YOU LIKED IT.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors