GitHub - CrewBench/CrewBench: An open-source, enhanced version of Claude Cowork built specifically for programmers. PLEASE STAR IF YOU LIKED IT.

CrewBench

An open-source, enhanced version of Claude Cowork built specifically for programmers.
Explore the docs »

Getting Started · View Demo · Report Bug · Request Feature · Contact Us

Table of Contents

Overview
Getting Started
CrewBench Usage
Roadmap
Contributing
License
Contact
Acknowledgments

Overview

CrewBench is a desktop UI and control layer for AI coding agents such as Claude Code, Gemini CLI, and Codex.

It provides visibility, control, and rollback over how AI agents read and modify a real codebase.

Note

CrewBench is built on top of AionUi (Apache-2.0) and inherits all upstream AionUi capabilities, while adding developer-focused observability and guardrails.

Core Features

Feature	Description
Universal Model Support	Work with cloud and local models through a unified interface, including OpenAI-compatible APIs and local runtimes.
Agent Coworking Environment	Multiple agents and tasks can run in parallel with isolated context, enabling real cowork-style workflows.
Direct File System Access	Agents operate directly on real project files via a file-tree interface, not abstract chat outputs.
File Timeline	Every file change is recorded in a chronological timeline, making agent activity inspectable and reviewable.
Action Logs & Audit Trail	Full logs of both AI and human actions provide observability, debugging capability, and reproducibility.
Live Preview & Editing	Generated artifacts are surfaced as editable, previewable files rather than hidden chat responses.
Extensible & Customizable UI	The interface is developer-controlled, supporting layout flexibility and custom CSS theming.

Special Features:

File Level Version Tracking and CrewBench Logging

CrewBench adds AI-aware file version tracking, recording every agent-initiated change:

Full content snapshots per change
Line-based diffs between versions
Operation type (create, write, delete)
Agent attribution
Timestamped history per workspace

This complements Git by tracking agent behavior, not commits.

Action Logs

Action logs provide a comprehensive audit trail of all actions performed by agents and humans in the workspace. Each log entry includes:

Timestamp
Action type (create, write, delete)
Agent attribution
File path
Content snapshots
Line-based diffs

Technical Focus Areas

Focus Area	What It Covers	Why It Matters
Observability	• File timelines and diffs • Clear agent vs human attribution • Action-level history	Lets developers see exactly what happened, who did it (human or AI), and in what order
Control / Guardrails	• Path-based read/write restrictions • Operation-level permissions • Confirmation gates for risky actions	Prevents agents from touching the wrong files or making unsafe changes
Reproducibility	• Inspectable agent action sequences • Debugging and diagnosis of AI behavior	Makes AI behavior debuggable, auditable, and repeatable
Human + Agent Workflows	• Continuous history across human and agent edits • No context loss between steps	Enables seamless collaboration without losing state or intent

Our Relationship to AionUi

AionUi provides the base platform, including:

Multi-agent orchestration
Desktop UI infrastructure
File system access
Local-first execution
Cross-platform support

CrewBench builds on top of this foundation and focuses specifically on software engineering workflows.

Note

What CrewBench Does Not Do

Does not replace Git
Does not replace an IDE
Does not introduce new AI models
Does not focus on prompt presets

CrewBench operates at the control and observability layer, not the editor or model layer.

Getting Started

System Requirements

macOS: 10.15 or higher
Windows: Windows 10 or higher
Linux: Ubuntu 18.04+ / Debian 10+ / Fedora 32+
Memory: Recommended 4GB or more
Storage: At least 500MB available space

Prerequisites

CrewBench is a desktop application and does not require any prerequisites.

Installation

Download the latest release from here.
Install the downloaded file.
Now you can run the CrewBench as an Application file.

(back to top)

CrewBench Usage

Cowork With CLI AI Agents (Unified Developer Interface)

CrewBench acts as a graphical cowork layer on top of existing command-line AI coding tools like Gemini CLI, Claude Code, Codex, Qwen Code, Goose, and more.

Capability	Description
Multi-Agent Sessions	Run multiple AI coding agents simultaneously
Unified Interface	One UI instead of multiple terminals
Context Isolation	Each agent and task runs with independent memory
Local Execution	Agents run locally on your machine

Cowork Directly on Your Codebase

Agents operate on real project files — not abstract chat outputs.

Developer Need	How CrewBench Supports It
Code Exploration	File-tree based navigation
File Modification	Agents read and write actual project files
Batch Operations	Rename, merge, and reorganize files
Safe Changes	All changes are tracked and reviewable

Inspect Agent Output, Not Just Results

CrewBench surfaces agent outputs as first-class artifacts you can inspect and modify.

Visibility Feature	Description
Multi-Format Preview	Code, Markdown, HTML, documents, images, diffs
Live File Sync	Previews update automatically when files change
Inline Editing	Edit or refine agent output directly
Change Awareness	See what changed, not just the final state

File Timeline and Action Logs

Engineering-grade observability for AI coding agents.

Observability Tool	Purpose
File Timeline	Chronological history of every file change
Action Logs	Full audit trail of agent and human actions
Change Attribution	Clear mapping of who (or which agent) did what
Reproducibility	Inspect and debug agent behavior

Parallel Task Execution Without Context Collision

Run multiple tasks simultaneously without cross-contamination.

Workflow Pattern	Support
Parallel Sessions	Multiple active tasks at the same time
Isolated Memory	No cross-task confusion
Long-Running Work	Persistent sessions for ongoing work

Remote Coworking and Team Workflows

Control and observe agents remotely while keeping execution local.

Capability	Description
WebUI Access	Control CrewBench from other devices on the network
Local Agent Control	Agents continue running on the host machine
Server Deployment	Suitable for self-hosted setups
CrewBench Teams	Collaborative multi-developer workflows (coming soon)

Developer-Controlled Interface

CrewBench adapts to your workflow — not the other way around.

Customization	Description
CSS Customization	Full control over UI appearance
Layout Flexibility	Shape the interface around your workflow
Personal Setup	Make CrewBench feel like your own IDE

(back to top)

Roadmap

[Completed] File Timeline
[In Progress] Action Logs
[Not Started] GuardRails

(back to top)

Contributing to CrewBench

To contribute to CrewBench, please read the CONTRIBUTING.md file.

Top contributors:

License

Distributed under the project_license. See LICENSE.txt for more information.

(back to top)

Contact

Abhinav Sharma - x/abhinavsharma_x

Project Link: https://github.com/CrewBench/CrewBench

(back to top)

Acknowledgments

Electron — for enabling cross-platform desktop application development
Node.js — for the underlying runtime and tooling ecosystem
SQLite — for reliable, local-first data storage
OpenAI — for OpenAI-compatible APIs and ecosystem standards
Google Gemini — for multimodal and large-context model support
Anthropic Claude — for advancing safe and capable coding agents

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github		.github
.husky		.husky
.specify		.specify
.vscode		.vscode
assistant		assistant
config/webpack		config/webpack
docs		docs
public		public
resources		resources
scripts		scripts
skills		skills
src		src
tests		tests
.eslintignore		.eslintignore
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.npmrc		.npmrc
.prettierignore		.prettierignore
.prettierrc.json		.prettierrc.json
CODE_STYLE.md		CODE_STYLE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE[AionUi]		LICENSE[AionUi]
VERIFICATION_GUIDE.md		VERIFICATION_GUIDE.md
WEBUI_GUIDE.md		WEBUI_GUIDE.md
electron-builder.yml		electron-builder.yml
entitlements.plist		entitlements.plist
forge.config.ts		forge.config.ts
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
readme.md		readme.md
tsconfig.json		tsconfig.json
uno.config.ts		uno.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CrewBench

Overview

Core Features

Special Features:

File Level Version Tracking and CrewBench Logging

Action Logs

Technical Focus Areas

Our Relationship to AionUi

What CrewBench Does Not Do

Getting Started

System Requirements

Prerequisites

Installation

CrewBench Usage

Roadmap

Contributing to CrewBench

Top contributors:

License

Contact

Acknowledgments

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CrewBench

Overview

Core Features

Special Features:

File Level Version Tracking and CrewBench Logging

Action Logs

Technical Focus Areas

Our Relationship to AionUi

What CrewBench Does Not Do

Getting Started

System Requirements

Prerequisites

Installation

CrewBench Usage

Roadmap

Contributing to CrewBench

Top contributors:

License

Contact

Acknowledgments

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages