[cli-consistency] CLI Consistency Issues - 2026-05-19

## Summary

Automated CLI consistency inspection found **107 inconsistencies** in command help text that should be addressed for better user experience and documentation clarity.

### Breakdown by Severity

- **High**: 21 (Breaks functionality, causes confusion, or misleading information)
- **Medium**: 47 (Inconsistent terminology, style issues affecting professionalism)
- **Low**: 39 (Minor punctuation and formatting variations)

### Issue Categories

1. **Typos and Grammar** (14 issues) - 2 actual errors, 7 awkward phrasings, 5 style issues
2. **Style and Terminology** (78 issues) - 8 high, 31 medium, 39 low severity
3. **Documentation Mismatches** (13 issues) - 3 critical, 9 subcommands, 1 minor gap
4. **Flag Consistency** (13 issues) - 8 high, 3 medium, 2 low severity

### Inspection Details

- **Total Commands Inspected**: 55 (43 primary + 12 subcommands)
- **Date**: 2026-05-19
- **Method**: Executed all CLI commands with `--help` and analyzed actual output (1,742 lines)

---

<details>
<summary><h3>🔴 Critical Issues (Must Fix)</h3></summary>

### Documentation - Wrong Flag Name

**Issue**: `logs` command documentation uses `--after` but CLI implements `--cache-before`

**CLI Implementation**:
```
--cache-before string   Evict locally cached run folders for runs before this date
```

**Documentation** (lines 420-426 in `docs/src/content/docs/setup/cli.md`):
```bash
gh aw logs --after -1w        # Wrong flag name
```

**Impact**: Users following docs get "unknown flag: --after" errors

**Fix**: Replace all `--after` with `--cache-before` in documentation

---

### Documentation - `forecast` Command Completely Undocumented

**Issue**: Experimental token forecasting command exists but has no documentation

**CLI Description**:
```
[EXPERIMENTAL] Forecast effective token usage for agentic workflows by sampling
recent run history and projecting forward on a per-week or per-month basis.
```

**Examples from CLI**:
```bash
gh aw forecast                        # Forecast all workflows (monthly)
gh aw forecast ci-doctor              # Forecast a specific workflow
gh aw forecast --period week          # Weekly projections
gh aw forecast --eval                 # Backtest forecast quality
```

**Impact**: Users cannot discover cost planning and token forecasting capabilities

**Fix**: Add complete documentation section for `forecast` command

---

### Flag Collision - `-l` Used for Two Different Purposes

**Issue**: Same short flag means different things in different commands

**Commands affected**:
- `trial`: `-l, --logical-repo` - Repository to simulate execution against
- `project new`: `-l, --link` - Repository to link project to

**Impact**: Users cannot reliably use `-l` shortcut

**Fix**: Remove `-l` from one command or reassign to different short flag

---

### Flag Semantic Confusion - `--engine` Has Three Meanings

**Issue**: Same flag name with different semantic purposes

| Meaning | Commands | Description |
|---------|----------|-------------|
| **Override** | add, compile, deploy, new, run, trial, update, validate | "Override AI engine" |
| **Filter** | logs | "Filter logs by AI engine" |
| **Check** | secrets bootstrap | "Check tokens for specific engine" |

**Impact**: Users expect same flag to have consistent meaning

**Fix**: Rename filter variant to `--filter-engine` or `--engine-filter` in `logs` command

---

### Flag Description Drift - `--dir` Has Three Different Descriptions

**Issue**: Same flag with varying descriptions and behaviors

**Variants**:
1. Standard (9 commands): `"Workflow directory (default: .github/workflows)"`
2. Lock-specific (lint): `"Directory to scan for *.lock.yml files when no arguments are provided"`
3. Environment-aware (list): `"Workflow directory (default: $GH_AW_WORKFLOWS_DIR or .github/workflows; ignored when --repo is set)"`

**Impact**: Unclear if flag respects environment variables or interacts with other flags

**Fix**: Standardize to first variant, document special behaviors in command long help

---

### Grammar - Missing Article in `compile` Command

**Location**: `gh aw compile --dependabot` flag description

**Current**:
> "For Python: Creates requirements.txt for pip packages"

**Issue**: Missing article before "requirements.txt"

**Fix**:
> "For Python: Creates a requirements.txt for pip packages"

---

### Grammar - Missing Article in `health` Command

**Location**: `gh aw health` description

**Current**:
> "Shows health metrics for workflows including:"

**Issue**: Missing article/comma before "workflows"

**Fix**:
> "Shows health metrics for workflows, including:"

OR
> "Shows health metrics for all workflows, including:"

</details>

---

<details>
<summary><h3>🟡 High-Priority Issues (Should Fix)</h3></summary>

### Terminology - `workflow-id` vs `workflow ID` Inconsistency

**Issue**: Mixing hyphenated and spaced forms (affects 15+ commands)

**Examples**:
```
Line 214: "The workflow-id is the basename..."  (hyphenated)
Line 1043: "The workflow-id is the basename..." (hyphenated)
Usage: gh aw compile [workflow-id]              (hyphenated in usage)
Text: "...a numeric run ID (e.g., 1234567890)"  (spaced in prose)
```

**Recommendation**: Use "workflow ID" (with space) in prose, `workflow-id` only in code/usage syntax

---

### Terminology - `run-id` vs `run ID` Inconsistency

**Issue**: Alternates between "run-id", "run ID", and "run-id-or-url"

**Examples**:
```
Text: "A numeric run ID (e.g., 1234567890)"     (spaced)
Usage: gh aw audit <run-id-or-url>              (hyphenated)
```

**Recommendation**: Use "run ID" in prose, `<run-id-or-url>` only in usage syntax

---

### Style - Long Command Descriptions Not Split

**Issue**: Several commands have 100+ character single-line descriptions

**Examples**:
```
audit:    "Audit one or more workflow runs by downloading artifacts and logs, detecting errors, analyzing MCP tool usage, and generating a concise report suitable for AI agents."

deploy:   "Deploy one or more workflows to a target repository by combining clone, update, add, compile, and pull request creation."
```

**Recommendation**: Break into brief first line (<80 chars) + additional context

---

### Documentation - 9 Subcommands Not Properly Documented

**Undocumented or unclear**:
- `mcp add` - Add MCP server to workflow  
- `mcp inspect` - Inspect MCP servers and tools
- `mcp list` - List MCP servers in workflows
- `mcp list-tools` - List tools for MCP server
- `completion install` - Auto-install completions
- `completion uninstall` - Remove completions
- `project new` - Create GitHub Project (hierarchy unclear)
- `secrets set` - Set repository secret (hierarchy unclear)
- `secrets bootstrap` - Check required secrets (hierarchy unclear)

**Impact**: Users cannot find detailed usage for these subcommands

**Fix**: Add dedicated subsections with full examples and flags

---

### Flag Availability - `--repo` Missing from 6 Commands

**Issue**: Cross-repository support inconsistent

**Commands WITH --repo**: audit, checks, deploy, disable, enable, forecast, health, list, logs, outcomes, run, status, update, pr transfer, secrets

**Commands MISSING --repo** (but operate on workflows):
- add
- add-wizard  
- compile
- fix
- remove
- validate

**Impact**: Users expect to operate on remote repositories consistently

**Recommendation**: Either add `--repo` to all commands or document local-only design decision

---

### Flag Availability - `--json` Missing from Display Commands

**Issue**: 4 display commands lack JSON output for automation

**Missing from**:
- `version` - Could output structured version data
- `mcp list` - Currently table format only
- `mcp list-tools` - Currently table format only
- `mcp inspect` - Currently human-readable only

**Impact**: Automation scripts cannot easily parse output

**Recommendation**: Add `--json` support to all display commands

---

### Flag Inconsistency - `--days` Has Different Allowed Values

**Issue**: Same flag with incompatible value ranges

| Command | Allowed Values | Default |
|---------|----------------|---------|
| forecast | 7, 30 | 30 |
| health | 7, 30, 90 | 7 |

**Impact**: Users expect `--days 90` to work in both commands

**Recommendation**: Align to support 7, 30, 90 in both commands

---

### Flag Description - `--force` Varies Across Commands

**Issue**: Similar but subtly different behaviors (affects 5 commands)

**Variants**:
- add, deploy, new: "Overwrite existing workflow files without confirmation"
- compile: "Force overwrite of existing dependency files"
- update: "Force update even if no changes are detected"

**Note**: `compile` is missing `-f` short form while others have it

**Recommendation**: Unify description style and add missing short form

---

### Flag Semantic Difference - `--ref` Has Dual Purpose

**Issue**: Same flag for filtering vs targeting

| Command | Meaning | Description |
|---------|---------|-------------|
| logs | Filter | "Filter runs by branch or tag name" |
| run | Target | "Branch or tag name to run the workflow on" |
| status | Filter | "Filter runs by branch or tag name" |

**Recommendation**: Acceptable difference given context, but clarify `run` as "Target branch/tag for workflow execution"

---

### Style - Inconsistent Punctuation in Flag Descriptions

**Issue**: Most long-form flag descriptions end with periods, but some don't

**Examples**:
```
✓ Good: "--create-pull-request   Create a pull request with the workflow changes"
✗ Bad:  "--skip-secret         Skip the API secret prompt (use when..."
```

**Recommendation**: All complete sentences should end with periods

---

### Style - "Workflow Specifications:" Bullets Lack Consistent Punctuation

**Issue**: Bullet lists without terminal periods (affects `add`, `add-wizard`)

**Example**:
```
- Two parts: "owner/repo[`@version`]" (loads repository-root aw.yml package)
- Three parts: "owner/repo/workflow-name[`@version`]" (implicitly looks in...)
```

**Recommendation**: Remove periods from all items for consistency with list formatting

---

### Awkward Phrasing - Multiple Instances

1. **audit**: "the first is used as the base (reference) and the remaining runs" - awkward double comma
2. **checks**: "JSON output includes two state fields" - "state fields" could be "status fields"
3. **forecast**: "measures actual runs in that period and computes quality metrics" - missing comma
4. **init**: "Initialize the repository... by configuring... and creating..." - slightly redundant
5. **logs**: "aw.patch: Git patch... (legacy; see aw-{branch}.patch)" - parenthetical awkward
6. **trial**: Long run-on sentence with unclear quotation usage
7. **docs**: Long descriptions in multiple commands need splitting

</details>

---

<details>
<summary><h3>🟢 Low-Priority Issues (Polish)</h3></summary>

### Minor Style Issues

1. **Capitalization after colons**: Inconsistent in descriptive lists
2. **Imperative vs descriptive voice**: Mix of "This command..." vs imperative
3. **JSON flag descriptions**: Minor variations ("results" vs "metrics" vs "trial results")
4. **Ellipsis usage**: Both `...` and `[...]` for variadic arguments (appears correct but verify)
5. **Whitespace in flag descriptions**: Varying indentation depths
6. **Example ordering**: Some by complexity, some by frequency
7. **Hyphenation**: "3-way merge" vs "three-way merge"
8. **Time period format**: "7d, 24h" vs "7 days"
9. **Path notation**: Inconsistent "./path" vs "/path" vs "path"
10. **Boolean flag descriptions**: Inconsistent use of "Enable" vs "Disable"

### Missing Short Forms for Common Flags

**Flags used in 3+ commands without short forms**:
- `--disable-security-scanner` (4 commands)
- `--stop-after` (4 commands)
- `--append` (3 commands)
- `--approve` (3 commands)
- `--ref` (3 commands) - has short form in some, missing in others

**Recommendation**: Consider adding short forms for frequently used flags

### Help Flag Spacing Inconsistency

**Issue**: `--help` descriptions have variable spacing (cosmetic only)

**Examples**:
```
add:       -h, --help                       Show help for gh aw add
mcp list:  -h, --help   Show help for gh aw mcp list
```

**Recommendation**: Standardize spacing in help formatter

### Documentation - Minor Gap

**Issue**: `compile --ghes` flag not documented

**CLI**:
```
--ghes    Enable GitHub Enterprise Server (GHES) compatibility mode
```

**Documentation**: No mention in compile command options (though GHES is mentioned in installation)

**Fix**: Add `--ghes` to compile command options list

</details>

---

### Positive Findings

The following areas show excellent consistency:

✅ **Global flags** (`--banner`, `--verbose`) present in all 43 commands  
✅ **Help flag** (`-h, --help`) universal and consistent  
✅ **Short flag assignments** mostly collision-free (only 1 collision found)  
✅ **Repository flag description** identical across 16 commands  
✅ **Engine values** consistently listed as `(copilot, claude, codex, gemini, crush)`  
✅ **JSON flag description** identical across 14 commands  
✅ **No phantom commands** - all documented commands exist in CLI  
✅ **Core workflows** well documented

---

### Recommended Action Plan

#### Phase 1 - Critical Fixes (High Impact)
1. Fix `logs --after` → `--cache-before` in documentation
2. Add `forecast` command documentation
3. Resolve `-l` short flag collision
4. Clarify `--engine` semantics (rename filter variant)
5. Standardize `--dir` descriptions
6. Fix grammatical errors (2 instances)

#### Phase 2 - High-Priority Improvements
7. Standardize `workflow-id` / `run-id` terminology
8. Split long command descriptions (<80 chars first line)
9. Document MCP subcommands properly
10. Add `--repo` to missing commands or document local-only design
11. Add `--json` to display commands
12. Align `--days` allowed values
13. Standardize `--force` descriptions

#### Phase 3 - Polish
14. Fix awkward phrasings (7 instances)
15. Standardize flag description punctuation
16. Fix workflow specifications bullet formatting
17. Add short forms for common flags
18. Minor style consistency improvements

---

### Testing Recommendations

1. **Regression test**: Verify all existing flag behaviors remain unchanged
2. **Documentation test**: Ensure help text is clear after changes
3. **Automation test**: Verify `--json` output is parseable
4. **Cross-command test**: Verify flags with same names have same behaviors
5. **Error message test**: Clear errors for invalid flag combinations

---

### Files Analyzed

- **CLI Help Output**: `/tmp/gh-aw/agent/help-output/all-help.txt` (1,742 lines, 55 commands)
- **Documentation**: `docs/src/content/docs/setup/cli.md` (848 lines, 43.6 KB)
- **Flag Analysis**: 123 unique flags across 43 commands

**Analysis Date**: 2026-05-19  
**Method**: Automated inspection using specialized agents for:
- Typo and grammar scanning
- Style and terminology consistency
- Documentation cross-referencing
- Flag naming and availability analysis




> Generated by [✅ CLI Consistency Checker](https://github.com/github/gh-aw/actions/runs/26102456365) · ● 27.8M · [◷](https://github.com/search?q=repo%3Agithub%2Fgh-aw+is%3Aissue+%22gh-aw-workflow-call-id%3A+github%2Fgh-aw%2Fcli-consistency-checker%22&type=issues)
> - [x] expires  on May 21, 2026, 2:18 PM UTC

Meaning	Commands	Description
Override	add, compile, deploy, new, run, trial, update, validate	"Override AI engine"
Filter	logs	"Filter logs by AI engine"
Check	secrets bootstrap	"Check tokens for specific engine"

Command	Meaning	Description
logs	Filter	"Filter runs by branch or tag name"
run	Target	"Branch or tag name to run the workflow on"
status	Filter	"Filter runs by branch or tag name"

[cli-consistency] CLI Consistency Issues - 2026-05-19 #33322

Description

Summary

Breakdown by Severity

Issue Categories

Inspection Details

🔴 Critical Issues (Must Fix)

Documentation - Wrong Flag Name

Documentation - forecast Command Completely Undocumented

Flag Collision - -l Used for Two Different Purposes

Flag Semantic Confusion - --engine Has Three Meanings

Flag Description Drift - --dir Has Three Different Descriptions

Grammar - Missing Article in compile Command

Grammar - Missing Article in health Command

🟡 High-Priority Issues (Should Fix)

Terminology - workflow-id vs workflow ID Inconsistency

Terminology - run-id vs run ID Inconsistency

Style - Long Command Descriptions Not Split

Documentation - 9 Subcommands Not Properly Documented

Flag Availability - --repo Missing from 6 Commands

Flag Availability - --json Missing from Display Commands

Flag Inconsistency - --days Has Different Allowed Values

Flag Description - --force Varies Across Commands

Flag Semantic Difference - --ref Has Dual Purpose

Style - Inconsistent Punctuation in Flag Descriptions

Style - "Workflow Specifications:" Bullets Lack Consistent Punctuation

Awkward Phrasing - Multiple Instances

🟢 Low-Priority Issues (Polish)

Minor Style Issues

Missing Short Forms for Common Flags

Help Flag Spacing Inconsistency

Documentation - Minor Gap

Positive Findings

Recommended Action Plan

Phase 1 - Critical Fixes (High Impact)

Phase 2 - High-Priority Improvements

Phase 3 - Polish

Testing Recommendations

Files Analyzed

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Documentation - `forecast` Command Completely Undocumented

Flag Collision - `-l` Used for Two Different Purposes

Flag Semantic Confusion - `--engine` Has Three Meanings

Flag Description Drift - `--dir` Has Three Different Descriptions

Grammar - Missing Article in `compile` Command

Grammar - Missing Article in `health` Command

Terminology - `workflow-id` vs `workflow ID` Inconsistency

Terminology - `run-id` vs `run ID` Inconsistency

Flag Availability - `--repo` Missing from 6 Commands

Flag Availability - `--json` Missing from Display Commands

Flag Inconsistency - `--days` Has Different Allowed Values

Flag Description - `--force` Varies Across Commands

Flag Semantic Difference - `--ref` Has Dual Purpose