FlakeGuard

This repository is the Marketplace/action-only distribution for FlakeGuard. Development, tests, and dogfooding live in the source repo: https://github.com/goat-ai-claw/flakeguard

Detect suspect flaky tests from JUnit history in GitHub Actions.

FlakeGuard is a local TypeScript GitHub Action that turns JUnit XML history into a flaky-test report directly in the workflow UI. It keeps a rolling JSON history, flags likely flaky tests with simple deterministic rules, and writes a markdown summary you can review in GitHub Actions.

What it does

Parses one or more JUnit XML files from explicit paths or glob patterns
Normalizes tests as classname::name
Tracks recent passed, failed, and skipped outcomes in a rolling history file
Marks likely flaky tests when the recent window contains both passes and failures and the failure count reaches a threshold
Writes a markdown summary file and appends it to $GITHUB_STEP_SUMMARY when available
Exposes suspect_count, summary_path, and history_path as action outputs

Minimal workflow example

name: flakeguard

on:
  workflow_dispatch:

jobs:
  detect-flakes:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - name: Run tests
        run: npm test -- --reporters=default --reporters=jest-junit

      - name: FlakeGuard
        uses: goat-ai-claw/flakeguard-action@v1
        with:
          report_paths: 'reports/junit.xml'
          history_file: '.flakeguard/history.json'
          max_runs: '10'
          suspect_threshold: '2'

For local development and tests, use the source repo (goat-ai-claw/flakeguard) and replace goat-ai-claw/flakeguard-action@v1 with ./ there.

That minimal snippet proves the wiring. To detect real cross-run flakes, pair it with the cache-backed history pattern in examples/cache-history.yml.

What the workflow summary looks like

After FlakeGuard has seen a few runs for the same test, the GitHub Actions step summary highlights mixed pass/fail history directly in the workflow UI:

# FlakeGuard Summary

## Current run totals

- Total: 2
- Passed: 1
- Failed: 1
- Skipped: 0

## Suspect flakes

- `suite.Flake::toggles` — latest: **failed**, passes: 1, failures: 2, recent: F → P → F

## Stable failures

No stable failures detected.

To reproduce that exact suspect-flake summary locally, run npm install && npm run demo:cross-run in the source repo: https://github.com/goat-ai-claw/flakeguard

Inputs

Input	Default	Description
`report_paths`	—	Comma-separated JUnit XML files or glob patterns.
`history_file`	`.flakeguard/history.json`	Rolling JSON history snapshot.
`max_runs`	`10`	Maximum recent runs to retain per test.
`suspect_threshold`	`2`	Minimum failures in the rolling window before a mixed pass/fail test becomes a suspect flake.

Outputs

Output	Description
`suspect_count`	Number of likely flaky tests detected.
`summary_path`	Path to the generated markdown summary file.
`history_path`	Path to the updated JSON history file.

History persistence

The lightest credible MVP path is a branch-scoped cache that restores .flakeguard/history.json before FlakeGuard runs and saves it again afterward. See examples/cache-history.yml for a copy-paste workflow using actions/cache/restore@v4 and actions/cache/save@v4.

That pattern keeps the UX lightweight:

one stable history_file path inside the repo workspace
one unique cache key per run attempt (github.run_id + github.run_attempt)
one branch-scoped restore-keys prefix so the latest history is reused on the next run without committing state back to the repo

Because this pattern creates one immutable cache entry per run attempt, teams should pair it with normal cache-retention hygiene if they keep FlakeGuard history for a long time.

Local demo

For local demo and development commands, use the source repo: https://github.com/goat-ai-claw/flakeguard

That source repo includes the build tooling, tests, and demo:cross-run script used to generate the example output shown here.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
dist		dist
examples		examples
scripts		scripts
LICENSE		LICENSE
README.md		README.md
action.yml		action.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FlakeGuard

What it does

Minimal workflow example

What the workflow summary looks like

Inputs

Outputs

History persistence

Local demo

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FlakeGuard

What it does

Minimal workflow example

What the workflow summary looks like

Inputs

Outputs

History persistence

Local demo

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages