[clang][utils] Add auto mode to reduction script #163282

ilovepi · 2025-10-13T22:17:52Z

The existing script allows us to run creduce/cvise automatically on clang reproducers. However, llvm-reduce is much more efficient for middle or backend issues.

In the early stages, we can just determine which phase the crash happens, and choose the best tool. For now, we only support simple test scripts for opt and llc, but this should cover many more cases.

This feature is behind a new flag, --auto, and the default behavior of the script is otherwise unchanged.

Fixes #126827

The existing script allows us to run creduce/cvise automatically on clang reproducers. However, llvm-reduce is much more efficient for middle or backend issues. In the early stages, we can just determine which phase the crash happens, and choose the best tool. For now, we only support simple test scripts for opt and llc, but this should cover many more cases. This feature is behind a new flag `--auto`, and the default behavior of the script is otherwise unchanged.

llvmbot · 2025-10-13T22:18:28Z

@llvm/pr-subscribers-clang

Author: Paul Kirth (ilovepi)

Changes

The existing script allows us to run creduce/cvise automatically on clang reproducers. However, llvm-reduce is much more efficient for middle or backend issues.

In the early stages, we can just determine which phase the crash happens, and choose the best tool. For now, we only support simple test scripts for opt and llc, but this should cover many more cases.

This feature is behind a new flag, --auto, and the default behavior of the script is otherwise unchanged.

Full diff: https://github.com/llvm/llvm-project/pull/163282.diff

1 Files Affected:

(modified) clang/utils/reduce-clang-crash.py (+189-12)

diff --git a/clang/utils/reduce-clang-crash.py b/clang/utils/reduce-clang-crash.py
index 22e3dbb8f9b3f..f36733251e2da 100755
--- a/clang/utils/reduce-clang-crash.py
+++ b/clang/utils/reduce-clang-crash.py
@@ -18,12 +18,22 @@
 import subprocess
 import shlex
 import tempfile
-import shutil
 import multiprocessing
+from enum import StrEnum, auto
+from typing import List, Optional
 
 verbose = False
 creduce_cmd = None
 clang_cmd = None
+opt_cmd = None
+llc_cmd = None
+
+
+class FailureType(StrEnum):
+    FrontEnd = auto()
+    MiddleEnd = auto()
+    BackEnd = auto()
+    Unknown = auto()
 
 
 def verbose_print(*args, **kwargs):
@@ -70,6 +80,44 @@ def write_to_script(text, filename):
     os.chmod(filename, os.stat(filename).st_mode | stat.S_IEXEC)
 
 
+def extract_opt_level(args_list: List[str]) -> Optional[str]:
+    """
+    Finds the last optimization flag (-O...) from a list of arguments.
+
+    Args:
+      args_list: A list of string arguments passed to the compiler.
+
+    Returns:
+      The last matching optimization flag string if found, otherwise None.
+    """
+    valid_opt_flags = {"-O0", "-O1", "-O2", "-O3", "-Os", "-Oz", "-Og", "-Ofast"}
+
+    # Iterate in reverse to find the last occurrence
+    for arg in reversed(args_list):
+        if arg in valid_opt_flags:
+            return arg
+    return None
+
+
+def remove_first_line(file_path):
+    """
+    Removes the first line from a specified file.
+    """
+    try:
+        with open(file_path, "r") as f:
+            lines = f.readlines()
+
+        # If the file is not empty, write all lines except the first one back.
+        if lines:
+            with open(file_path, "w") as f:
+                f.writelines(lines[1:])
+
+    except FileNotFoundError:
+        print(f"Error: File '{file_path}' not found.")
+    except Exception as e:
+        print(f"An error occurred: {e}")
+
+
 class Reduce(object):
     def __init__(self, crash_script, file_to_reduce, creduce_flags):
         crash_script_name, crash_script_ext = os.path.splitext(crash_script)
@@ -85,6 +133,9 @@ def __init__(self, crash_script, file_to_reduce, creduce_flags):
         self.expected_output = []
         self.needs_stack_trace = False
         self.creduce_flags = ["--tidy"] + creduce_flags
+        self.opt = opt_cmd
+        self.llc = llc_cmd
+        self.ir_file = "crash.ll"
 
         self.read_clang_args(crash_script, file_to_reduce)
         self.read_expected_output()
@@ -186,22 +237,30 @@ def skip_function(func_name):
 
         self.expected_output = result
 
-    def check_expected_output(self, args=None, filename=None):
+    def check_expected_output(self, cmd=None, args=None, filename=None):
+        if not cmd:
+            cmd = self.clang
         if not args:
             args = self.clang_args
         if not filename:
             filename = self.file_to_reduce
 
         p = subprocess.Popen(
-            self.get_crash_cmd(args=args, filename=filename),
+            self.get_crash_cmd(cmd=cmd, args=args, filename=filename),
             stdout=subprocess.PIPE,
             stderr=subprocess.STDOUT,
         )
         crash_output, _ = p.communicate()
         return all(msg in crash_output.decode("utf-8") for msg in self.expected_output)
 
-    def write_interestingness_test(self):
+    def write_interestingness_test(self, cmd=None, use_llvm_reduce=False):
         print("\nCreating the interestingness test...")
+        if not cmd:
+            cmd = self.get_crash_cmd()
+
+        # llvm-reduce interestingness tests take the file as the first argument.
+        # NOTE: this cannot be escaped by quote_cmd(), since it needs expansion.
+        filename = '< "$1"' if use_llvm_reduce else ""
 
         # Disable symbolization if it's not required to avoid slow symbolization.
         disable_symbolization = ""
@@ -210,32 +269,39 @@ def write_interestingness_test(self):
 
         output = """#!/bin/bash
 %s
-if %s >& t.log ; then
+if %s %s >& t.log ; then
   exit 1
 fi
 """ % (
             disable_symbolization,
-            quote_cmd(self.get_crash_cmd()),
+            quote_cmd(cmd),
+            filename,
         )
 
         for msg in self.expected_output:
             output += "grep -F %s t.log || exit 1\n" % shlex.quote(msg)
 
         write_to_script(output, self.testfile)
-        self.check_interestingness()
+        self.check_interestingness(cmd, use_llvm_reduce=use_llvm_reduce)
 
-    def check_interestingness(self):
-        testfile = os.path.abspath(self.testfile)
+    def check_interestingness(self, cmd, use_llvm_reduce=False):
+        test_cmd = [os.path.abspath(self.testfile)]
 
+        # llvm-reduce interestingness tests take the file as the first arg.
+        if use_llvm_reduce:
+            test_cmd += [self.ir_file]
         # Check that the test considers the original file interesting
-        returncode = subprocess.call(testfile, stdout=subprocess.DEVNULL)
-        if returncode:
+        result = subprocess.run(args=test_cmd, stdout=subprocess.DEVNULL)
+        if result.returncode:
             sys.exit("The interestingness test does not pass for the original file.")
 
         # Check that an empty file is not interesting
         # Instead of modifying the filename in the test file, just run the command
         with tempfile.NamedTemporaryFile() as empty_file:
-            is_interesting = self.check_expected_output(filename=empty_file.name)
+            new_args = cmd[1:] if use_llvm_reduce else cmd[1:-1]
+            is_interesting = self.check_expected_output(
+                cmd=cmd[0], args=new_args, filename=empty_file.name
+            )
         if is_interesting:
             sys.exit("The interestingness test passes for an empty file.")
 
@@ -424,11 +490,76 @@ def run_creduce(self):
             print("\n\nctrl-c detected, killed reduction tool")
             p.kill()
 
+    def run_llvm_reduce(self):
+        full_llvm_reduce_cmd = [
+            llvm_reduce_cmd,
+            f"--test={self.testfile}",
+            self.ir_file,
+        ]
+        print("\nRunning llvm-reduce tool...")
+        verbose_print(quote_cmd(full_llvm_reduce_cmd))
+        try:
+            p = subprocess.Popen(full_llvm_reduce_cmd)
+            p.communicate()
+        except KeyboardInterrupt:
+            # Hack to kill C-Reduce because it jumps into its own pgid
+            print("\n\nctrl-c detected, killed reduction tool")
+            p.kill()
+
+    def classify_crash(self) -> FailureType:
+        print("classifying crash ...")
+        if self.check_expected_output(args=self.clang_args + ["-fsyntax-only"]):
+            print("Found Frontend Crash")
+            return FailureType.FrontEnd
+
+        print("Found Middle/Backend failure")
+        args = self.clang_args + [
+            "-mllvm",
+            "--print-on-crash",
+            "-mllvm",
+            f"--print-on-crash-path={self.ir_file}",
+            "-mllvm",
+            "--print-module-scope",
+        ]
+
+        if not self.check_expected_output(args=args):
+            sys.exit("The interestingness test does not pass with '--print-on-crash'.")
+
+        # The output from --print-on-crash has an invalid first line (pass name).
+        remove_first_line(self.ir_file)
+
+        self.opt_level = extract_opt_level(self.clang_args) or "-O2"
+
+        if self.check_expected_output(
+            cmd=self.opt,
+            args=[self.opt_level, "-disable-output"],
+            filename=self.ir_file,
+        ):
+            print("Found MiddleEnd Crash")
+            return FailureType.MiddleEnd
+        if self.check_expected_output(
+            cmd=self.llc, args=[self.opt_level], filename=self.ir_file
+        ):
+            print("Found BackEnd Crash")
+            return FailureType.BackEnd
+        print("Found Unknow Crash Type. Falling back to creduce")
+        return FailureType.Unknown
+
+    def reduce_ir_crash(self, new_cmd: List[str]):
+        print("Writing interestingness test...")
+        self.write_interestingness_test(cmd=new_cmd, use_llvm_reduce=True)
+        print("Starting llvm-reduce with llc test case")
+        self.run_llvm_reduce()
+        print("Done Reducing IR file.")
+
 
 def main():
     global verbose
     global creduce_cmd
+    global llvm_reduce_cmd
     global clang_cmd
+    global opt_cmd
+    global llc_cmd
 
     parser = ArgumentParser(description=__doc__, formatter_class=RawTextHelpFormatter)
     parser.add_argument(
@@ -450,6 +581,20 @@ def main():
         help="The path to the `clang` executable. "
         "By default uses the llvm-bin directory.",
     )
+    parser.add_argument(
+        "--opt",
+        dest="opt",
+        type=str,
+        help="The path to the `opt` executable. "
+        "By default uses the llvm-bin directory.",
+    )
+    parser.add_argument(
+        "--llc",
+        dest="llc",
+        type=str,
+        help="The path to the `llc` executable. "
+        "By default uses the llvm-bin directory.",
+    )
     parser.add_argument(
         "--creduce",
         dest="creduce",
@@ -457,13 +602,29 @@ def main():
         help="The path to the `creduce` or `cvise` executable. "
         "Required if neither `creduce` nor `cvise` are on PATH.",
     )
+    parser.add_argument(
+        "--llvm-reduce",
+        dest="llvm_reduce",
+        type=str,
+        help="The path to the `llvm-reduce` executable. "
+        "By default uses the llvm-bin directory.",
+    )
+    parser.add_argument(
+        "--auto",
+        action="store_true",
+        help="Use auto reduction mode, that uses `creduce`/`cvise`"
+        "for frontend crashes and llvm-reduce for middle/backend crashes.",
+    )
     parser.add_argument("-v", "--verbose", action="store_true")
     args, creduce_flags = parser.parse_known_args()
     verbose = args.verbose
     llvm_bin = os.path.abspath(args.llvm_bin) if args.llvm_bin else None
     creduce_cmd = check_cmd("creduce", None, args.creduce)
     creduce_cmd = check_cmd("cvise", None, args.creduce)
+    llvm_reduce_cmd = check_cmd("llvm-reduce", llvm_bin, args.llvm_reduce)
     clang_cmd = check_cmd("clang", llvm_bin, args.clang)
+    opt_cmd = check_cmd("opt", llvm_bin, args.opt)
+    llc_cmd = check_cmd("llc", llvm_bin, args.llc)
 
     crash_script = check_file(args.crash_script[0])
     file_to_reduce = check_file(args.file_to_reduce[0])
@@ -472,6 +633,22 @@ def main():
         creduce_flags += ["--n", str(max(4, multiprocessing.cpu_count() // 2))]
 
     r = Reduce(crash_script, file_to_reduce, creduce_flags)
+    if args.auto:
+        crash_type = r.classify_crash()
+        match crash_type:
+            case FailureType.FrontEnd | FailureType.Unknown:
+                print("Starting reduction with creduce/cvise")
+                pass
+            case FailureType.MiddleEnd:
+                # TODO: parse the exact pass from the backtrace and set the
+                # pass pipeline directly.
+                new_cmd = [opt_cmd, "-disable-output", r.opt_level]
+                r.reduce_ir_crash(new_cmd)
+                return
+            case FailureType.BackEnd:
+                new_cmd = [llc_cmd, r.opt_level]
+                r.reduce_ir_crash(new_cmd)
+                return
 
     r.simplify_clang_args()
     r.write_interestingness_test()

ilovepi · 2025-10-13T23:22:42Z

crash.cpp

# Crash reproducer for Fuchsia clang version 22.0.0git (git@github.com:ilovepi/llvm-project 1220cd3c868115be2c4d80c77d65719eb9dd1c1d)
# Driver args: "-O2" "-mcpu=grace" "dumb.cpp" "-target" "aarch64-linux-gnu"
# Original command:  "/usr/local/google/home/paulkirth/llvm-fork/build/bin/llvm" "clang" "-cc1" "-triple" "aarch64-unknown-linux-gnu" "-O2" "-emit-obj" "-dumpdir" "a-" "-disable-free" "-clear-ast-before-backend" "-main-file-name" "dumb.cpp" "-mrelocation-model" "pic" "-pic-level" "2" "-pic-is-pie" "-mframe-pointer=non-leaf" "-fmath-errno" "-ffp-contract=on" "-fno-rounding-math" "-mconstructor-aliases" "-funwind-tables=2" "-enable-tlsdesc" "-target-cpu" "grace" "-target-feature" "+v9a" "-target-feature" "+aes" "-target-feature" "+bf16" "-target-feature" "+ccidx" "-target-feature" "+complxnum" "-target-feature" "+crc" "-target-feature" "+dotprod" "-target-feature" "+ete" "-target-feature" "+fp-armv8" "-target-feature" "+fp16fml" "-target-feature" "+fpac" "-target-feature" "+fullfp16" "-target-feature" "+i8mm" "-target-feature" "+jsconv" "-target-feature" "+lse" "-target-feature" "+mte" "-target-feature" "+neon" "-target-feature" "+pauth" "-target-feature" "+perfmon" "-target-feature" "+rand" "-target-feature" "+ras" "-target-feature" "+rcpc" "-target-feature" "+rdm" "-target-feature" "+sha2" "-target-feature" "+sha3" "-target-feature" "+sm4" "-target-feature" "+spe" "-target-feature" "+ssbs" "-target-feature" "+sve" "-target-feature" "+sve-aes" "-target-feature" "+sve-bitperm" "-target-feature" "+sve-sha3" "-target-feature" "+sve-sm4" "-target-feature" "+sve2" "-target-feature" "+trbe" "-target-abi" "aapcs" "-debugger-tuning=gdb" "-fdebug-compilation-dir=/usr/local/google/home/paulkirth/llvm-fork/build/repro-test" "-fcoverage-compilation-dir=/usr/local/google/home/paulkirth/llvm-fork/build/repro-test" "-resource-dir" "/usr/local/google/home/paulkirth/llvm-fork/build/lib/clang/22" "-internal-isystem" "/usr/local/google/home/paulkirth/llvm-fork/build/bin/../include/aarch64-unknown-linux-gnu/c++/v1" "-internal-isystem" "/usr/local/google/home/paulkirth/llvm-fork/build/bin/../include/c++/v1" "-internal-isystem" "/usr/local/google/home/paulkirth/llvm-fork/build/lib/clang/22/include" "-internal-isystem" "/usr/local/include" "-internal-externc-isystem" "/include" "-internal-externc-isystem" "/usr/include" "-fdeprecated-macro" "-ferror-limit" "19" "-fmessage-length=106" "-fno-signed-char" "-fgnuc-version=4.2.1" "-fskip-odr-check-in-gmf" "-fcxx-exceptions" "-fexceptions" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-target-feature" "+outline-atomics" "-faddrsig" "-D__GCC_HAVE_DWARF2_CFI_ASM=1" "-o" "/tmp/dumb-bde16f.o" "-x" "c++" "crash.cpp"
 "/usr/local/google/home/paulkirth/llvm-fork/build/bin/clang" "-cc1" "-triple" "aarch64-unknown-linux-gnu" "-O2" "-emit-obj" "-dumpdir" "a-" "-disable-free" "-clear-ast-before-backend" "-main-file-name" "dumb.cpp" "-mrelocation-model" "pic" "-pic-level" "2" "-pic-is-pie" "-mframe-pointer=non-leaf" "-fmath-errno" "-ffp-contract=on" "-fno-rounding-math" "-mconstructor-aliases" "-funwind-tables=2" "-enable-tlsdesc" "-target-cpu" "grace" "-target-feature" "+v9a" "-target-feature" "+aes" "-target-feature" "+bf16" "-target-feature" "+ccidx" "-target-feature" "+complxnum" "-target-feature" "+crc" "-target-feature" "+dotprod" "-target-feature" "+ete" "-target-feature" "+fp-armv8" "-target-feature" "+fp16fml" "-target-feature" "+fpac" "-target-feature" "+fullfp16" "-target-feature" "+i8mm" "-target-feature" "+jsconv" "-target-feature" "+lse" "-target-feature" "+mte" "-target-feature" "+neon" "-target-feature" "+pauth" "-target-feature" "+perfmon" "-target-feature" "+rand" "-target-feature" "+ras" "-target-feature" "+rcpc" "-target-feature" "+rdm" "-target-feature" "+sha2" "-target-feature" "+sha3" "-target-feature" "+sm4" "-target-feature" "+spe" "-target-feature" "+ssbs" "-target-feature" "+sve" "-target-feature" "+sve-aes" "-target-feature" "+sve-bitperm" "-target-feature" "+sve-sha3" "-target-feature" "+sve-sm4" "-target-feature" "+sve2" "-target-feature" "+trbe" "-target-abi" "aapcs" "-debugger-tuning=gdb" "-fdebug-compilation-dir=/usr/local/google/home/paulkirth/llvm-fork/build/repro-test" "-fcoverage-compilation-dir=/usr/local/google/home/paulkirth/llvm-fork/build/repro-test" "-fdeprecated-macro" "-ferror-limit" "19" "-fmessage-length=106" "-fno-signed-char" "-fgnuc-version=4.2.1" "-fskip-odr-check-in-gmf" "-fcxx-exceptions" "-fexceptions" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-target-feature" "+outline-atomics" "-faddrsig" "-D__GCC_HAVE_DWARF2_CFI_ASM=1" "-x" "c++" "crash.cpp"

crash.cpp:

# 1 "<built-in>"
# 1 "crash.cpp"
char a, b;
char *c;
void d(bool e) {
  for (;;)
    for (int f; f < 2; f += e) {
      a = c[f];
      b += !f ?: e;
    }
}

I used the above case to test locally w/ something reported in the issue tracker a few days back.

ilovepi · 2025-10-13T23:25:58Z

Also, I know @arichardson has a probably more sophisticated version in the CHERI fork of LLVM, but this was what I could whip up in a few hours and test this morning. I figured its better to have something now that covers the basics, and we can just update this if it gets ported.

aeubanks

thanks for doing this, I really appreciate this. initial quick review, will do more thorough review later

can you link #126827?

aeubanks · 2025-10-13T22:59:59Z

clang/utils/reduce-clang-crash.py

+            print("Found MiddleEnd Crash")
+            return FailureType.MiddleEnd
+        if self.check_expected_output(
+            cmd=self.llc, args=[self.opt_level], filename=self.ir_file


it doesn't exist yet but we should have a -disable-output for llc

clang/utils/reduce-clang-crash.py

aeubanks · 2025-10-13T23:27:59Z

clang/utils/reduce-clang-crash.py

        creduce_flags += ["--n", str(max(4, multiprocessing.cpu_count() // 2))]

    r = Reduce(crash_script, file_to_reduce, creduce_flags)
+    if args.auto:


maybe we only try to reduce in opt/llc if those binaries are provided to us, rather than on the --auto flag?

I wanted to gate everything behind a flag to start and be sure the default flow wasn't changed. Happy to modify the mechanism if we're by and large happy w/ the direction of this patch.

aeubanks · 2025-10-13T23:28:50Z

clang/utils/reduce-clang-crash.py

+        print("Found Middle/Backend failure")
+        args = self.clang_args + [
+            "-mllvm",
+            "--print-on-crash",


--print-on-crash can be very slow since it's saving the entire module's IR after every pass, but we can reevaluate this in the future if this turns out to actually be slow in practice

Its nice in that its very simple to use this mechanism, vs. something like getting the pass number and then dumping that IR w/ -print-before. It certainly made scripting this simpler.

I factored this logic into some methods, that way if we want to switch the mechanism it should be slightly easier.

at some point we should also incorporate llvm/utils/reduce_pipeline.py, but that's for the future

oh, that'd be a nice improvement.

clang/utils/reduce-clang-crash.py

aeubanks · 2025-10-13T23:34:32Z

clang/utils/reduce-clang-crash.py

+
+        self.opt_level = extract_opt_level(self.clang_args) or "-O2"
+
+        if self.check_expected_output(


running the entire pipeline on the IR at the point of the crash isn't precisely matching what the crashing pass would see since we're running all the passes up to that point twice

ideally we'd go through a couple of steps:

run the specific pass that crashed (maybe extractable in the crash output?) on the --print-on-crash output rather than the entire pipeline

if that doesn't work, run the entire pipeline on the output of the frontend clang -Xclang -disable-llvm-passes -S -emit-llvm

and similarly for llc reduction it should run on the output post-optimization (clang -S -emit-llvm) rather than the IR on crash/IR output from frontend

if you want to push this off to another PR, please add a TODO

Yeah, I'd like to have something more sophisticated. I have a TODO around line 640, but maybe that's better here.

The backtrace seems to contain the exact pass spelling that things crash on, so I think it will be not that hard to adapt, but I wanted something we could iterate on w/o worrying about every detail. But, I agree there's lots of room for improvement here.

iiuc opt won't crash here because the IR we're passing it has already been through the optimization pipeline. we need to generate the IR with -Xclang -disable-llvm-passes

let me double check, but I think opt definitly still crashes here (at least in my test case). it certainly was in my earlier testing (and I'm pretty sure I confirmed it after I refactored this yesterday).

but I'd actually like to get the precise pass from the backtrace and use that to just craft the

opt -passes="function<eager-inv>(drop-unnecessary-assumes,float2int,lower-constant-intrinsics,loop(loop-rotate<header-duplication;no-prepare-for-lto>,loop-deletion),loop-distribute,inject-tli-mappings,loop-vectorize<no-interleave-forced-only;no-vectorize-forced-only;>,infer-alignment,loop-load-elim,instcombine<max-iterations=1;no-verify-fixpoint>,simplifycfg<bonus-inst-threshold=1;forward-switch-cond;switch-range-to-icmp;switch-to-lookup;no-keep-loops;hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,slp-vectorizer,vector-combine,instcombine<max-iterations=1;no-verify-fixpoint>,loop-unroll<O2>,transform-warning,sroa<preserve-cfg>,infer-alignment,instcombine<max-iterations=1;no-verify-fixpoint>,loop-mssa(licm<allowspeculation>),alignment-from-assumptions,loop-sink,instsimplify,div-rem-pairs,tailcallelim,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;speculate-unpredictables>)" < crash.ll

This is the backtrace I'm seeing locally. I'm not 100% on if any of this is gated on a specific cmake flag we set in a typical build of our toolchain, but having the pass string printed seems like a really easy way to get it to reproduce if we get the IR from -print-on-crash.

../bin/opt < reduced.ll -O2 -disable-output opt: /usr/local/google/home/paulkirth/llvm-fork/llvm/include/llvm/Support/Casting.h:109: static bool llvm::isa_impl_cl<llvm::VPWidenCastRecipe, const llvm::VPRecipeBase *>::doit(const From *) [To = llvm::VPWidenCastRecipe, From = const llvm::VPRecipeBase *]: Assertion `Val && "isa<> used on a null pointer"' failed. PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace and instructions to reproduce the bug. Stack dump: 0. Program arguments: ../bin/opt -O2 -disable-output 1. Running pass "function<eager-inv>(drop-unnecessary-assumes,float2int,lower-constant-intrinsics,loop(loop-rotate<header-duplication;no-prepare-for-lto>,loop-deletion),loop-distribute,inject-tli-mappings,loop-vectorize<no-interleave-forced-only;no-vectorize-forced-only;>,infer-alignment,loop-load-elim,instcombine<max-iterations=1;no-verify-fixpoint>,simplifycfg<bonus-inst-threshold=1;forward-switch-cond;switch-range-to-icmp;switch-to-lookup;no-keep-loops;hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,slp-vectorizer,vector-combine,instcombine<max-iterations=1;no-verify-fixpoint>,loop-unroll<O2>,transform-warning,sroa<preserve-cfg>,infer-alignment,instcombine<max-iterations=1;no-verify-fixpoint>,loop-mssa(licm<allowspeculation>),alignment-from-assumptions,loop-sink,instsimplify,div-rem-pairs,tailcallelim,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;speculate-unpredictables>)" on module "<stdin>" 2. Running pass "loop-vectorize<no-interleave-forced-only;no-vectorize-forced-only;>" on function "_Z1db" #0 0x0000555739c1b686 ___interceptor_backtrace ../../../../../../../../../llvm-llvm-project/compiler-rt/lib/asan/../sanitizer_common/sanitizer_common_interceptors.inc:4556:13 #1 0x0000555739daf2e4 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) /usr/local/google/home/paulkirth/llvm-fork/llvm/lib/Support/Unix/Signals.inc:838:8 [1] 3448508 IOT instruction ../bin/opt -O2 -disable-output < reduced.ll

github-actions · 2025-10-14T01:42:18Z

✅ With the latest revision this PR passed the Python code formatter.

arichardson · 2025-10-14T16:07:51Z

@ilovepi Thanks for reminding me about this - I should really upstream the remaining changes from https://github.com/CTSRD-CHERI/llvm-project/blob/dev/clang/utils/creduce_crash_testcase.py.

arichardson · 2025-10-14T17:45:50Z

clang/utils/reduce-clang-crash.py

+        if backend_result == FailureType.BackEnd:
+            return backend_result
+
+        # Try running w/ -emit-llvm to generate an IR file,


I'd also try changing the opt-level and/or disabling LLVM passes when creating the IR file. In https://github.com/CTSRD-CHERI/llvm-project/blob/238a953f5c7ca3de3cff67bf2433a0f59c9677ee/clang/utils/creduce_crash_testcase.py#L899C75-L899C96 I try the normal command first, then -O1 and then -O0 -disable-O0-optnone.

A lot of the time we also need many of the llc command flags to actually reproduce the crash so I tried mapping some of the clang args to llc args, but it's not very reliable. Luckily you can still use llvm-reduce with clang in the reproducer script.

+1, at some point if llc doesn't repro the backend crash we should also attempt to run this through clang -Xclang -disable-llvm-passes -c -o /dev/null which skips the opt pipeline and only runs the codegen pipeline

yeah, I think the strategy is still WIP. Likely this bit will get the most refactoring/has most room for improvement.

aeubanks

we really need to come up with some sort of testing strategy for this, otherwise I think it'll be very hard to maintain this script and make improvements without breaking things (would need some blessed way to crash the compiler, e.g. #pragma clang __debug crash crashes the frontend, need to figure out how to crash the middle/backend). I think I'd prefer that we figure that out before making improvements

aeubanks · 2025-10-15T20:59:12Z

clang/utils/reduce-clang-crash.py

+        print("Found Middle/Backend failure")
+        args = self.clang_args + [
+            "-mllvm",
+            "--print-on-crash",


at some point we should also incorporate llvm/utils/reduce_pipeline.py, but that's for the future

aeubanks · 2025-10-15T21:09:43Z

clang/utils/reduce-clang-crash.py

+            sys.exit("The interestingness test does not pass with '--print-on-crash'.")
+
+        # The output from --print-on-crash has an invalid first line (pass name).
+        remove_first_line(self.ir_file)


we should change the extra line to start with ; to avoid having to do this (I've also been annoyed with this in the past). lemme have someone make this change to get into the LLVM development workflow

oh, yeah, that'd be way simpler.

aeubanks · 2025-10-15T21:44:15Z

clang/utils/reduce-clang-crash.py

+    Returns:
+      The last matching optimization flag string if found, otherwise None.
+    """
+    valid_opt_flags = {"-O0", "-O1", "-O2", "-O3", "-Os", "-Oz", "-Og", "-Ofast"}


llc/opt won't recognize -Og/-Ofast

crap, so much for simple.

aeubanks · 2025-10-15T21:56:45Z

clang/utils/reduce-clang-crash.py

+
+        self.opt_level = extract_opt_level(self.clang_args) or "-O2"
+
+        if self.check_expected_output(


iiuc opt won't crash here because the IR we're passing it has already been through the optimization pipeline. we need to generate the IR with -Xclang -disable-llvm-passes

aeubanks · 2025-10-15T23:02:04Z

clang/utils/reduce-clang-crash.py

+        if backend_result == FailureType.BackEnd:
+            return backend_result
+
+        # Try running w/ -emit-llvm to generate an IR file,


+1, at some point if llc doesn't repro the backend crash we should also attempt to run this through clang -Xclang -disable-llvm-passes -c -o /dev/null which skips the opt pipeline and only runs the codegen pipeline

aeubanks · 2025-10-15T23:05:23Z

clang/utils/reduce-clang-crash.py


-    def check_expected_output(self, args=None, filename=None):
+    def check_expected_output(self, cmd=None, args=None, filename=None):
+        if not cmd:


having all these default arguments makes it very hard to reason, it would be nice to clean up the callers to explicitly pass things in

ack. I'll look at doing that.

boomanaiden154 · 2025-10-15T23:21:45Z

would need some blessed way to crash the compiler, e.g. #pragma clang __debug crash crashes the frontend, need to figure out how to crash the middle/backend

What do you think the easiest way to do this is? I'm wondering if we could add clang builtins that trigger verifier errors? Doing that for the middle end would be easy. I'm not sure it would be easy to codegen something that makes it through to the backend though. Introducing IR intrinsics that produce verifier failures is the cleanest I can think of but feels a bit heavyweight.

ilovepi · 2025-10-16T03:00:14Z

we really need to come up with some sort of testing strategy for this, otherwise I think it'll be very hard to maintain this script and make improvements without breaking things (would need some blessed way to crash the compiler, e.g. #pragma clang __debug crash crashes the frontend, need to figure out how to crash the middle/backend). I think I'd prefer that we figure that out before making improvements

why not add an off by default debugging pass we can add to a pipeline that just has a call to __builtin_trap() as its run() impl? could do the same for a machine function pass to check for backend stuff. that's probably good enough for testing purposes.

aeubanks · 2025-10-16T22:38:39Z

we really need to come up with some sort of testing strategy for this, otherwise I think it'll be very hard to maintain this script and make improvements without breaking things (would need some blessed way to crash the compiler, e.g. #pragma clang __debug crash crashes the frontend, need to figure out how to crash the middle/backend). I think I'd prefer that we figure that out before making improvements

why not add an off by default debugging pass we can add to a pipeline that just has a call to __builtin_trap() as its run() impl? could do the same for a machine function pass to check for backend stuff. that's probably good enough for testing purposes.

we have trigger-crash-module/trigger-crash-function as middle-end passes that crash for the purposes of testing. if we can hook these up that would definitely work, but the tricky part is how do we add these crashes to the pipeline in a way that we can add to a clang invocation but would also get passed to the opt command that we use to llvm-reduce. e.g. if we add a cl::opt<bool> AddCrashPass("add-crash-pass") and use clang -mllvm -add-crash-pass, we'd need to ensure we pass -mllvm flags in the clang command to the opt command. for something like trigger-crash-function we can test that the final reduced IR contains exactly one function and the opt command is opt -passes=trigger-crash-function.

could do the same with a backend crash where we add an MIR pass that crashes

ilovepi requested review from aeubanks, arichardson, arsenm, boomanaiden154 and rnk October 13, 2025 22:17

llvmbot added the clang Clang issues not falling into any other category label Oct 13, 2025

aeubanks reviewed Oct 13, 2025

View reviewed changes

Fix up detection for backend failures

cac2faa

Refactor IR extraction and checking

81d4fea

arichardson reviewed Oct 14, 2025

View reviewed changes

aeubanks reviewed Oct 15, 2025

View reviewed changes


		self.opt_level = extract_opt_level(self.clang_args) or "-O2"

		if self.check_expected_output(

[clang][utils] Add auto mode to reduction script #163282

Are you sure you want to change the base?

[clang][utils] Add auto mode to reduction script #163282

Uh oh!

Conversation

ilovepi commented Oct 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Oct 13, 2025

Uh oh!

ilovepi commented Oct 13, 2025

Uh oh!

ilovepi commented Oct 13, 2025

Uh oh!

aeubanks left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arichardson commented Oct 14, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aeubanks left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

boomanaiden154 commented Oct 15, 2025

Uh oh!

ilovepi commented Oct 16, 2025

Uh oh!

aeubanks commented Oct 16, 2025

ilovepi commented Oct 13, 2025 •

edited

Loading

github-actions bot commented Oct 14, 2025 •

edited

Loading