More closely mimic the Clang compilation #5543

ilya-biryukov · 2025-05-27T16:24:21Z

Instead of using ASTUnit and tooling APIs, directly mimic what Clang does during the compilation.

Run the Clang frontend until the translation unit parsing is done, at which point switch back to Carbon to finish the Check phase and interface with Clang through ASTContext and Sema. In lower, finish the corresponding Clang compilation phase, i.e. CodeGen.

Clang does not have the corresponding APIs and instead provides a callback-based mechanism. To map this back to Carbon APIs, we create a separate thread that gives control back to Carbon through the callbacks, and then finishes the compilation when the Carbon code is done.

Although essentially a hack, this allows to easily fit into the Carbon codebase quickly and we should have an option of refactoring Clang code in LLVM upstream if this approach proves fruitful.

Replace this paragraph with a description of what this PR is changing or
adding, and why.

Closes #ISSUE

Instead of using ASTUnit and tooling APIs, directly mimic what Clang does during the compilation. Run the Clang frontend until the translation unit parsing is done, at which point switch back to Carbon to finish the `Check` face and interface with Clang through `ASTContext` and `Sema`. In lower, finish the corresponding Clang compilation phase, i.e. CodeGen. Clang does not have the corresponding APIs and instead provides a callback-based mechanism. To map this back to Carbon APIs, we create a separate thread that gives control back to Carbon through the callbacks, and then finishes the compilation when the Carbon code is done. Although essentially a hack, this allows to easily fit into the Carbon codebase quickly and we should have an option of refactoring Clang code in LLVM upstream if this approach proves fruitful.

danakj · 2025-05-27T16:27:07Z

Run the Clang frontend until the translation unit parsing is done, at which point switch back to Carbon to finish the Check face

Did you mean Check phase?

ilya-biryukov · 2025-05-27T16:27:39Z

This is very raw: tests fail, an approach to codegen needs to be updated, the documentation and PR description needs to be improved.

Still posting this to get early feedback about the feasibility of this approach, especially the dance with multiple threads.
Sharing the same CompilerInstance, AST, Sema, etc between multiple threads is not unheard of, but, like other use-cases (e.g. clangd), it's unusual in its own way. LLVM should be prepared to handle that, though. Even if there are few global variables / thread locals left, they should be easily fixable.

…tion and Clang-related logic

ilya-biryukov · 2025-06-03T15:54:45Z

Did you mean Check phase?

🤦 I did, thanks for pointing that out.

CarbonInfraBot · 2025-06-04T14:15:56Z

toolchain/base/BUILD

@@ -24,6 +24,21 @@ cc_library(
    ],
 )

+cc_library(
+    name = "in_flight_clang",
+    hdrs = ["in_flight_clang.h"],


[diff] _{reported by reviewdog 🐶}

Suggested change

hdrs = ["in_flight_clang.h"],

CarbonInfraBot · 2025-06-04T14:15:56Z

toolchain/base/BUILD

+    name = "in_flight_clang",
+    hdrs = ["in_flight_clang.h"],
+    srcs = ["in_flight_clang.cpp"],
+    deps = [


[diff] _{reported by reviewdog 🐶}

Suggested change

deps = [

hdrs = ["in_flight_clang.h"],

deps = [

CarbonInfraBot · 2025-06-04T14:15:56Z

toolchain/base/BUILD

+    srcs = ["in_flight_clang.cpp"],
+    deps = [
+        "//common:check",
+        "@llvm-project//llvm:Support",


[diff] _{reported by reviewdog 🐶}

Suggested change

"@llvm-project//llvm:Support",

CarbonInfraBot · 2025-06-04T14:15:56Z

toolchain/base/BUILD

+        "@llvm-project//clang:driver",
+        "@llvm-project//clang:frontend",
+        "@llvm-project//clang:frontend_tool",
+    ],


[diff] _{reported by reviewdog 🐶}

Suggested change

],

"@llvm-project//llvm:Support",

],

ilya-biryukov · 2025-06-17T16:52:09Z

I've picked this up again and it's shaping up now, should be close to something I'd like to land.
In particular, we now use the code generator from Clang rather than creating our own and take the llvm::Module Clang produces.

I want to do one last polish of the code before sending this out for review, but it's now roughly what I want it to be in terms of behavior. One last bit that I want to do a little later is to get rid of the use of CodeGenerator for marking which functions are used and instead go through Clang frontend interfaces. But since it's pretty independent, I am thinking of doing it as a follow-up.

github-actions bot added the toolchain label May 27, 2025

bricknerb self-requested a review May 27, 2025 18:32

Factor out the callbacks more explicitly, separate thread synchroniza…

2ece08e

…tion and Clang-related logic

CarbonInfraBot reviewed Jun 4, 2025

View reviewed changes

ilya-biryukov added 5 commits June 17, 2025 18:46

Properly set the file manager

98065d1

Prevent unnecessary verbose output

b77b11d

Expose code generator from Clang

31c6c2e

clang-format

16cd0ba

Do not emit LLVM metadata, update test

b44c5db

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

More closely mimic the Clang compilation #5543

More closely mimic the Clang compilation #5543

Uh oh!

ilya-biryukov commented May 27, 2025 •

edited

Loading

Uh oh!

danakj commented May 27, 2025

Uh oh!

ilya-biryukov commented May 27, 2025

Uh oh!

ilya-biryukov commented Jun 3, 2025

Uh oh!

CarbonInfraBot Jun 4, 2025

Uh oh!

CarbonInfraBot Jun 4, 2025

Uh oh!

CarbonInfraBot Jun 4, 2025

Uh oh!

CarbonInfraBot Jun 4, 2025

Uh oh!

ilya-biryukov commented Jun 17, 2025

Uh oh!

Uh oh!

More closely mimic the Clang compilation #5543

Are you sure you want to change the base?

More closely mimic the Clang compilation #5543

Uh oh!

Conversation

ilya-biryukov commented May 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

danakj commented May 27, 2025

Uh oh!

ilya-biryukov commented May 27, 2025

Uh oh!

ilya-biryukov commented Jun 3, 2025

Uh oh!

CarbonInfraBot Jun 4, 2025

Choose a reason for hiding this comment

Uh oh!

CarbonInfraBot Jun 4, 2025

Choose a reason for hiding this comment

Uh oh!

CarbonInfraBot Jun 4, 2025

Choose a reason for hiding this comment

Uh oh!

CarbonInfraBot Jun 4, 2025

Choose a reason for hiding this comment

Uh oh!

ilya-biryukov commented Jun 17, 2025

Uh oh!

Uh oh!

ilya-biryukov commented May 27, 2025 •

edited

Loading