[MLIR/Frontend] C++ compiler driver improvements, ability to compile textual IR #216

pengmai · 2023-07-20T21:36:08Z

Context: The previous version of the C++ compiler driver is a work-in-progress implementation that only lowers as far as an LLVM IR module. It is also missing important debugging features that exist in the current subprocess driver.

Description of the Change:

The C++ driver compiles an MLIR module down to an object binary file.
General refactoring, separating out the extensions from the driver into separate pybind modules.
Addition of the ability to use @qjit on a string containing textual IR (MLIR at any level and LLVM IR) and get it to run from Python.
Enzyme module is updated to be compiled statically.

Benefits:
Improved compilation time

Progress

Implement the C++ part of the driver functionality
Implement pipeline configuration Python API
Implement pipeline output printing Python API
Update the tests

[sc-41430]
[sc-41704]

* Add MHLO as C++ dependency to quantum-opt
* Configure parsing from C++
* Implement lowering to LLVM IR with custom metadata handling for Enzyme
* Fix bugs with memory ownership
* Clean up C calling convention
* Canonicalize ._mlir attribute in frontend
* python formatting
* Update tests with canonical IR
* Update canonicalized lit tests
* Add MHLO as a dependency to quantum dialect build in CI
* Formatting + typo in workflow
* CI: Attempt to re-checkout MHLO during dialect build
* CI: Attempt to use cached MHLO
* CI: Add _mlir_libs to mocked modules for doc build, fix logic issue with MHLO source caching
* CI: mock specific CAPI library
* CI: typo in docs configuration
* Switch mlir_canonicalize to generic pass runner, reorganize driver files
* Clean up, rename LLVMTarget to avoid confusion with core LLVMTarget
* Fix error message for mlir_run_pipeline
* Update mlir/CMakeLists.txt
  Co-authored-by: Ali Asadi <ali@xanadu.ai>
* Update copyright year
  Co-authored-by: Ali Asadi <ali@xanadu.ai>
* Update mlir/lib/Catalyst/Driver/Pipelines.cpp
  Co-authored-by: Ali Asadi <ali@xanadu.ai>
* Update copyright year
  Co-authored-by: Ali Asadi <ali@xanadu.ai>
* Update copyright year
  Co-authored-by: Ali Asadi <ali@xanadu.ai>
* Add #pragma once
  Co-authored-by: Ali Asadi <ali@xanadu.ai>
* Add #pragma once
  Co-authored-by: Ali Asadi <ali@xanadu.ai>
* Move MHLO passes to top-level CMake variable, documentation

---------

Co-authored-by: Ali Asadi <ali@xanadu.ai>


sergei-mironov · 2023-07-25T12:08:41Z

@pengmai @erick-xanadu
I would like to discuss some additional design decisions we might need to make regarding this driver. I think that the API proposed in this PR might actually limit some features we currently have. Although these features may not be very important, I think that I need to ask some questions before starting the integration. First, let me define the scope of the problem. I suggest thinking of this PR as changing the original "composable" API, which I imagine as

$MLIR_{mhlo} \to_{pl_0} MLIR_1 \to_{pl_1} MLIR_2 \to_{pl_2} ... \to_{pl_{N-1}} MLIR_N \to (LLVM,Path_{obj})$

into a "programmable black box" model

$(MLIR_{mhlo}, Spec) \to_{compileasm} (LLVM,Path_{obj},Effects)$

Here the arrow $\to_x$ denotes the evaluation of a pipeline $x$ consisting of some passes; $MLIR$ and $LLVM$ are the corresponding IRs in string form; $Effects$ are side effects, such as printing to streams or dumping files to disk, which become important to note in the latter design. Finally, $Spec$ is a placeholder for a configuration object that we might want to pass to the compile_asm function to make it more configurable.
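To make the two models concrete, here is a minimal Python sketch. It is only an illustration: the toy pipelines `lower_a` and `lower_b` and the functions `run_composable` and `compile_black_box` are hypothetical names, not part of the driver, and real pipelines would run MLIR passes rather than append text.

```python
from typing import Callable, Dict, List, Tuple

# A pipeline is modelled as a function from IR text to IR text.
Pipeline = Callable[[str], str]


def lower_a(ir: str) -> str:
    """Toy stand-in for a lowering pipeline (hypothetical)."""
    return ir + " |a"


def lower_b(ir: str) -> str:
    """Second toy stand-in for a lowering pipeline (hypothetical)."""
    return ir + " |b"


def run_composable(ir: str, pipelines: List[Pipeline]) -> List[str]:
    """Composable model: the caller chains pipelines and sees every intermediate IR."""
    outputs = []
    for pipeline in pipelines:
        ir = pipeline(ir)
        outputs.append(ir)
    return outputs


def compile_black_box(ir: str, spec: Dict[str, Pipeline]) -> Tuple[str, List[str]]:
    """Black-box model: one call takes the IR plus a Spec and returns the
    final IR together with a record of the effects it performed."""
    effects = []
    for name, pipeline in spec.items():
        ir = pipeline(ir)
        effects.append(f"ran {name}")
    return ir, effects


intermediates = run_composable("mhlo", [lower_a, lower_b])
final_ir, effects = compile_black_box("mhlo", {"pl0": lower_a, "pl1": lower_b})
```

In the first function the caller decides what to keep or print between steps; in the second, anything the caller wants to observe has to be expressed through the spec or returned as part of the effects.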

The questions are:

What level of detail do we want in a pipeline configuration in $Spec$? In the previous version of the design, users were able to combine MLIR passes into pipelines themselves. Do we need to keep CompilerDriver fully compatible with this? If so, I can imagine that $Spec$ may be a dict-of-lists parameter similar to this definition, which is currently hardcoded. Alternatively, we may just allow passing the names of some pre-defined pipelines.
What level of detail do we need in the $Effects$? The print_stage functionality that we had previously allowed users to print the pipeline outputs in any combination. Lit tests typically want to print the output of only one pipeline. Are we allowed to limit the API scope to this use case?
What is the idea behind the mlir_run_pipeline function? It accepts inputs and returns outputs in string form, which means that we may lose the speed benefits if we call it multiple times. At the same time, it does not honor the compile options and is not designed to, e.g., infer IR attributes, so we can't use it in sequence with compile_asm. Should we remove it in favor of a more generic version of compile_asm?

erick-xanadu · 2023-07-25T13:52:40Z

> I suggest thinking of this PR as changing the original "composable" API, which I imagine as
>
> $MLIR_{mhlo} \to_{pl_0} MLIR_1 \to_{pl_1} MLIR_2 \to_{pl_2} ... \to_{pl_{N-1}} MLIR_N \to (LLVM,Path_{obj})$
>
> into a "programmable black box" model
>
> $(MLIR_{mhlo}, Spec) \to_{compileasm} (LLVM,Path_{obj},Effects)$

I think both models are equivalent. The first one also had effects (printing to a file and preserved in the file system). And the Spec was just the default pipeline. We only gave the users the ability to define their own pipeline, which I think should also be possible in the compiler driver, but I haven't investigated.

> If so, I can imagine that $Spec$ may be a dict-of-lists parameter similar to this definition, which is currently hardcoded.

Can you elaborate on this?

I think for the scope of this PR, we can limit the ability for the user to define their own pass pipelines if it is getting in the way while we think of which passes are useful to us. In GCC, there's no option for the user to add passes (beyond enabling passes that are disabled by default) without recompiling the compiler. Similarly, if the user wants to change the order of transformations, they would need to recompile the compiler. I think this wouldn't be too bad but I agree that it would take away some of the dynamism we are accustomed to.

> What level of detail do we need in the $Effects$? The print_stage functionality that we had previously allowed users to print the pipeline outputs in any combination.

What do you mean by "in any combination"?

I think the Compiler Driver already prints all the IR with the shouldPrintAfterPass. Or do you mean that you would like to see the options in between?

sergei-mironov · 2023-07-25T15:25:51Z

> $MLIR_{mhlo} \to_{pl_0} MLIR_1 \to_{pl_1} MLIR_2 \to_{pl_2} ... \to_{pl_{N-1}} MLIR_N \to (LLVM,Path_{obj})$
>
> $(MLIR_{mhlo}, Spec) \to_{compileasm} (LLVM,Path_{obj},Effects)$

> The first one also had effects (printing to a file and preserved in the file system).

Yes, this is true. I didn't mention effects in the current design because I am assuming that users can control them quite precisely via the Python API (so we are not that responsible for them).

> I think both models are equivalent.

No, I don't think so, unless $Spec$ is as powerful as the Python that we are using now. But I do think that we might not actually need all this power. So I would like to know what we expect from the pipeline configurations.

> > If so, I can imagine that $Spec$ may be a dict-of-lists parameter similar to this definition, which is currently hardcoded.

> Can you elaborate on this?

The idea I have in mind is to allow users to call compile_asm like this:

filename, llvm_ir, *inferred_data = compile_asm(ir, workspace_name, module_name, ...
   pipelines = {
     'mhloToCorePasses' : ["func.func(chlo-legalize-to-hlo)", ..., "convert-to-signless" ],
     'quantumCompilationPasses' : ["this-pass", "that-pass", ...],
     ...
     })
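As a rough illustration of how such a dict-of-lists could be consumed, the sketch below turns each list of pass names into MLIR's textual pass-pipeline form, nested under an anchor operation. The helper names `to_pass_pipeline` and `render_spec` are hypothetical, as is the `examplePipeline` entry; nothing here is the driver's actual implementation.

```python
from typing import Dict, List


def to_pass_pipeline(passes: List[str], anchor: str = "builtin.module") -> str:
    """Join pass names into a textual pass pipeline anchored on one operation."""
    return f"{anchor}({','.join(passes)})"


def render_spec(pipelines: Dict[str, List[str]]) -> Dict[str, str]:
    """Render every named pipeline; dicts keep insertion order, so the
    pipelines stay in the order the caller listed them."""
    return {name: to_pass_pipeline(passes) for name, passes in pipelines.items()}


# Abbreviated example spec: the pass lists are placeholders, not the real defaults.
spec = {
    "mhloToCorePasses": ["func.func(chlo-legalize-to-hlo)", "convert-to-signless"],
    "examplePipeline": ["canonicalize", "cse"],
}
rendered = render_spec(spec)
```

Keeping the spec as plain strings means it can cross the Python/C++ boundary without any custom types, at the cost of only detecting misspelled pass names when the pipeline is parsed.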

> I think for the scope of this PR, we can limit the ability for the user to define their own pass pipelines if it is getting in the way while we think of which passes are useful to us. In GCC, there's no option for the user to add passes (beyond enabling passes that are disabled by default) without recompiling the compiler. Similarly, if the user wants to change the order of transformations, they would need to recompile the compiler. I think this wouldn't be too bad but I agree that it would take away some of the dynamism we are accustomed to.

> > What level of detail do we need in the $Effects$? The print_stage functionality that we had previously allowed users to print the pipeline outputs in any combination.

> What do you mean by "in any combination"?

I can imagine users calling print_stage like we do in tests, but they may do the printing in any order. I am wondering whether we need to keep this feature or whether (as one option) we can just add a parameter asking the driver to output the result of a single stage X instead of the final result.

Another option is to hardcode the pipelines in C++, but we would still need to name them so that we can refer to them, for example in the print_stage configuration.

> I think the Compiler Driver already prints all the IR with the shouldPrintAfterPass. Or do you mean that you would like to see the options in between?

Hmm, probably yes, if I understand you correctly. In the tests we run the pipeline but only want the result of a single intermediate pipeline.
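A minimal sketch of the "output the result of a single stage" option, under the assumption that stages are named and run in a fixed order. The function `compile_stages`, its `output_stage` parameter, and the toy stages are all hypothetical and only illustrate the control flow.

```python
from typing import Callable, List, Optional, Tuple

# A stage is a (name, function) pair; the function maps IR text to IR text.
Stage = Tuple[str, Callable[[str], str]]


def compile_stages(ir: str, stages: List[Stage], output_stage: Optional[str] = None) -> str:
    """Run stages in order and return the IR after `output_stage`,
    or the final IR when no stage is requested."""
    names = [name for name, _ in stages]
    if output_stage is not None and output_stage not in names:
        raise ValueError(f"unknown stage {output_stage!r}; expected one of {names}")
    for name, run in stages:
        ir = run(ir)
        if name == output_stage:
            return ir  # stop early: later stages are not needed
    return ir


toy_stages = [
    ("stage0", lambda ir: ir + " -> s0"),
    ("stage1", lambda ir: ir + " -> s1"),
    ("stage2", lambda ir: ir + " -> s2"),
]
only_stage1 = compile_stages("input", toy_stages, output_stage="stage1")
final = compile_stages("input", toy_stages)
```

Stopping at the requested stage is a design choice in this sketch; a driver that must always produce an object file would instead record the intermediate result and keep going.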

erick-xanadu · 2023-07-25T15:46:50Z

> No, I don't think so, unless $Spec$ is as powerful as the Python that we are using now. But I do think that we might not actually need all this power. So I would like to know what we expect from the pipeline configurations.

Yes, the user could supply any function whatsoever, but the intended use case is mostly a way to specify command-line arguments to mlir-opt and similar tools.

> The idea I have in mind is to allow users to call compile_asm like this:

Yeah, something like that would be great! EDIT: I also wouldn't be opposed to essentially having 'mhloToCorePasses' : -mhlo-lowering, with -mhlo-lowering expanded on the C++ side to the current definition.
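The alias idea could look roughly like the sketch below, written in Python for brevity even though the expansion is proposed for the C++ side. The table `PREDEFINED` and the function `expand_aliases` are hypothetical, and the expansion shown for `-mhlo-lowering` is an abbreviated placeholder rather than the real pass list.

```python
from typing import Dict, List

# Hypothetical alias table. The real definition lives in the driver and is
# longer; only two passes are listed here as placeholders.
PREDEFINED: Dict[str, List[str]] = {
    "-mhlo-lowering": ["func.func(chlo-legalize-to-hlo)", "convert-to-signless"],
}


def expand_aliases(passes: List[str], predefined: Dict[str, List[str]] = PREDEFINED) -> List[str]:
    """Replace each known alias with its pass list; keep other entries unchanged."""
    expanded: List[str] = []
    for entry in passes:
        expanded.extend(predefined.get(entry, [entry]))
    return expanded


pipelines = {
    "mhloToCorePasses": ["-mhlo-lowering"],
    "custom": ["canonicalize", "-mhlo-lowering"],
}
resolved = {name: expand_aliases(passes) for name, passes in pipelines.items()}
```

With this shape, users who only need the defaults pass a single alias, while users who want finer control can still mix aliases with explicit pass names.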

> I can imagine users calling print_stage like we do in tests, but they may do the printing in any order. I am wondering do we need to keep this feature or (as one option) we can just ask the driver to output the result of a single stage X instead of the final result.

I don't fully understand the notion of order here, but I think the main point is that the user did not specify that there would be an output. I think both options (having the human-readable IR for these stages available vs. printing it only on demand) have their use cases. Accessing the .mlir field in the QJIT object is a nice convenience. Avoiding printing it out is more efficient. I think we should preserve the behaviour for .qir and .mlir though.

> In the tests we run the pipeline but only want the result of a single intermediate pipeline.

We can dump it to a file and not print it. In the tests, all human-readable IRs are dumped to files (and, as you pointed out elsewhere, read into a dictionary), but they are only printed to stdout when the user requests it. I think we can keep that behaviour for .qir and .mlir.
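The dump-then-print-on-demand behaviour can be sketched as follows. The file naming scheme and the helpers `dump_stage`, `load_stages`, and `show_stage` are assumptions made for illustration; they are not the workspace layout the driver actually uses.

```python
import tempfile
from pathlib import Path
from typing import Dict


def dump_stage(workspace: Path, index: int, name: str, ir: str) -> Path:
    """Write one stage's IR to the workspace; the index prefix keeps files ordered."""
    path = workspace / f"{index}_{name}.mlir"
    path.write_text(ir)
    return path


def load_stages(workspace: Path) -> Dict[str, str]:
    """Read every dumped stage back into a {stage name: IR text} dictionary."""
    return {
        path.stem.split("_", 1)[1]: path.read_text()
        for path in sorted(workspace.glob("*.mlir"))
    }


def show_stage(stages: Dict[str, str], name: str) -> None:
    """Print a stage only when the caller explicitly asks for it."""
    print(stages[name])


with tempfile.TemporaryDirectory() as tmp:
    workspace = Path(tmp)
    dump_stage(workspace, 0, "mhlo", "module @a {}")
    dump_stage(workspace, 1, "core", "module @b {}")
    stages = load_stages(workspace)
```

Nothing is written to stdout unless `show_stage` is called, which mirrors the split between keeping intermediates on disk and printing them on request.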


dime10

🥳

pengmai and others added 15 commits July 17, 2023 10:00

Merge branch 'main' of github.com:PennyLaneAI/catalyst into staging/c…

ed0af43

…++-compiler-driver

Saving WIP compilation from IR

9b469a8

Add compilation of LLVM IR module to object file

f4b368b

Remove llvm module dump, set PIC as relocation model

0ef21fe

Add support for compiling from textual IR (MLIR/LLVM IR)

6b572ac

Break out compiler driver into separate pybind 11 module with C++ lin…

110dc0c

…kage

Removed unused 'source' compile option from Python

77298fc

Implement printing intermediate IR to workspace

5d1558a

Add dumping of LLVM IR, use std::optional instead of raw pointer

48fcb55

Always dump the LLVM IR module, tweaks to the printing intermediate p…

a484bee

…ipeline

break out registerAllCatalystPasses to reduce redundancy

3b4964e

Bugfixes with pass pipeline and running compiler pipeline

71e5cc2

Add workaround for test_tensor_ops tests

8f29239

Remove unused include

74a5808

pengmai changed the title ~~Jmp/c++ compile from ir~~ [MLIR/Frontend] C++ compiler driver improvements, ability to compile textual IR Jul 20, 2023

pengmai requested review from erick-xanadu and sergei-mironov July 20, 2023 21:37

Sergei Mironov added 2 commits July 24, 2023 10:54

Support 'verbose' Python-side argument

45c25bc

Adjust the comments and apply formatters

60d1648

Print messages using py::print

241d66f

Sergei Mironov added 5 commits July 26, 2023 13:20

[WIP] Add Python-side pipeline configuration parameter

da99eaa

Implementing the pipeline configuration and result retrieval API

b3be13e

Rename CompilerDriver -> CppCompiler

d2dbfe1

Re-organise compiler wrappers, adjust tests

a43e415

Merge Pipelines into CompileDriver file

f0d8c1d

erick-xanadu and others added 17 commits September 18, 2023 17:00

Remove TargetParser to LLVM_LINK_COMPONENTS.

5fc7c3d

Remove Target from LLVM_LINK_COMPONENTS.

5fa77e4

Remove Support from LLVM_LINK_COMPONENTS.

de1c680

Remove SelectionDAG.

5c9bdf7

Remove ScalarOpts.

d4bc092

Remove a bunch.

5c49e3a

More.

2a7652f

More.

96af1ff

More.

d630c7f

Comment.

24f233c

Only dependency on z.

3b74d0b

Merge branch 'main' into eochoa/2023-09-07/c++-compile-from-ir

1dcb84b

Apply suggestions from code review

8ac3e33

Co-authored-by: David Ittah <dime10@users.noreply.github.com>

Apply suggestions from code review

135be31

Co-authored-by: David Ittah <dime10@users.noreply.github.com>

Add documentation for pipelines.

ffe25de

Style.

e9107e2

Typo.

de35930

dime10 mentioned this pull request Sep 19, 2023

Scatter lowering #273

Merged

erick-xanadu added 7 commits September 19, 2023 13:40

Target the compiler_driver.

45f3359

Update Makefile echo.

0f95b85

Adds comments.

16025d5

Comment.

f99ece8

Move linking to core catalyst driver.

69ed526

Re-add test.

f0dfdbb

Only print intermediates when keep_intermediate=True.

a352ba8

erick-xanadu force-pushed the jmp/c++-compile-from-ir branch from 5bf7d9e to a352ba8 Compare September 19, 2023 20:33

erick-xanadu requested a review from dime10 September 19, 2023 20:44

dime10 approved these changes Sep 19, 2023


erick-xanadu merged commit 64be9d2 into main Sep 20, 2023
19 of 20 checks passed

erick-xanadu deleted the jmp/c++-compile-from-ir branch September 20, 2023 12:38
