Integrating alp with iree-llvm-sandbox #83

giuseros · 2021-12-03T18:41:44Z

This is the first integration for our project in alp/experimental. I tried to make the minimal number of changes outside of this folder, but:

I still wanted to integrate with mlir-proto-opt
I still want to be able to switch the experimental/alp folder from the root CMakeLists.txt (although everything is disabled by default)
All of this is super-early stage and it is very buggy and not tested. However, I wanted to show earlier than later how we would like to integrate with this, and what were the main ideas. In particular:
I integrate at the c++ level (so mostly reusing mlir-proto-opt, but not reusing the Pyhton harness you have)
The major components are there (minimal python harness+autotuner+few simple passes). Once I have it cleaned up, I can add a README.md to explain how to use it.
If you have any early comment on anything, please feel free to add them or to open an issue

Thank you so much,
Giuseppe

CMakeLists.txt

nicolasvasilache · 2021-12-06T07:22:58Z

CMakeLists.txt

@@ -32,6 +32,15 @@ set(MLIR_MAIN_SRC_DIR ${LLVM_MAIN_SRC_DIR}/../mlir)
 set(MLIR_INCLUDE_DIR ${LLVM_MAIN_SRC_DIR}/../mlir/include)
 set(MLIR_TABLEGEN_OUTPUT_DIR ${CMAKE_BINARY_DIR}/tools/mlir/include)

+# Disable experimental alp by default
+set(SANDBOX_ENABLE_ALP OFF)
+if (SANDBOX_ENABLE_ALP)


I am fine with this level of integration, I agree for projects that want it it is good to be buildable and integrated from the get go.
The fact that it is optional and cannot break the CI SGTM!

experimental/alp/CMakeLists.txt

experimental/alp/alp/compile_op.py

experimental/alp/alp/library/blas.py

experimental/alp/lib/Transforms/extract_kernel_pass.cpp

experimental/alp/lib/Transforms/modulo_scheduling_pass.cpp

nicolasvasilache · 2021-12-06T07:42:28Z

experimental/alp/lib/Transforms/modulo_scheduling_pass.cpp

+using namespace mlir;
+
+namespace{
+  struct ModuloSchedulingPass : public ModuloSchedulingPassBase<ModuloSchedulingPass>


Not reviewing the pass itself for now, this would need a bunch of .mlir tests to qualify as "in reviewable state".

tools/mlir-proto-opt/mlir-proto-opt.cpp

nicolasvasilache · 2021-12-06T07:49:50Z

I still wanted to integrate with mlir-proto-opt
I still want to be able to switch the experimental/alp folder from the root CMakeLists.txt (although everything is disabled by default)

SGTM, since everything is optional I don't see an issue.
We may want to restructure the CMake a bit so that the number of places touched by a future such addition is minimized
but this is fine for now and you don't need to shoulder this burden (I expect the next project will have some refactoring requests).

All of this is super-early stage and it is very buggy and not tested. However, I wanted to show earlier than later how we would like to integrate with this, and what were the main ideas. In particular:
I integrate at the c++ level (so mostly reusing mlir-proto-opt, but not reusing the Pyhton harness you have)

Sounds good for a first commit and to get started. I still think in the future the proper efforts should be made to either integrate with the harness or to come up with something better we can all reuse; I added comments where appropriate.
Note in particular the point about just reusing ScikitLearn/pandas/pytorch rather than having to reimplement a lot of functionality in C++ to just be able to do anything: composition is our friend, both in IR and with libraries.

The major components are there (minimal python harness+autotuner+few simple passes). Once I have it cleaned up, I can add a README.md to explain how to use it.
If you have any early comment on anything, please feel free to add them or to open an issue

I gave you review comments, no need to address everything now but please open the proper issues (prefixed with [ALP]) for the stuff you punt on.

nicolasvasilache · 2021-12-06T07:51:25Z

Note I create and landed the alp subdir in an effort to try and give you rights on it specifically.
However it is not a default github mode and I have to parse through this:
https://stackoverflow.com/questions/40567468/give-permissions-on-project-folder-in-github?noredirect=1&lq=1

It will take me a little time as I have a bunch of stuff in my stack.

giuseros · 2021-12-07T00:52:34Z

Thanks Nicolas for this first round of comments. I started to reply to some. Few overall replies:

Our main focus for now is performance. We need to show that, by using this framework we can compete with handwritten routines. We have proof of this for a very specific case, but now we are trying to get more general results.
Once we are happy with performance we will refactor everything to generalize to other operations. There are some features in the harness that I feel are missing, but we can try to add them

One question I didn't ask yet. Do you have any performance result for x86? Do they look promising?

nicolasvasilache · 2021-12-07T07:19:54Z

Re. Our main focus for now is performance.
Re. Once we are happy with performance we will refactor everything
Yes understood, the objective here is to give you some broad feedback so you have a few directions for the future.
You should absolutely optimize for your velocity at this point.

Still, it looks like you have a string-stichy and inflexible bandaid that limits you in ways you may not realize (what you have is very close to what the sandbox started with, we invested into making the QoL better for these reasons). You have to decide
what is best for your current sprint, I can only share my experience 😄

Re x86, the various benchmarks run at a high fraction of peak on my AVX512 machine. I have previously dumped some perf results here:

matmul: https://gist.github.com/nicolasvasilache/9a526e6af1aae841a4f97d49f9b37db1
1-d conv: https://gist.github.com/nicolasvasilache/356c1be72370f8d82b0fa6cadd7676b2
1-d depthwise conv: https://gist.github.com/nicolasvasilache/570868fddf59c141a8647ec28de99aee

The matmul cases has mixes of divisible and non-divisble sizes; I get similar perf with ? everywhere.
The conv / depthwise conv cases are all divisible, the first objective was to ensure we can get the main kernel to high-perf.
I have not yet tried to plug padding for non-divisible cases with those.
I also plugged and ran the 2-d cases for conv and depthwise conv to similar high perf (note that depthwise 1-D is much less arithmetic intensive and it is prob limited by instruction issue / number of instructions).

Lastly, I did some moderate amount of manual searching to get to these perf points, scaling up search (esp. for the kernel) and rooting out the type of issue you mention re outlining will be important for us too.

Hope this helps!

nicolasvasilache · 2021-12-08T14:13:47Z

could you please rebase/fix CI/land?

giuseros · 2021-12-08T16:40:44Z

I tried to rebase and fix few things. Let's see if CI is happy.

giuseros · 2021-12-08T16:44:49Z

Alright, CI is happy, but I am not authorized to merge

giuseros force-pushed the alp_experimental branch from 0096c00 to 495f7b2 Compare December 3, 2021 21:59

nicolasvasilache approved these changes Dec 6, 2021

View reviewed changes

This was referenced Dec 8, 2021

Can we have the python harness to emit directly the LLVM ll file? #92

Closed

Can the harness produce intermediate compilation IRs in separate files? #93

Closed

Can we pass to the harness a partially transformed file and execute only a partial pipeline? #94

Open

giuseros force-pushed the alp_experimental branch 2 times, most recently from 5c535ca to 76945be Compare December 8, 2021 16:32

Integrating with iree-llvm-sandbox

ec5f72f

giuseros force-pushed the alp_experimental branch from 76945be to ec5f72f Compare December 8, 2021 16:37

nicolasvasilache merged commit 3464454 into iree-org:main Dec 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrating alp with iree-llvm-sandbox #83

Integrating alp with iree-llvm-sandbox #83

giuseros commented Dec 3, 2021

nicolasvasilache Dec 6, 2021

nicolasvasilache Dec 6, 2021

nicolasvasilache commented Dec 6, 2021

nicolasvasilache commented Dec 6, 2021

giuseros commented Dec 7, 2021

nicolasvasilache commented Dec 7, 2021

nicolasvasilache commented Dec 8, 2021

giuseros commented Dec 8, 2021

giuseros commented Dec 8, 2021

Integrating alp with iree-llvm-sandbox #83

Integrating alp with iree-llvm-sandbox #83

Conversation

giuseros commented Dec 3, 2021

nicolasvasilache Dec 6, 2021

Choose a reason for hiding this comment

nicolasvasilache Dec 6, 2021

Choose a reason for hiding this comment

nicolasvasilache commented Dec 6, 2021

nicolasvasilache commented Dec 6, 2021

giuseros commented Dec 7, 2021

nicolasvasilache commented Dec 7, 2021

nicolasvasilache commented Dec 8, 2021

giuseros commented Dec 8, 2021

giuseros commented Dec 8, 2021