[FIRRTL][InferResets] Peel interesting operations parallelly, NFC #5629
Conversation
This commit reduces the execution time of InferResets by 30%~40%. Since it's costly to walk the entire IR before Dedup, it's low-hanging fruit to peel off the interesting operations (operations with uninferred resets, in this case) in parallel.

Before:
54.5713 ( 1.5%)  54.5713 ( 7.8%)  InferResets
After:
34.2072 ( 0.9%)  34.2072 ( 5.1%)  InferResets
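For readers following along, here is a minimal sketch of the pattern this commit uses, as reconstructed from the diff excerpts quoted in the review below. It is not the committed code verbatim; `context` and `modules` are placeholders, and standard MLIR/CIRCT includes are assumed:

```cpp
#include "mlir/IR/Threading.h"

// Pre-construct one (module, worklist) slot per module so that each
// parallel iteration mutates only its own entry -- this is what makes
// the loop below safe without any locking.
SmallVector<std::pair<FModuleOp, SmallVector<Operation *>>> moduleToOps;
for (auto module : modules) // `modules`: placeholder range of FModuleOps
  moduleToOps.push_back({module, {}});

// Walk every module in parallel, peeling off only the "interesting" ops
// (those whose operand or result types contain an uninferred reset).
mlir::parallelForEach(context, moduleToOps, [](auto &entry) {
  entry.first.walk([&](Operation *op) {
    if (llvm::any_of(op->getResultTypes(), typeContainsReset) ||
        llvm::any_of(op->getOperandTypes(), typeContainsReset))
      entry.second.push_back(op);
  });
});
```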
op->getResultTypes(),
    [](mlir::Type type) { return typeContainsReset(type); }) ||
    llvm::any_of(op->getOperandTypes(), typeContainsReset))
  e.second.push_back(op);
Is it thread-safe to update the vector moduleToOps in parallel? If it is, then this is great.
I think it's thread-safe because moduleToOps[i] is mutated only by the i-th iteration. That's why moduleToOps is constructed beforehand.
This should be safe. Another option here, since the result set is just aggregated, is to use a parallel reduce. This is more efficient, though, just slightly less clear (perhaps).
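For concreteness, here is a sketch of that parallel-reduce alternative using llvm::parallelTransformReduce from llvm/Support/Parallel.h. Names like `modules` and `isInteresting` are placeholders; note also that this utility schedules on LLVM's global thread pool rather than the MLIRContext's, which is one practical argument for the pre-allocated-slot approach inside a pass:

```cpp
#include "llvm/Support/Parallel.h"

using OpVec = SmallVector<Operation *, 8>;

// Transform: collect the interesting ops of one module.
// Reduce: concatenate the per-module partial results.
OpVec interesting = llvm::parallelTransformReduce(
    modules.begin(), modules.end(), OpVec{},
    [](OpVec lhs, OpVec rhs) {
      lhs.append(rhs.begin(), rhs.end());
      return lhs;
    },
    [](FModuleOp module) {
      OpVec ops;
      module.walk([&](Operation *op) {
        if (isInteresting(op)) // placeholder predicate
          ops.push_back(op);
      });
      return ops;
    });
```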
LGTM. Two things for possible future work:
- The walk-the-IR-and-collect-ops-into-a-worklist step is a common pattern that we should factor out (a hypothetical shape is sketched below).
- traceResets might be parallelizable with a reduction over all the partial (per-module) results (thus parallelizing the main loop rather than the identification loop).
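One possible shape of the factored-out helper from the first bullet; the name collectOpsParallel and its signature are invented here purely for illustration:

```cpp
#include "mlir/IR/Threading.h"

// Walk each module in parallel and collect the ops matching `pred` into a
// per-module worklist. Safe because each iteration writes only its own slot.
template <typename Pred>
static SmallVector<std::pair<FModuleOp, SmallVector<Operation *>>>
collectOpsParallel(MLIRContext *context, ArrayRef<FModuleOp> modules,
                   Pred pred) {
  SmallVector<std::pair<FModuleOp, SmallVector<Operation *>>> worklists;
  for (auto module : modules)
    worklists.push_back({module, {}});
  mlir::parallelForEach(context, worklists, [&](auto &entry) {
    entry.first.walk([&](Operation *op) {
      if (pred(op))
        entry.second.push_back(op);
    });
  });
  return worklists;
}
```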
if (typeContainsReset(e.type))
  return true;
return false;
static bool typeContainsReset(Type type) {
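For context, a plausible implementation of the helper being discussed, reconstructed from the excerpt above (a sketch, not necessarily the exact committed code):

```cpp
#include "circt/Dialect/FIRRTL/FIRRTLTypes.h"
#include "llvm/ADT/TypeSwitch.h"

using namespace mlir;
using namespace circt::firrtl;

// Recursively check whether a FIRRTL type contains an uninferred reset.
static bool typeContainsReset(Type type) {
  return TypeSwitch<Type, bool>(type)
      .Case<BundleType>([](BundleType bundle) {
        // A bundle contains a reset if any of its fields does.
        return llvm::any_of(bundle.getElements(), [](auto element) {
          return typeContainsReset(element.type);
        });
      })
      .Case<FVectorType>([](FVectorType vector) {
        // A vector contains a reset if its element type does.
        return typeContainsReset(vector.getElementType());
      })
      .Case<ResetType>([](auto) { return true; })
      .Default([](auto) { return false; });
}
```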
Feel free to peel this off and commit it directly.
Nice!
LGTM! Very cool to see that significant speedup.
For some of these inference passes I'm wondering if there's a mode where we run the module-local part of the pass in parallel (e.g. figuring out the reset networks within a module), and then do a global pass across the modules afterwards. Not sure that will help much here because you're already getting a lot of mileage out of the parallelization 😎
Yeah, I think it does make sense to accumulate the module-scope equivalence relation in parallel and apply reductions. I originally went in that direction, but I changed to the current, simpler implementation because the number of interesting operations is very small (there are 2*10^5 interesting ops out of 10^8 ops), so I guess the performance benefit might not be that much.
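For the record, a rough sketch of what merging per-module partial results could look like with llvm::EquivalenceClasses. The element type (numeric signal IDs) and the `partials` collection are assumptions for illustration; as noted above, the committed pass does not do this:

```cpp
#include "llvm/ADT/EquivalenceClasses.h"

// Merge per-module equivalence classes into one global relation by
// unioning every member of each class with that class's leader.
llvm::EquivalenceClasses<unsigned> merged;
for (auto &partial : partials) { // `partials`: hypothetical per-module results
  for (auto it = partial.begin(), e = partial.end(); it != e; ++it) {
    if (!it->isLeader())
      continue; // visit each equivalence class exactly once
    unsigned leader = it->getData();
    for (auto mi = partial.member_begin(it); mi != partial.member_end(); ++mi)
      merged.unionSets(leader, *mi); // unionSets inserts missing elements
  }
}
```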