New compiler. #4513

stuartarchibald · 2019-09-03T12:51:36Z

This PR is large, both in terms of churn and in terms of conceptual change.

The intent is as follows:

Redesign the Numba pipeline to be a bit more like the one in LLVM. This
comprises:
1. Passes must extend from a base class and implement defined methods.
2. Passes must be registered with a pass registry.
3. Instances of a PassManager class are used to orchestrate a "pipeline" of
  passes to execute.
4. A CompilerBase class instance holds the state, the PassManager
  orchestrated passes operate on this state.
5. A DefaultPassBuilder class provides static methods defining default or
  commonly used pipelines.
In addition, hooks for future work on safety and optimisation are added:
1. AnalysisUsage cf. that in LLVM is stubbed out to permit eventual
  declaration of analysis dependencies between passes.
2. Passes must return True/False depending on whether statement level changes
  have been made.
3. At registration time passes must declare whether they mutate the CFG and
  whether they are analysis only passes.
4. A simple timer is present to capture the execution time spent in each pass
  for a given pipeline execution.
5. Basic support for print_after like functionality to print the IR
  following any transform is added.

The approach taken in this PR:

Each "stage" in the old compiler pipeline is now approximately a single new
compiler pass.
The new compiler passes are split largely into two categories, those which
are untyped (i.e. do not need type information and run before type inference)
and those which are typed (i.e. need type information and run after type
inference). These can be found in numba.untyped_passes and
numba.typed_passes respectively. Passes relating to object mode are in
numba.object_mode_passes.
The fundamental types and machinery to build the new compilation chain can be
found in numba.compiler_machinery. This includes the PassManager, the
base class for passes CompilerPass, and the pass_registry decorator (used
for registering new passes).
The original compiler.py has been heavily refactored to make use of the
more modular design described above. Most notably the CompilerBase class is
now responsible for determining how to execute pipelines defined by
PassManager instances with each PassManager holding a single pipeline.
The compilation state information is also attached to the CompilerBase
class and is not part of a PassManager. This means users building their
own compilers do not have to subscribe to the Numba JIT compilation
semantics: the compiler state can be user defined, as can the pipeline
managed by the PassManager itself (along with all the passes and their
execution order).
A large amount of "fixing" was required throughout the code base to
accommodate the above.
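
As a rough illustration of the split between a compiler that owns the state and pipelines that only describe pass order, here is a toy sketch. `ToyCompilerBase`, `ToyPipeline` and the dict-based state are hypothetical names invented for this example, and running the defined pipelines one after another is an assumption made for simplicity, not a description of how CompilerBase schedules them.

```python
# Hypothetical sketch; none of these names come from Numba.
class ToyPipeline:
    def __init__(self, name, passes):
        self.name = name
        self.passes = list(passes)  # callables that take the state

    def run(self, state):
        for run_pass in self.passes:
            run_pass(state)


class ToyCompilerBase:
    def __init__(self, source):
        # The state lives on the compiler, not on the pipeline.
        self.state = {"source": source, "log": []}

    def define_pipelines(self):
        raise NotImplementedError("subclasses define their pipelines")

    def compile(self):
        # Assumption: simply run every defined pipeline in order.
        for pipeline in self.define_pipelines():
            pipeline.run(self.state)
        return self.state


def parse(state):
    state["tokens"] = state["source"].split()
    state["log"].append("parse")


def count(state):
    state["n_tokens"] = len(state["tokens"])
    state["log"].append("count")


class WordCountCompiler(ToyCompilerBase):
    """A user-defined 'compiler' with its own single pipeline."""

    def define_pipelines(self):
        return [ToyPipeline("wordcount", [parse, count])]


final_state = WordCountCompiler("a b c").compile()
```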

Outstanding items/areas of concern:

Does this meet the needs of Numba? Is it too complicated?
Is the CompilerPass design and its registration mechanism reasonable?
- Could be based on class instances instead of classes themselves to allow
  more configuration from the same code. But equally inheritance/composition
  could achieve the same.
- Could/should pipelines be derived and manipulated more easily?
Is this sufficiently recognisable in terms of compiler design to permit
getting up to speed quickly (along with new docs/examples)?
With respect to object mode deprecation, does this design permit/allow a
"mode" based pipeline as default? It seems like it could with separate
compiler instances extending from CompilerBase each defining a single
pipeline via the define_pipelines() method with new named pipelines for
each purpose added to the DefaultPassBuilder.

Work left on this PR:

Fix any failing tests.
Write new unit tests as needed.
Documentation.
Further code clean up/refactoring; there's some duplication between the modules holding passes.

ehsantn · 2019-09-06T21:53:25Z

I just tested this PR and everything seems good. It actually exposed a problem in my pipeline since there was a name conflict in stage function names. Also, the "print after" feature is great!

sklam · 2019-09-09T14:01:14Z

numba/compiler_machinery.py

+class CompilerPass(object):
+
+    @abstractmethod
+    def __init__(self, *args, **kwargs):


Why does __init__ have to be an abstractmethod? A lot of the time it's just a passthrough to the parent's implementation.

I don't think it needs to be. I tried a bunch of different ways of describing this class and the pipeline, and I think that this is an artifact from when pass registration required an instance of a pass to be registered. I was thinking about how pass classes might be extended, considered the merits of inheritance over composition, and settled for the latter as most of the time passes are pretty unique and should be state free by design.
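
For reference, a minimal sketch of the alternative being discussed: a concrete `__init__` on the base class, with only the method that does the work marked abstract. `SketchPass` and `NoOpPass` are hypothetical names, not the classes in this PR.

```python
from abc import ABC, abstractmethod


# Hypothetical stand-in for a pass base class.
class SketchPass(ABC):
    def __init__(self, *args, **kwargs):
        # Concrete, so subclasses can simply inherit it.
        pass

    @abstractmethod
    def run_pass(self, state):
        """Subclasses must implement the actual work."""


class NoOpPass(SketchPass):
    # No __init__ needed here; only run_pass has to be provided.
    def run_pass(self, state):
        return False


instance = NoOpPass()

# The base class itself still cannot be instantiated.
try:
    SketchPass()
    base_instantiable = True
except TypeError:
    base_instantiable = False
```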

sklam · 2019-09-09T14:06:06Z

I just tried adapting the literal dispatch PR to use the new pass API. It's so easy.

sklam · 2019-09-11T20:36:42Z

The latest commit is causing 28 (out of 47) errors in numba.tests.test_withlifting

stuartarchibald · 2019-09-11T21:00:08Z

Thanks @sklam, turns out that removing the .pipeline from the state to drop the self reference breaks that, so instead I think it's best to just set it to None as the function exits.

sklam · 2019-09-11T20:49:19Z

numba/compiler_machinery.py

+                break
+        else:
+            raise ValueError("Could not find pass %s" % location)
+        self.passes.insert(idx + 1, (pass_cls, str(pass_cls)))


Shouldn't the str(pass_cls) come from a description parameter?

It could be. I was maintaining the original interface style. Is a description useful in this new style, given that passes are uniquely defined and have a name? I am happy to do either.

The same pass could be added several times; i.e. a pass can require a DCE beforehand. A description can help inform why the pass is added. Not high priority though.
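
A sketch of what the suggested description parameter could look like, modelled on the for/else lookup in the diff above. `DescribedPassManager` and the placeholder pass classes are hypothetical; the signature is an assumption for illustration, not the one in this PR.

```python
# Hypothetical sketch of an optional per-insertion description.
class DescribedPassManager:
    def __init__(self):
        self.passes = []  # list of (pass_cls, description) tuples

    def add_pass(self, pass_cls, description=None):
        self.passes.append((pass_cls, description or str(pass_cls)))

    def add_pass_after(self, pass_cls, location, description=None):
        # Find the first occurrence of `location`; the else branch runs
        # only if the loop finished without hitting `break`.
        for idx, (existing, _) in enumerate(self.passes):
            if existing is location:
                break
        else:
            raise ValueError("Could not find pass %s" % location)
        self.passes.insert(idx + 1,
                           (pass_cls, description or str(pass_cls)))


# Placeholder pass classes, purely illustrative.
class Inline: pass
class DeadCodeElim: pass
class Lower: pass


pm = DescribedPassManager()
pm.add_pass(Inline, "inline calls")
pm.add_pass(Lower, "lower to machine code")
pm.add_pass_after(DeadCodeElim, Inline, "DCE required before lowering")
descriptions = [desc for _, desc in pm.passes]
```

Falling back to `str(pass_cls)` keeps the current behaviour when no description is supplied.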

sklam · 2019-09-11T20:52:34Z

numba/compiler_machinery.py

+
+    def finalize(self):
+        """
+        Finalize the PassManager, after which no more passes may be added and


no more passes may be added

^ Not enforced?

I was in two minds... On the one hand, it's quite useful to be able to say "finalize this, compute analysis usage etc" and have that be fixed. On the other, it's quite useful to be able to mutate a "finalized" pass, which unsets its finalized state, and subsequently re-finalize it when done. I think one way of achieving the latter, with finalization equating to immutable, would be to have a copy constructor or similar to generate a new pipeline from an existing one? Suggestions welcomed.

I like the idea of having a copy constructor.
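
One possible shape for that idea, as a toy sketch: finalization makes the manager immutable, and a copy yields a fresh, unfinalized pipeline that can be edited. `FreezablePassManager` and its `copy` method are hypothetical and not part of this PR.

```python
# Hypothetical sketch: finalize() is enforced, copy() gives a mutable clone.
class FreezablePassManager:
    def __init__(self, name, passes=()):
        self.name = name
        self._passes = list(passes)
        self._finalized = False

    @property
    def passes(self):
        return tuple(self._passes)

    def add_pass(self, pass_cls):
        if self._finalized:
            raise RuntimeError("pipeline %r is finalized" % self.name)
        self._passes.append(pass_cls)

    def finalize(self):
        # Analysis usage etc. could be computed here and then relied upon.
        self._finalized = True

    def copy(self, name=None):
        # The copy starts out unfinalized and owns its own pass list.
        return FreezablePassManager(name or self.name, self._passes)


class PassA: pass
class PassB: pass


base = FreezablePassManager("base")
base.add_pass(PassA)
base.finalize()

derived = base.copy("derived")
derived.add_pass(PassB)
derived.finalize()
```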

sklam · 2019-09-12T14:58:46Z

Looks to me that the self-reference is only needed for pipeline re-entrancy.

stuartarchibald · 2019-09-12T14:59:57Z

Looks to me that the self-reference is only needed for pipeline re-entrancy.

yeah, I think that's the case.

sklam

Thanks for the patch. This is an important refactor to clean up the compiler pipeline. Other comments in the review can be resolved later. We'll learn more once we use the new code in practice.

New compiler.

3090fe7

As title.

stuartarchibald force-pushed the wip/pass_managers_3 branch from 767f2cf to 3090fe7 September 3, 2019 13:20

sklam reviewed Sep 3, 2019

numba/compiler_machinery.py

sklam reviewed Sep 3, 2019

numba/compiler.py

sklam reviewed Sep 3, 2019

numba/compiler.py

Fix Py27 and respond to initial feedback.

ed995ad

As title.

stuartarchibald added the 2 - In Progress label Sep 3, 2019

stuartarchibald added this to the Numba 0.46 RC milestone Sep 3, 2019

Another go at Python 2

7d3fe27

As title

sklam reviewed Sep 9, 2019

stuartarchibald added 2 commits September 10, 2019 12:06

Python 2.7 fixes

acdaacc

As title.

Remove self reference

270b555

As title

stuartarchibald mentioned this pull request Sep 11, 2019

Make different stages of compiler pipeline invokable separately #2077

Closed

sklam reviewed Sep 11, 2019

stuartarchibald added 3 commits September 11, 2019 22:14

Merge remote-tracking branch 'upstream/master' into wip/pass_managers_3

041516c

Conflicts: numba/ir_utils.py

Remove self reference later

96adfc9

new style classes

6172ddd

sklam reviewed Sep 12, 2019

numba/compiler.py

stuartarchibald added 2 commits September 12, 2019 16:46

Address feedback, retry build

5d7c3a8

As title

Address feedback, retry build 2

dbe4366

As title

stuartarchibald added 3 - Ready for Review and removed 2 - In Progress labels Sep 12, 2019

stuartarchibald added 2 commits September 12, 2019 20:29

Fix flake8

fe8098e

As title

Merge branch 'master' into wip/pass_managers_3

2eb233c

Fix up against master

43ed3d6

As title

sklam added the 5 - Ready to merge label Sep 13, 2019

sklam approved these changes Sep 13, 2019

seibert merged commit 3a1e088 into numba:master Sep 13, 2019

stuartarchibald mentioned this pull request Oct 21, 2019

Formalise compiler passes #4008

Closed

stuartarchibald mentioned this pull request Sep 27, 2022

Tasks for deprecating/removing "fallback" compilation pipelines from the default @jit behaviour. #8465

Open

10 tasks
