(2 of 2) of the rest of the new bytecode emitter #2889

magarciaEPFL · 2013-08-29T08:49:41Z

This PR completes the new bytecode emitter, including building and writing class files in parallel, by building upon #2863

Before writing class files, a modicum of code tightening transformations are necessary: unreachable code elimination, which in turn calls for cleanup of labels and exception entries made redundant by unreachable code elimination.

In this PR, each such transformation is introduced in a commit with tests for the transformation in isolation. Another commit also in this PR tests each such transformation in connection with the bytecode emitter.

I'll suggest a reviewer once tests pass, comments and reviews welcome.

The assert in question was aimed at ruling out gotos (ie "jumping-applys") in actual argument position of a jumping-apply. But the assert in question went overboard to also rule out a LabelDef in actual argument position. This commit removes the assert in question altogether. The unwanted behaviors, and only those, are rule out by the test added in this commit and the existing tests for SI-6089. See also https://issues.scala-lang.org/browse/SI-7749

SI-5604 was fixed in commit e6f10b0

This bytecode transformer rephrases one ore more jumps in a multi-jump chain to target its final destination via a single jump Details in the documentation for scala.tools.nsc.backend.bcode.JumpChainsCollapser

This bytecode transformer removes those LabelNodes, LocalVarEntries, and LineNumberNodes that are redundant. Details in the documentation for scala.tools.nsc.backend.bcode.LabelsCleanup

This bytecode transformer removes those exception-entries (corresponding to try-catch-blocks) that are redundant. Details in the documentation for scala.tools.nsc.backend.bcode.DanglingExcHandlers

This bytecode transformer removes code that is unreachable. Details in the documentation for scala.tools.nsc.backend.bcode.UnreachableCode

Before writing classfiles with "optimization level zero" (ie -Ybackend:GenBCode) the very least we want to do is remove dead code beforehand, so as to prevent an artifact of stack-frames computation from showing up, the artifact described at http://asm.ow2.org/doc/developer-guide.html#deadcode That artifact results from the requirement by the Java 6 split verifier that a stack map frame be available for each basic block, even unreachable ones. Additionally, for dead-code elimination to be effective, each multi-jump chain should be collapsed to target its final destination via a single jump. Details in: - scala.tools.nsc.backend.bcode.JumpChainsCollapser Just removing dead code might leave stale entries (ie pointing to LabelNodes removed during dead-code elimination) in LocalVariableTable as well as in the exception-handlers table, thus `EssentialCleanser.cleanseMethod()` also gets rid of those. The classes in charge of that are: - scala.tools.nsc.backend.bcode.LabelsCleanup - scala.tools.nsc.backend.bcode.DanglingExcHandlers

The GenICode-counterpart to this functionality was added in commit b50a0d8 to fix SI-7006 As for GenBCode, the documentation says it all: /* * Detects and removes unreachable code. * * Should be used last in a transformation chain, before stack map frames are computed. * The Java 6 verifier demands frames be available even for dead code. * Those frames are tricky to compute, http://asm.ow2.org/doc/developer-guide.html#deadcode * The problem is avoided altogether by not emitting unreachable code in the first place.

Code that contains redundant jumps usually arrives to both GenASM and GenBCode, for a variety of reasons. To name a few: - test/files/run/private-inline.scala after -optimize - virtual pattern matcher (for an example see the test in this commit) The backend(s) manage to get rid of the redundant jumps, for example: - GenASM, using collapseJumpOnlyBlocks() - GenBCode, using JumpChainsCollapser The test case in this commit shows how GenBCode reduces the number of GOTOs from 22 down to 7.

An asm.tree.MethodNode includes a list of exception-handler entries, each such entry includes (among the other things) start and end LabelNodes to demarcate the range being protected. - Any executable instruction other than NOP means the exception-handler-entry should stay. - Otherwise, DanglingExcHandlers removes the exception-handler entry in question.

Traversing a ClassDef to build an ASM ClassNode requires typer and therefore must run single-threaded. However, bytecode transformers that manipulate ClassNodes don't need typer and thus can run in parallel. Two other activities that don't depend on typer either are: - encoding ClassNodes into the byte array representation of a classfile - writing classfiles to disk The activities above (building ClassNodes, encoding, disk writing) are performed in different threads (for now, "encoding" is only lightly loaded, but in the future certain optimizations will run there).

retronym · 2013-08-29T14:37:01Z

src/compiler/scala/tools/nsc/backend/jvm/GenBCode.scala

@@ -58,6 +111,15 @@ abstract class GenBCode extends BCodeSyncAndTry {
    override def description = "Generate bytecode from ASTs using the ASM library"
    override def erasedTypes = true

+    // number of woker threads for pipeline-2 (the pipeline in charge of most optimizations except inlining).
+    val MAX_THREADS = scala.math.min(
+      4,


We need a way to configure this. I'd suggest a -Y compiler option. Use case would be to disable parallelism altogether here to work around a bug, or when investigating a bug.

A system property is an alternative, but that is JVM-wide and unsuitable for platforms that run multiple compilers in one JVM (e.g. SBT).

I agree.

Would it also make sense to make ExecutionContext pluggable so people can't control thread pools when running Scala compiler in an embedded context?

magarciaEPFL · 2013-08-29T15:07:18Z

Thanks for the comments so far. I'll give more time for others to chime in, to then add:

a single commit with all documentation and readability improvements (eg removing the _root_ prefixes)
one commit per behavioral change (eg adding -Y option to choose the number of worker threads, from 1 to N, etc.)

magarciaEPFL · 2013-08-30T13:35:28Z

Commits eb48a8c and bdafe6d incorporate review feedback. However, another commit still pending, to add a -Y compiler option to configure the number of worker threads. Coming soon.

gkossakowski · 2013-08-30T21:37:32Z

@magarciaEPFL: this PR looks really good. Thanks for taking time and writing proper unit tests for your ASM transformations! Could you rewrite all of them to be JUnit tests, though?

In 54a7f80 you do test bytecode generated by Scala compiler so you can leave that test as-is.

magarciaEPFL · 2013-08-31T13:18:11Z

@gkossakowski , @retronym A few more commits addressing the pending items:

converting tests to JUnit
compiler option for number of threads

are being worked on, and should be ready in a day or two.

It would be great if this PR were merged as-is so that early adopters can field-test GenBCode with M5 and provide us with much needed feedback. Quite likely the pending improvements will also be ready on time, but this PR is the main thing.

Rest assured that the next commits I'll include (in that case, in a follow-up PR) will address the pending items.

magarciaEPFL · 2013-09-01T21:09:10Z

One of the comments in a previous PR emphasized a distinction between integration and unit tests in the context of ASM transformations. This PR as originally submitted contains both kinds of tests.

The commits just added refactor the units tests into JUnit tests, as requested. The actual work performed by old and new unit tests hasn't changed.

Now that both integration and junit tests have been addressed, it only remains to add a compiler option to configure the number of worker threads in GenBCode.

As already mentioned, this PR is correct, and can be merged. The compiler option in question will be added soon.

gkossakowski · 2013-09-04T00:57:01Z

@magarciaEPFL: we'll need a little bit more polish when it comes to unit tests. I'll try to work on that tonight to demonstrate what I mean by that.

magarciaEPFL · 2013-09-04T06:59:49Z

@gkossakowski , I don't know what kind of polish in unit tests it's so important to show off, all I'm saying is this PR makes the compiler faster. Would have made, had it been merged in M5.

gkossakowski · 2013-09-04T07:58:39Z

@magarciaEPFL: it's not about "showing off" but merging code that is of high quality. All I'm offering is a little guidance on how to write good unit tests.

Also, I'm very much aware of benefits of the new backend. However, that's not what we are discussing it here.

gkossakowski · 2013-09-04T19:53:45Z

@magarciaEPFL: I reworked your tests to become idiomatic JUnit tests.

Here's me showing off:

More seriously, see gkossakowski/scala@3c80c57 and @gkossakowski bcf58bd. Please cherry pick the first commit and squash the second one into your 1aacd32. Once done, push the final result this this PR. I'll review everything today. If there will be no problems I'll merge it today and your PR will make it into M5.

magarciaEPFL · 2013-09-04T21:02:31Z

@gkossakowski

I don't know what I like most,

the animated GIF
seeing a few lines shorter ant test.junit output
or having to clean build/junit before running the updated tests

Anyway, it's late here so all I've managed to do is:

branch from the last commit that remains as is ( magarciaEPFL@bdafe6d )
cherry-pick your commit to upgrade to JUnit 4.11
squash the not-quite-idiomatic and the idiomatic-show-off commits
add the commit to configure the number of threads

The result can be found in branch https://github.com/magarciaEPFL/scala/tree/backendish46 that I'm submitting as PR #2908

magarciaEPFL · 2013-09-04T21:11:55Z

OK, thanks to all reviewers and I'll find out tomorrow what M5 is good for 😄

som-snytt · 2013-09-05T09:40:16Z

I have a commit in a PR somewhere to fix:

having to clean build/junit before running the updated tests

which is totally insanity-inducing.

But do you wish you had to ant partest.task if you happen to touch code under partest-extras?

Monolithicity has its privileges.

But it's all in the right direction. NNE.

gkossakowski · 2013-09-05T21:03:50Z

@magarciaEPFL: In the future please keep amending existing PR instead of opening a new one. It makes it easier to manage things (we have 30 PRs open at the moment) for us and we don't lose the context like existing comments.

All newly opened PRs are automatically assigned to M6 release. I'll reassign your #2908 to M5 manually but please keep that in mind in the future.

magarciaEPFL added 13 commits August 29, 2013 10:30

removing defensive code made obsolete by existing fix to SI-5604

5f1298a

SI-5604 was fixed in commit e6f10b0

BCodeOpt buidling block: JumpChainsCollapser

900afbd

This bytecode transformer rephrases one ore more jumps in a multi-jump chain to target its final destination via a single jump Details in the documentation for scala.tools.nsc.backend.bcode.JumpChainsCollapser

BCodeOpt buidling block: LabelsCleanup

e933462

This bytecode transformer removes those LabelNodes, LocalVarEntries, and LineNumberNodes that are redundant. Details in the documentation for scala.tools.nsc.backend.bcode.LabelsCleanup

BCodeOpt buidling block: DanglingExcHandlers

7cd86e3

This bytecode transformer removes those exception-entries (corresponding to try-catch-blocks) that are redundant. Details in the documentation for scala.tools.nsc.backend.bcode.DanglingExcHandlers

BCodeOpt buidling block: UnreachableCode

f482aa5

This bytecode transformer removes code that is unreachable. Details in the documentation for scala.tools.nsc.backend.bcode.UnreachableCode

pretty-printing of ASM ClassNode, MethodNodes, and instructions as well

b6beaf4

SI-6759 fixed in GenBCode but beware, it remains open for GenASM

7e03e0b

retronym reviewed Aug 29, 2013
View reviewed changes

magarciaEPFL added 3 commits August 30, 2013 12:57

overzealous assert in BCodeBodyBuilder rejected throw null

12a833b

documentation and readability improvements in GenBCode

eb48a8c

field encapsulation in backend.bcode transformers

bdafe6d

ghost assigned gkossakowski Aug 30, 2013

clamor no more: quantum leap from unit tests to junit tests

1aacd32

compiler option: number of threads in GenBCode

43eda91

magarciaEPFL mentioned this pull request Sep 4, 2013

(3 of 2) of the rest of the new bytecode emitter (yes, this PR is Part 3 of 2) #2908

Closed

magarciaEPFL closed this Sep 4, 2013

scabug mentioned this pull request Apr 7, 2017

defensive code in bytecode emitter made redundant by existing fix to SI-5604 scala/bug#7621

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(2 of 2) of the rest of the new bytecode emitter #2889

(2 of 2) of the rest of the new bytecode emitter #2889

magarciaEPFL commented Aug 29, 2013

retronym Aug 29, 2013

gkossakowski Aug 30, 2013

magarciaEPFL commented Aug 29, 2013

magarciaEPFL commented Aug 30, 2013

gkossakowski commented Aug 30, 2013

magarciaEPFL commented Aug 31, 2013

magarciaEPFL commented Sep 1, 2013

gkossakowski commented Sep 4, 2013

magarciaEPFL commented Sep 4, 2013

gkossakowski commented Sep 4, 2013

gkossakowski commented Sep 4, 2013

magarciaEPFL commented Sep 4, 2013

magarciaEPFL commented Sep 4, 2013

som-snytt commented Sep 5, 2013

gkossakowski commented Sep 5, 2013

(2 of 2) of the rest of the new bytecode emitter #2889

(2 of 2) of the rest of the new bytecode emitter #2889

Conversation

magarciaEPFL commented Aug 29, 2013

retronym Aug 29, 2013

Choose a reason for hiding this comment

gkossakowski Aug 30, 2013

Choose a reason for hiding this comment

magarciaEPFL commented Aug 29, 2013

magarciaEPFL commented Aug 30, 2013

gkossakowski commented Aug 30, 2013

magarciaEPFL commented Aug 31, 2013

magarciaEPFL commented Sep 1, 2013

gkossakowski commented Sep 4, 2013

magarciaEPFL commented Sep 4, 2013

gkossakowski commented Sep 4, 2013

gkossakowski commented Sep 4, 2013

magarciaEPFL commented Sep 4, 2013

magarciaEPFL commented Sep 4, 2013

som-snytt commented Sep 5, 2013

gkossakowski commented Sep 5, 2013