
Fix <op>= RHS/LHS eval order #992

Merged
merged 5 commits into master from fix-op-assign-eval-order on Oct 5, 2016

Conversation

@svaarala (Owner) commented Oct 4, 2016

In x <op>= y the value of x (the LHS) should be evaluated before the RHS. This sometimes matters when chained <op>= expressions are used; see #987 (comment).
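For illustration (a hypothetical snippet, not taken from the referenced issue), the order is observable whenever the RHS reassigns the LHS variable:

var x = 1;
x += (x = 3);
// The LHS value (1) must be read before the RHS assignment runs,
// so x becomes 1 + 3 = 4, not 3 + 3 = 6.
print(x);  // 4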

Tasks:

  • Add bug testcase
  • Extend bug testcase to cover all LHS cases (reg-bound variable, slow path variable, property, etc)
  • Slow but correct fix, check bytecode
  • Optimize for common cases if possible
  • Fix backtracking issue if temp reg load is shuffled
  • Run shuffle torture test
  • Releases entry

@svaarala added the bug label Oct 4, 2016
@svaarala added this to the v2.0.0 milestone Oct 4, 2016
@svaarala mentioned this pull request Oct 4, 2016
@svaarala (Owner, Author) commented Oct 4, 2016

I fixed a related bug concerning <op>= evaluation earlier. The basic difficulty is that with a simple implementation x += 4 generates:

LDREG temp, x
ADD x, temp, 4

Because the RHS, 4, is side effect free in this case, the preferred output would be:

ADD x, x, 4

The existing (and incorrect) fix in the current code base is that for top level expressions the RHS is assumed not to have side effects on the LHS binding. I mistakenly assumed the LHS would be evaluated when the operator is applied, but the LHS is actually conceptually evaluated before the RHS is ever looked at, and that looked-up value is used for the final operation. With the mistaken assumption the top level shortcut would be safe.

But with the correct evaluation order, a top level <op>= is not safe to optimize unless one is certain the RHS is side effect free. There's unfortunately no easy way to do that. I'll have to see if I can somehow manage it at least for plain constants, because generating that unnecessary load wouldn't be nice.
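For example (an illustrative case, not the original report), the top level shortcut would compute the wrong value whenever the RHS mutates the LHS variable:

var x = 1;
function bump() { x = 10; return 5; }
x += bump();
// Correct result: the old x (1) is read before the RHS, so x becomes 1 + 5 = 6.
// With the unsafe ADD x, x, <rhs> shortcut the ADD would read the already
// updated x (10) and produce 15.
print(x);  // 6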

The peephole optimizer doesn't have enough state right now to optimize the sequence later: JUMPs crossing the optimized instructions would need to be fixed up too.

@svaarala (Owner, Author) commented Oct 4, 2016

Hmm, I guess one relatively simple approach would be:

  • Emit the temporary load for the LHS.
  • Evaluate the RHS to an ivalue. No code is generated for e.g. a constant, the ivalue just represents the constant.
  • Coerce the ivalue to a simple value, i.e. a constant or register. This resolves ivalues like 1 + 2 or 1 + x.
  • Check if any code has been emitted since emitting the temporary load for LHS. If not, remove the temporary load and assume the RHS is side effect free.

In other words, if the RHS is in a plain constant/register form, and no code has been emitted, there cannot be side effects in the RHS and we can optimize away the temporary. The RHS can be a constant or anything that constant folds without emitting code and the optimization would kick in.
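As a rough self-contained sketch of that check (a toy simulation only; all names and data structures are made up for illustration and are not Duktape's actual compiler internals):

var code = [];                          // emitted "bytecode"
function emit(ins) { code.push(ins); }

function compileOpAssign(op, lhsReg, compileRhs) {
    emit([ 'LDREG', 'temp', lhsReg ]);  // 1. temporary load for the LHS
    var mark = code.length;             //    remember how much code exists now
    var rhs = compileRhs();             // 2.-3. evaluate RHS to a plain constant/register
    if (code.length === mark) {
        // 4. nothing was emitted while evaluating the RHS, so it cannot have
        //    side effects; drop the temporary load and use the LHS register directly
        code.pop();
        emit([ op, lhsReg, lhsReg, rhs ]);
    } else {
        emit([ op, lhsReg, 'temp', rhs ]);
    }
}

// x += 10: RHS is a constant, no code emitted -> single ADD
compileOpAssign('ADD', 'x', function () { return 10; });
// x += f(): RHS needs a CALL, so the temporary load must be kept
compileOpAssign('ADD', 'x', function () { emit([ 'CALL', 'temp2', 'f' ]); return 'temp2'; });
print(JSON.stringify(code));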

@fatcerberus (Contributor)

Sounds good. It would be a shame to lose a performance optimization for something as common as assignment.

@svaarala (Owner, Author) commented Oct 4, 2016

Actually, looking at the current code and comments, the case being optimized in the current implementation is the resulting ivalue for an x <op>= y expression. Conceptually the ivalue result must NOT be the x register binding but a fresh temporary, because the x register value may change if the <op>= expression is inside an outer expression which further mutates x.

For that case the top level assumption seems safe, i.e. if the assignment is at the top level, there's no outer expression to deal with and a temporary is unnecessary.

The issue in this pull is actually different and is related to <op>= expressions only, while the result ivalue issue also affects plain assignment.

@svaarala (Owner, Author) commented Oct 4, 2016

This will involve at least a few cups of coffee to keep the optimized form while fixing the bug. I'll be back later :)

@svaarala force-pushed the fix-op-assign-eval-order branch 3 times, most recently from f2ccd99 to aead63b on October 4, 2016 20:54
@svaarala (Owner, Author) commented Oct 4, 2016

Ok, I think I've got the fix, and the optimization approach seems to be working. In particular, each of these comes out as a single ADD opcode:

var x = 10, y = 20;

x += 10;  // constant
x += 10 * 400;  // constant folded into a constant
x += y;  // register bound variable, no code is emitted for evaluation so no side effects
x += 'foo' + 'bar' + 'quux';  // constant folded into a constant

But for example this involves an unnecessary temporary:

var x = 10, y = 20;

x += y + 1;  // computation would be safe because y is register bound and side effect free

@fatcerberus (Contributor)

I guess that's fair; detecting the second case reliably (in particular, without false positives) would presumably require an IR.

@fatcerberus (Contributor)

Although... isn't the entire x += y + 1 expression available as an ivalue while parsing it? So it should be possible, assuming there are no function calls or similar involved, to look at the expression and see if there are any side effects, no?

@svaarala (Owner, Author) commented Oct 4, 2016

Since there's no IR, that expression basically looks like x += <something> when parsing the += expression, and like y + 1 when parsing the RHS. There isn't a point where the whole expression is visible to the compiler.

What would be possible is tracking some sort of flags / state as ivalues are combined (for example, whether they are side effect free) and then checking those flags once the RHS has been evaluated.

@svaarala (Owner, Author) commented Oct 4, 2016

Note that ivalues are either plain values or binary operations on two plain values. They are in effect miniature IR trees which have a fixed size, and as the compiler proceeds it "collapses" the hypothetical IR to the immediate ivalues at hand. An ivalue never references another ivalue - that would actually be an expression tree. The compiler document link I pointed to provides a few very concrete examples of this.
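To make that concrete, a purely illustrative sketch (the field names are invented and are not Duktape's actual duk_ivalue layout):

// A "plain" ivalue: just a value spec (constant or register), e.g. the constant 1.
var plain = { type: 'plain', x: { kind: 'const', idx: 0 } };

// An "arith" ivalue: a binary operation on two plain value specs, e.g. 1 + x
// where x is bound to register 3.
var arith = {
    type: 'arith',
    op: 'ADD',
    x: { kind: 'const', idx: 0 },   // the constant 1
    y: { kind: 'reg', idx: 3 }      // register holding x
};

// Neither field can hold another ivalue; to combine (1 + x) with something else
// the compiler first "collapses" arith by emitting an ADD into a temporary
// register and then continues with a plain ivalue referencing that temporary.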

@svaarala (Owner, Author) commented Oct 4, 2016

test-assign-add.js                  : duk.O2  3.73 duk.O2.master  3.80 duk.O2.150  5.29
test-assign-addto-nan.js            : duk.O2  1.13 duk.O2.master  1.15 duk.O2.150  1.57
test-assign-addto.js                : duk.O2  3.72 duk.O2.master  3.71 duk.O2.150  5.26
test-assign-boolean.js              : duk.O2  4.74 duk.O2.master  4.74 duk.O2.150  4.82
test-assign-const-int.js            : duk.O2  2.45 duk.O2.master  2.45 duk.O2.150  2.59
test-assign-const-int2.js           : duk.O2  4.68 duk.O2.master  4.67 duk.O2.150  9.07
test-assign-const.js                : duk.O2  3.64 duk.O2.master  3.64 duk.O2.150  4.26
test-assign-literal.js              : duk.O2  3.78 duk.O2.master  3.78 duk.O2.150  4.23
test-assign-proplhs-reg.js          : duk.O2  3.38 duk.O2.master  3.39 duk.O2.150  3.71
test-assign-proprhs.js              : duk.O2  3.66 duk.O2.master  3.64 duk.O2.150  4.19
test-assign-reg.js                  : duk.O2  2.79 duk.O2.master  2.79 duk.O2.150  2.82

@fatcerberus (Contributor)

I see, so it's something like recursive descent but at the expression level. That explains why optimizations are so difficult, then.

@svaarala (Owner, Author) commented Oct 4, 2016

Well, no, it's recursive descent at the statement level, but top-down operator parsing with ivalue-based code emission for expressions. The compiler.rst document provides a summary of this and links to the top-down operator parsing paper explaining the technique.

@svaarala (Owner, Author) commented Oct 4, 2016

Top-down operator parsing could also be used to construct a traditional IR and just compile from there. The ivalue based approach is a memory saving technique which essentially works with a hypothetical IR tree on-the-fly, with a tiny "window" into the IR tree represented by the most immediate ivalues at hand. Ivalues are then combined as we go on, each such combination triggering code emission, temporary register allocation, constant allocation, etc.

@fatcerberus (Contributor)

I'll give compiler.rst a good read when I have some more free time. Now that I've played with the compiler code I'm kind of intrigued. :-)

@svaarala (Owner, Author) commented Oct 4, 2016

The reason the compiler uses that technique, by the way, is that I originally targeted just ES5 and was looking for the most memory efficient parser that was still capable of simple constant folding and such. Top-down operator parsing turned out to be a handy approach for allocating temporaries in the right order, and the ivalue technique allows expressions to be parsed without needing to decide beforehand whether the expression will ultimately be an RHS or an LHS -- that decision can be made in the final step, when e.g. a property access ivalue hasn't yet been forced into a GETPROP or a PUTPROP, which in effect decides its RHS/LHS role.

For ES6 it's still going to be a challenge to remain memory efficient for the low memory targets. So I'm basically trying to look for a solution with some of these characteristics:

  • Allow enough IR to parse destructuring and other ES6 constructs.
  • Scale down to low memory targets and ES5 parsing. Not necessarily as memory efficient as the current compiler but should be quite close.
  • Scale up to ES6 parsing, and allow optional optimization modules to be tacked on. This should be much more modular than the current compiler logic, which is (quite intentionally) very tightly coupled. For example, tree walks doing constant folding, inlining known-to-be-safe constants, etc.

Overall ES6 will probably require at least a statement level IR, i.e. an IR tree constructed for each statement and then thrown away. Multiple passes would then be needed for hoisting variable declarations and such. Single pass parsing with a full function IR is going to be a low memory challenge, but would otherwise make things much simpler. So there are some trade-offs involved.

What I don't want to do is choose a new structure which is low memory hostile and then try to somehow make that work well for low memory targets. While premature optimization is not usually a good idea, choosing a structure which actively works against that is also very counterproductive. So ideally both low memory issues and compilation quality issues (optimizations) would be addressed simultaneously.

@svaarala (Owner, Author) commented Oct 4, 2016

One viable but boring option would be to leave low memory targets at ES5 with the current compiler (maintaining it as necessary for opcode format changes) but develop an ES6 compiler for non-low-memory targets. The ES6 compiler wouldn't then need to be as memory conservative which would make its structure easier to maintain. But overall maintaining two compilers would be awkward, and eventually they would have conflicting interests with respect to what bytecode and various other internals should look like.

So, I'm very much trying to avoid this outcome ... :)

@svaarala (Owner, Author) commented Oct 5, 2016

> Now that I've played with the compiler code I'm kind of intrigued. :-)

Compilers are one of the most interesting parts in my opinion. Low memory scalable compilers especially :)

The current compiler was written with a lot of trial-and-error (I think I rewrote it at least 2 times) and I didn't have many metrics back then. So with better metrics for footprint, performance, etc, it'll be much nicer to rewrite the compiler.

Hopefully there'd be a solid month or two to work on it soon :)

Optimization to avoid a temporary for x <op>= y works for any RHS
which doesn't emit code when evaluated to an ivalue, e.g.:

* A plain constant or any expression which constant folds to a
  constant, e.g.: x += 4 and x += 'foo' + 'bar'.

* A register-bound variable, e.g. x += y.

The optimization doesn't have enough state to detect safe cases
such as register bound 'y' in: x += y + 1.
* A few RegExp issues have been resolved via ES6 RegExp syntax.
@svaarala merged commit 740fc30 into master Oct 5, 2016
@svaarala deleted the fix-op-assign-eval-order branch October 5, 2016 02:01
@fatcerberus (Contributor)

I thought this might make multiple chained compound assignments to the same variable idempotent, but it turns out that's not the case, either in Duktape or other engines. Huh.

In any case the behavior seems to be correct now. Thanks for the quick fix. :-)

@svaarala (Owner, Author) commented Oct 5, 2016

What kind of compound assignment do you mean?

@fatcerberus (Contributor)

By "compound assignment" I was referring to the X <op>= Y construct, as opposed to "simple assignment", X = Y. Those are the terms I usually see for the two operations.

Basically I assumed that x += x += x += ... += 1 was idempotent regardless of how many += you chain, because I mistakenly assumed x wouldn't be updated at all until the semicolon. What actually happens is that it gets updated at each step.

@svaarala (Owner, Author) commented Oct 5, 2016

Sorry, my question was ambiguous; yes, I meant what kind of concrete idempotent expression you were thinking about. Yes, x += x += 1 updates x on each step, but on each step the LHS is evaluated first.

The semantics are not immediately obvious (and for me it'd be more natural if the LHS were evaluated after the RHS had potentially updated the LHS variable). Incidentally, the test262 test suite doesn't cover this.
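A concrete walkthrough of the chained case (working through the steps by hand):

var x = 1;
x += x += 1;
// Outer +=: the LHS value of x (1) is read first.
// Inner +=: reads x (1), adds 1, stores 2 into x, and yields 2.
// Outer +=: 1 (the previously read LHS value) + 2 = 3 is stored into x.
print(x);  // 3
// If the LHS were instead read after the RHS, the result would be 2 + 2 = 4.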

@fatcerberus (Contributor)

This seems to confuse everyone, not just us:
http://stackoverflow.com/questions/13106754/chaining-compound-assignment-operators-in-javascript

:)

@fatcerberus (Contributor)

Hm... so I was looking through the changelogs for old Duktape versions to get a sense of how far it's evolved since I started work on minisphere, and found that a very similar bug was already fixed in v1.1.2, #118 (which it seems got opened as a result of my post on SphereDev, go figure :). How come that fix didn't automatically fix this bug too?

@svaarala (Owner, Author)

That's the bug I was referring to above (the one I fixed earlier). It's a different case: in this pull, the issue is about evaluating the LHS before the RHS for the OP in <OP>=, while #118 was about where the result of the RHS evaluation is stored before continuing evaluation, especially for a sequence of assignments.
