JIT: Strengthen type-based optimizations #5664

bjorng · 2022-01-31T05:28:24Z

This pull requests strengthens the type-based optimizations in the JIT that were introduced in #5316.

The compiler has been enhanced to infer ranges for more operations. The ranges are now included in the type information stored in BEAM files.

With the enhanced type information, the JIT can now omit type checks and overflow checks for some arithmetic instructions, and can also omit some redundant tests when constructing binaries using the binary syntax.

github-actions · 2022-01-31T05:29:20Z

CT Test Results

      4 files   358 suites 44m 11s ⏱️
1 962 tests 1 908 ✔️ 54 💤 0 ❌
5 442 runs 5 371 ✔️ 71 💤 0 ❌

Results for commit 2362d1c.

♻️ This comment has been updated with latest results.

To speed up review, make sure that you have read Contributing to Erlang/OTP and that all checks pass.

See the TESTING and DEVELOPMENT HowTo guides for details about how to run test locally.

Artifacts

// Erlang/OTP Github Action Bot

garazdawi

I've not looked closely at the arm parts, but x86 looks good to me.

garazdawi · 2022-01-31T07:47:04Z

erts/emulator/beam/beam_file.c

@@ -607,7 +607,11 @@ static int parse_type_chunk(BeamFile *beam, IFF_Chunk *chunk) {
    beamreader_init(chunk->data, chunk->size, &reader);

    LoadAssert(beamreader_read_i32(&reader, &version));
-    LoadAssert(version == BEAM_TYPES_VERSION);
+    if (version != BEAM_TYPES_VERSION) {


Should we issue a run-time warning when this happens?

Probably not by default. It could get very annoying for systems that intentionally mix BEAM files compiled by different versions of the compiler. But there should be some way or some tool so that one could get aware that type-based optimizations are no longer active.

erts/emulator/beam/jit/arm/beam_asm.hpp

garazdawi · 2022-01-31T08:27:29Z

erts/emulator/beam/jit/x86/instr_arith.cpp

+        if (need_div) {
+            a.sal(x86::rax, imm(_TAG_IMMED1_SIZE));
+        }
+
+        if (need_rem) {
+            a.sal(x86::rdx, imm(_TAG_IMMED1_SIZE));
+        }
+
+        if (need_div) {
+            a.or_(x86::rax, imm(_TAG_IMMED1_SMALL));
+        }
+
+        if (need_rem) {
+            a.or_(x86::rdx, imm(_TAG_IMMED1_SMALL));
+        }


Why 4 ifs and not just 2?

I have kept the original order of the instructions, thinking that perhaps it could enable more instruction-level parallelism. Not sure whether it makes any difference, or whether the CPU itself can do the same reordering.

GCC and Clang seem to think that it does matter, so let us leave it like this.

michalmuskala · 2022-01-31T10:12:50Z

erts/emulator/beam/jit/arm/ops.tab

-      i_rem_div Fail Live LHS RHS Remainder Quotient
+gc_bif2 Fail Live u$bif:erlang:rem/2 LHS1 RHS1 Remainder | \
+    gc_bif2 A B u$bif:erlang:intdiv/2 LHS2 RHS2 Quotient | \
+    equal(LHS1, LHS2) | \


Given the changes to the types, would it make sense to not allow matching on equal arguments by repeating the pattern and always require explicit equal calls? This seems like something that would be easy to get wrong and might result in considerable performance issues

Yes, I have also had the same thought. It is a probably a good idea.

This is necessary if type information is added to instructions that have `list` operands (such as `bs_create_bin`).

The introduction of types broke fusing of `div` and `rem`.

Add ranges to the type for functions that returns a size for an Erlang term. Since any term is limited by the size of memory, it is possible to define an upper limit that can never be reached in practice in the foreseeable future. Also improve propagation of ranges through some more operations.

For expressions such as: Unknown band 42 the type would conservatively be assumed to be any integer. Be less conservative and assume that the type is an integer in the range 0 .. 42. That will give more opportunities for the JIT to eliminate type tests. Note that for: Unknown band 0 we will still infer the type to be any integer instead of the integer 0. The reason is that code such as: foo(Unknown) -> Unknown band 0. would be rewritten to: foo(Unknown) -> Unknown band 0, 0. if we were to assume that `Unknown band 0` is 0. This is a pessimization.

Find the type of a call to `erlang/2` where a literal tuple is used as a lookup table. Here is a truncated example from the `base64` module: element(X+1, {$A, $B, $C, $D, $E, $F, ...}) The type will be the join of the types of the elements.

Having type information for the `bs_create_bin` instruction will allow to omit some type checks.

Version 1 of the type information also include ranges for integers.

Test that the JIT does not omit the overflow check when it would be unsafe.

bjorng added team:VM Assigned to OTP team VM enhancement testing currently being tested, tag is used by OTP internal CI labels Jan 31, 2022

bjorng requested review from garazdawi and jhogberg January 31, 2022 05:28

bjorng self-assigned this Jan 31, 2022

garazdawi approved these changes Jan 31, 2022

View reviewed changes

michalmuskala reviewed Jan 31, 2022

View reviewed changes

bjorng added 8 commits January 31, 2022 14:15

emu: Mask type information from X and Y registers in lists

fc46785

This is necessary if type information is added to instructions that have `list` operands (such as `bs_create_bin`).

ops.tab: Mend fusing of div and rem operators

25f4cec

The introduction of types broke fusing of `div` and `rem`.

beam_call_types: Teach will_succeed/3 to handle bit operations

8320b5a

Annotate the bs_create_bin instruction with types

46aa327

Having type information for the `bs_create_bin` instruction will allow to omit some type checks.

Introduce a new version of type information

6b0d1e4

Version 1 of the type information also include ranges for integers.

bjorng force-pushed the bjorn/jit/type-based-optimizations branch 2 times, most recently from ac96abc to b054a03 Compare January 31, 2022 15:15

bjorng added 3 commits February 1, 2022 08:28

small_SUITE: Add test of multiplication

2e4c32b

Test that the JIT does not omit the overflow check when it would be unsafe.

x86 JIT: Optimize based on range information

29862e1

aarch64 JIT: Optimize based on range information

d6907be

bjorng force-pushed the bjorn/jit/type-based-optimizations branch 2 times, most recently from 2362d1c to d6907be Compare February 2, 2022 09:51

bjorng merged commit 9032dc5 into erlang:master Feb 2, 2022

bjorng deleted the bjorn/jit/type-based-optimizations branch February 2, 2022 09:51

bjorng mentioned this pull request Feb 7, 2022

Further strengthen the type-based optimizations #5688

Merged

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: Strengthen type-based optimizations #5664

JIT: Strengthen type-based optimizations #5664

bjorng commented Jan 31, 2022

github-actions bot commented Jan 31, 2022 •

edited

Loading

garazdawi left a comment

garazdawi Jan 31, 2022

bjorng Jan 31, 2022

garazdawi Jan 31, 2022

bjorng Jan 31, 2022

garazdawi Feb 1, 2022

michalmuskala Jan 31, 2022

bjorng Jan 31, 2022

JIT: Strengthen type-based optimizations #5664

JIT: Strengthen type-based optimizations #5664

Conversation

bjorng commented Jan 31, 2022

github-actions bot commented Jan 31, 2022 • edited Loading

CT Test Results

Artifacts

garazdawi left a comment

Choose a reason for hiding this comment

garazdawi Jan 31, 2022

Choose a reason for hiding this comment

bjorng Jan 31, 2022

Choose a reason for hiding this comment

garazdawi Jan 31, 2022

Choose a reason for hiding this comment

bjorng Jan 31, 2022

Choose a reason for hiding this comment

garazdawi Feb 1, 2022

Choose a reason for hiding this comment

michalmuskala Jan 31, 2022

Choose a reason for hiding this comment

bjorng Jan 31, 2022

Choose a reason for hiding this comment

github-actions bot commented Jan 31, 2022 •

edited

Loading