Implement v128 instructions #508

dcodeIO · 2019-02-22T17:02:43Z

Edit: This escalated and is complete now.

dcodeIO · 2019-02-22T19:20:56Z

Hmm, it might actually be easier to defer i64x2.splat -> v128.splat<i64> in order to share most of the codegen here, quite similar to what we do with i32.clz -> clz<i32> etc. Suggesting v128.* over simd.* because there might be other types at some point potentially.
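
To illustrate the deferral idea, here is a minimal TypeScript sketch (not the code from this PR). The names `SHAPE_TO_LANE_TYPE` and `deferTyped` are hypothetical; the sketch only shows how a typed name like `i64x2.splat` could be rewritten to a generic `v128.splat` plus a type argument, so that one code path handles every shape.

```typescript
// Hypothetical helper, for illustration only: maps a typed SIMD builtin name
// to a generic v128 builtin name plus a lane type argument.
interface DeferredCall {
  generic: string; // e.g. "v128.splat"
  typeArg: string; // e.g. "i64"
}

// Shape prefix -> lane type used as the type argument.
const SHAPE_TO_LANE_TYPE: { [shape: string]: string } = {
  i8x16: "i8",
  i16x8: "i16",
  i32x4: "i32",
  i64x2: "i64",
  f32x4: "f32",
  f64x2: "f64",
};

function deferTyped(name: string): DeferredCall | null {
  const dot = name.indexOf(".");
  if (dot < 0) return null;
  const shape = name.substring(0, dot);
  const op = name.substring(dot + 1);
  // Only known shape prefixes are deferred; everything else is left alone.
  if (!Object.prototype.hasOwnProperty.call(SHAPE_TO_LANE_TYPE, shape)) return null;
  return { generic: "v128." + op, typeArg: SHAPE_TO_LANE_TYPE[shape] };
}
```

With this, `deferTyped("i64x2.splat")` yields the pair `v128.splat` / `i64`, while a name without a known shape prefix yields `null`.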

MaxGraey · 2019-02-22T19:23:04Z

I'm not sure this convention applies to the rest of the operations.

dcodeIO · 2019-02-22T19:37:46Z

From the proposal it seems that the name of the operation, i.e. add, plus a single type parameter, i.e. <i64> (... i32, i16, i8, f32, f64) is sufficient to represent which instruction to pick when bound to the underlying vector type (here v128). One v128 split into i64s gives us 2 lanes etc. Correct me if I'm wrong :)
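
The relationship described here can be sketched in TypeScript as follows. This is an illustration under assumptions, not the compiler's code: `LANE_BYTES`, `laneCount` and `instructionName` are made-up names, and the sketch does not check whether the proposal actually defines the resulting instruction for a given type.

```typescript
// A v128 is 16 bytes wide; the lane count follows from the element size.
const V128_BYTES = 16;

// Byte size of each element type mentioned in the comment above.
const LANE_BYTES = { i8: 1, i16: 2, i32: 4, i64: 8, f32: 4, f64: 8 } as const;
type ElementType = keyof typeof LANE_BYTES;

// Number of lanes a v128 splits into for the given element type.
function laneCount(t: ElementType): number {
  return V128_BYTES / LANE_BYTES[t];
}

// Operation name plus type parameter -> shape-qualified instruction name.
function instructionName(op: string, t: ElementType): string {
  return `${t}x${laneCount(t)}.${op}`;
}
```

For example, `instructionName("add", "i64")` gives `i64x2.add`, matching the "one v128 split into i64s gives us 2 lanes" reading.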

MaxGraey · 2019-02-22T19:45:08Z

Yeah, it seems this is correct for fixed-length SIMD operations)

MaxGraey · 2019-02-23T08:05:24Z

@dcodeIO the lane shuffle operation might also be non-trivial for a "good first issue". Wdyt?

dcodeIO · 2019-02-23T08:07:50Z

Looks like that's essentially just a v128,v128->v128 binary plus an additional lane index like in extract_lane/replace_lane. Should be straightforward :)

MaxGraey · 2019-02-23T08:14:20Z

Nope) It's v128,v128,imm[16]->v128. This instruction is encoded with 16 bytes providing the indices of the elements to return. The indices i in range [0, 15] select the i-th element of a. The indices in range [16, 31] select the (i - 16)-th element of b. imm is an array of constants, and the compiler should check this.
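
The semantics described above can be modelled in plain TypeScript on byte arrays. This is only a reference sketch of the behaviour, assuming the index rules exactly as stated in this comment; `shuffle8x16` is a hypothetical name, and the range check stands in for the compile-time check on the immediates.

```typescript
// Reference model of v8x16.shuffle: a and b are 16-byte vectors, imm holds
// 16 constant indices in [0, 31]. Indices 0..15 pick from a, 16..31 from b.
function shuffle8x16(a: Uint8Array, b: Uint8Array, imm: readonly number[]): Uint8Array {
  if (a.length !== 16 || b.length !== 16) {
    throw new RangeError("operands must be 16 bytes each");
  }
  if (imm.length !== 16) {
    throw new RangeError("expected exactly 16 lane indices");
  }
  const out = new Uint8Array(16);
  for (let k = 0; k < 16; k++) {
    const i = imm[k];
    // Every index must be an integer constant in [0, 31].
    if (Math.floor(i) !== i || i < 0 || i > 31) {
      throw new RangeError("lane index out of range: " + i);
    }
    out[k] = i < 16 ? a[i] : b[i - 16];
  }
  return out;
}
```

Passing the indices `0, 16, 1, 17, ...` interleaves the low bytes of `a` and `b`; an index of 32 or more is rejected.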

dcodeIO · 2019-02-23T08:25:50Z

I understand. Well, actually I don't, so I guess you're right :)

dcodeIO · 2019-02-23T09:38:47Z

Regarding shuffle it might also be convenient to provide typed versions like i64x2.shuffle as well (even though these are not in the spec), so v8x16.shuffle would essentially become an alias of i8x16.shuffle which itself would become a deferred call to v128.shuffle<T>. Wdyt?
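
One way such typed versions could be lowered is by expanding wider lane indices into the 16 byte indices that v8x16.shuffle expects. The sketch below is an assumption about how that might look, not what the PR does; `expandLaneIndices` is a hypothetical helper.

```typescript
// Expands lane indices for a typed shuffle (e.g. 2 indices for i64x2) into the
// 16 byte indices of the underlying byte shuffle. Lane indices below the lane
// count select from the first operand, the rest from the second operand.
function expandLaneIndices(laneBytes: number, lanes: readonly number[]): number[] {
  if (laneBytes !== 1 && laneBytes !== 2 && laneBytes !== 4 && laneBytes !== 8) {
    throw new RangeError("unsupported lane size: " + laneBytes);
  }
  const laneCount = 16 / laneBytes;
  if (lanes.length !== laneCount) {
    throw new RangeError("expected " + laneCount + " lane indices");
  }
  const bytes: number[] = [];
  for (const lane of lanes) {
    if (Math.floor(lane) !== lane || lane < 0 || lane >= 2 * laneCount) {
      throw new RangeError("lane index out of range: " + lane);
    }
    // Each selected lane contributes laneBytes consecutive byte indices.
    for (let j = 0; j < laneBytes; j++) {
      bytes.push(lane * laneBytes + j);
    }
  }
  return bytes;
}
```

For an i64x2-style shuffle, `expandLaneIndices(8, [1, 2])` produces the byte indices 8..15 followed by 16..23, i.e. the high lane of the first operand and the low lane of the second.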

MaxGraey · 2019-02-25T19:21:42Z

What do you think about adding an optional align attribute to all load and store builtins? I guess it may have some meaning for SIMD loads/stores, e.g. translating to the movaps SSE instruction when align == 16 instead of movups, which could lead to a significant performance difference. Probably just adding a boolean param would be enough: load<T>(ptr: usize, offset: i32 = 0, aligned: bool = false).

dcodeIO · 2019-02-26T09:11:03Z

For context: Currently all load/store ops use natural alignment, in the case of v128 that's 16 bytes. Natural alignment isn't displayed in text format iiuc. I must admit though that I don't fully understand the full implications of the alignment hint, and it appears to me to become relevant only when actually diverging from natural alignment?

dcodeIO · 2019-02-26T11:10:50Z

One strange thing about the alignment hint is that the spec speaks of "an alignment hint (in base 2 logarithmic representation)", while Binaryen's CreateLoad takes the size in bytes (which is fine) but the text format also outputs the size in bytes. Is this expected? Asking because I'm not sure anymore whether the new constantAlign parameter on i32.load etc., which is modeled after the text format, should take log2 or size.

dcodeIO · 2019-02-26T16:49:11Z

Oh, well .. Downloading and installing node v12.0.0-v8-canary201902266c298fffb9...

MaxGraey · 2019-02-26T16:49:31Z

Yeah, it's alive)

MaxGraey · 2019-02-26T19:48:52Z

It seems you need to rebuild the tests.

dcodeIO · 2019-02-26T19:54:53Z

10/10 would do if node/v8-canary would work on Windows :)

kripken · 2019-02-26T21:56:43Z

(Yeah, I think the binary format stores the exponent of 2, while the text format is in bytes.)
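
For reference, the two representations discussed above convert into each other as in the following TypeScript sketch. The function names are hypothetical, and the sketch assumes the reading given in this thread: an exponent of 2 in the binary, a byte count in the text format, and a hint that does not exceed the natural alignment of the access.

```typescript
// Byte alignment (text-format style, e.g. 16) -> log2 exponent (binary style, e.g. 4).
function alignBytesToLog2(bytes: number): number {
  // Must be a positive integer power of two.
  if (Math.floor(bytes) !== bytes || bytes <= 0 || (bytes & (bytes - 1)) !== 0) {
    throw new RangeError("alignment must be a power of two: " + bytes);
  }
  let log2 = 0;
  while ((1 << log2) < bytes) log2++;
  return log2;
}

// log2 exponent -> byte alignment.
function alignLog2ToBytes(log2: number): number {
  return 1 << log2;
}

// An alignment hint is usable if it is a power of two not larger than the
// natural alignment of the access (16 bytes for a v128 load/store).
function isValidAlignHint(bytes: number, naturalBytes: number): boolean {
  return bytes > 0 && Math.floor(bytes) === bytes &&
    (bytes & (bytes - 1)) === 0 && bytes <= naturalBytes;
}
```

So a v128 access at natural alignment is 16 in byte terms and 4 as an exponent, and an i32 access is 4 and 2 respectively.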

dcodeIO · 2019-02-27T11:15:58Z

Alright, the set of instructions should be complete now~~, except no real docs on definitions and incomplete tests~~. Also fixed a few definition hiccups I came across.

Whoever feels like giving this a review, please do. I know, I know, it's boring to the max.

Implement v128 initializers / exemplary add

d981b7e

dcodeIO added 2 commits February 23, 2019 04:59

defer to generic v128, implement extract_lane/replace_lane

5689b69

reuse write helpers, validate lane index

c83ce39

dcodeIO added the good first issue label Feb 23, 2019

dcodeIO added 3 commits February 23, 2019 09:40

add simd shuffle, bitselect and shift bindings

62ba17b

implement v8x16.shuffle

7c9fcf0

update actual fixture

3c6392e

fix currentType mismatch

7728857

dcodeIO changed the title ~~Implement v128 initializers / exemplary add~~ Implement initial v128 instructions Feb 23, 2019

dcodeIO added 2 commits February 23, 2019 11:03

add zero-cost abstraction experiment

74e5cc2

fix non-functional test

1884c72

dcodeIO mentioned this pull request Feb 23, 2019

Implement more SIMD instructions #510

Closed

dcodeIO removed the good first issue label Feb 25, 2019

dcodeIO added 4 commits February 25, 2019 13:11

generic shuffle

547eb6b

fix i64x2.add, accept isize/usize

dc81231

load/store, sub, mul, div

7893474

fix test

56a221e

neg, add/sub_saturate, shl, shr, and, or, xor, not, bitselect, any/all_true, min/max, initial align parameter

4bb239f

or if..

afcf625

dcodeIO added 2 commits February 26, 2019 20:31

try this

0f82469

enable others

aba5c1f

dcodeIO added 4 commits February 26, 2019 21:01

maybe just a fixture even though I can't run it locally

b6c8a75

let's test things

4e198dd

fixture

674e1a2

design some subtypes

a32366c

dcodeIO added 4 commits February 27, 2019 09:21

actual test assertions

121df6e

instructions for days

74d5b2f

instructions for weeks

21590c5

definitions for months

13549b0

dcodeIO changed the title ~~Implement initial v128 instructions~~ Implement v128 instructions Feb 27, 2019

dcodeIO added 10 commits February 27, 2019 13:43

universal field alignment, see #518

8b5bfd9

where did that come from..

ecd538f

progress

29218a4

progress

58e50c5

progress

ea84dfb

definitions for years

9f48ebe

Merge branch 'master' into simd-init

e15b539

remaining tests

466b7e2

revert subtypes

46c4fcd

apply new load/store immAlign argument

1a9c4d1

dcodeIO merged commit e1f1a3b into master Feb 28, 2019

dcodeIO deleted the simd-init branch March 7, 2019 23:48
