lane mutators #110

johnmccutchan · 2015-01-22T17:48:57Z

Instead of withX, withY how about something like replaceLane(int lane, T value)?

johnmccutchan · 2015-01-22T17:49:04Z

@PeterJensen @sunfishcode

sunfishcode · 2015-01-22T18:00:24Z

That covers the non-constant index case of LLVM insertelement, which is nice to have.

On the other hand, what platforms support a non-constant index without using memory operations?

PeterJensen · 2015-01-22T18:08:27Z

A slightly better name would be insertLane(), since the instruction is insertps.

My main concern here is that the insertps/extractps instructions require that lane indicators are constants. We impose this limitation implicitly with the with* operations. With this suggestion we'll need to check that 'lane' is a constant or alternatively allow non-constant 'lane' values, which would complicate the code generation. Another concern is the duality of lane indicators, using both x/y/z/w and integers. I much prefer the letters, because use of integers tends to cause big/little endian portability issues.

johnmccutchan · 2015-01-22T18:44:22Z

Using lane indexes will make us consistent with shuffle / swizzle methods which already use indices.

It will be easy for runtimes to fast path constant lane index and fall back to a slow path if the index isn't constant.

sunfishcode · 2015-01-22T19:16:40Z

All systems I'm aware of, including big-endian, store whatever they call SIMD lane 0 at offset 0 in memory, so I think we're safe from endianness issues here.

I agree that insertLane is a little nicer name than replaceLane.

Emscripten already has code to lower variable-index insertelement/extractelement into memory operations, which is pretty close to what JITs will lower variable-index replaceLane/insertLane into on all major platforms today anyway AFAIK, and it doesn't come up all that often in use cases I'm looking at, so I don't feel strongly either way about this feature.

Speaking of extractelement, if we add replaceLane/insertLane, should we add a variable-index extractLane too, for symmetry?

PeterJensen · 2015-01-22T19:24:44Z

There was a good talk at the LLVM dev meeting about big/little endian SIMD issues. It seems that Power looks at it the other way around: http://llvm.org/devmtg/2014-10/Slides/Schmidt-SupportingVectorProgramming.pdf

sunfishcode · 2015-01-22T20:14:11Z

Oh, I was unaware of that. On big-endian Power, LLVM already presents little-endian element ordering (what the presentation might call Little-on-Big) to its users, but I didn't realize that's not what the actual hardware does.

The presentation observes that "The True-LE model is appropriate and useful to most consumers", and I believe that that's what makes the most sense for SIMD.js too.

sunfishcode · 2015-04-09T15:47:42Z

Given that we already now use indices for shuffle/swizzle, and if we use indices here, the only things left using "xyzw" notation are the element getters and {load,store}{X,XY,XYZ}. I think it makes sense to make those consistent too. And I encourage the names "insert" and "extract", following LLVM's use of those names. So, I propose we:

replace the lane mutators with SIMD.<type>.insertLane(index, value)
replace the lane getters with SIMD.<type>.extractLane(index)
rename {load,store}{X,XY,XYZ} to {load,store}{1,2,3}
explicitly require that SIMD loads and stores transfer vector elements to SIMD lanes with little-endian ordering for the lane indices so that the element at memory offset 0 is always at SIMD index 0

This eliminates the last "xyzw" naming from the API, which is a little sad for programs where that naming fits the domain, but we gain consistency across SIMD types, we gain the ability to represent LLVM's insertelement and extractelement with non-constant indices, and it's still pretty usable, given that this is a low-level API.

johnmccutchan · 2015-04-10T14:53:20Z

sgtm

PeterJensen · 2015-04-10T15:23:11Z

Yes! I think we're converging on a very nice and consistent API. One nitpick though; I like .setLane()/.getLane() better. They're shorter names and arguably express the operation better. I think it was John that said use of 'insert' sort of implies an addition not a replacement.

huningxin · 2015-04-13T06:09:27Z

sgtm! Thanks for the proposal @sunfishcode.

huningxin · 2015-04-15T02:38:11Z

rename {load,store}{X,XY,XYZ} to {load,store}{1,2,3}

load2 might confuse people that it means loading lane 2, does it? How about {load,store}{1,2,3}Lane(s)?

@sunfishcode , your thoughts?

sunfishcode · 2015-04-15T16:07:42Z

Our current convention is for operations that only act on one lane to have Lane in the name, so a load that only loads lane 2 (if we were to add such a thing) would be SIMD.type.loadLane2(a, i) or SIMD.type.loadLane(2, a, i) or something.

load2 for loading 2 elements feels right to me, and load2Lanes feels unnecessarily verbose. It is just a feeling though.

johnmccutchan · 2015-04-15T16:47:38Z

I agree with @sunfishcode load2 sounds better to me than load2Lanes.

huningxin · 2015-04-16T01:03:59Z

Thanks for your comments. It makes sense to me. I put together a PR to rename the load/store API #136. Please take a look.

bnjbvr · 2015-04-16T14:54:41Z

Nice! It seems that #136 only addressed the third bullet point of @sunfishcode 's list, should we re-open the issue so as to not forget about renaming other functions?

huningxin · 2015-04-17T04:45:13Z

Agree with @bnjbvr . #136 only updates the load/store APIs. Before we close it, we need consensus on .setLane()/.getLane() or .insertLane()/.extractLane().

PeterJensen · 2015-04-17T05:13:25Z

We did reach consensus at the 4/14 meeting. The consensus was .replaceLane()/.extractLane(). @sunfishcode felt that use of the get/set verbs might create unwanted associations to getter and setter functions.

huningxin · 2015-04-30T01:47:11Z

Thanks for the clarification. I will come up with a PR.

sunfishcode mentioned this issue Feb 14, 2015

LLVM vector operations #84

Closed

johnmccutchan closed this as completed Apr 16, 2015

huningxin mentioned this issue Apr 30, 2015

Replace the lane mutators and getters with replaceLane and extractLane #138

Merged

Elchi3 mentioned this issue May 11, 2015

Replace .flag{X,Y,Z,W} getters ? #145

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lane mutators #110

lane mutators #110

johnmccutchan commented Jan 22, 2015

johnmccutchan commented Jan 22, 2015

sunfishcode commented Jan 22, 2015

PeterJensen commented Jan 22, 2015

johnmccutchan commented Jan 22, 2015

sunfishcode commented Jan 22, 2015

PeterJensen commented Jan 22, 2015

sunfishcode commented Jan 22, 2015

sunfishcode commented Apr 9, 2015

johnmccutchan commented Apr 10, 2015

PeterJensen commented Apr 10, 2015

huningxin commented Apr 13, 2015

huningxin commented Apr 15, 2015

sunfishcode commented Apr 15, 2015

johnmccutchan commented Apr 15, 2015

huningxin commented Apr 16, 2015

bnjbvr commented Apr 16, 2015

huningxin commented Apr 17, 2015

PeterJensen commented Apr 17, 2015

huningxin commented Apr 30, 2015

lane mutators #110

lane mutators #110

Comments

johnmccutchan commented Jan 22, 2015

johnmccutchan commented Jan 22, 2015

sunfishcode commented Jan 22, 2015

PeterJensen commented Jan 22, 2015

johnmccutchan commented Jan 22, 2015

sunfishcode commented Jan 22, 2015

PeterJensen commented Jan 22, 2015

sunfishcode commented Jan 22, 2015

sunfishcode commented Apr 9, 2015

johnmccutchan commented Apr 10, 2015

PeterJensen commented Apr 10, 2015

huningxin commented Apr 13, 2015

huningxin commented Apr 15, 2015

sunfishcode commented Apr 15, 2015

johnmccutchan commented Apr 15, 2015

huningxin commented Apr 16, 2015

bnjbvr commented Apr 16, 2015

huningxin commented Apr 17, 2015

PeterJensen commented Apr 17, 2015

huningxin commented Apr 30, 2015