HLSL: implement numthreads for compute shaders #572

ghost · 2016-10-28T18:49:27Z

HLSL: implement numthreads for compute shaders

This PR adds handling of the numthreads attribute for compute shaders, as well as a general
infrastructure for returning attribute values from acceptAttributes, which may be needed in other
cases, e.g., unroll(x), or merely to know whether some attribute without params was given.

A map of enum values from TAttributeType to TIntermAggregate nodes is built and returned. It
can be queried with operator[] on the map. In the future there may be a need to also handle
strings (e.g., for patchconstantfunc), and those can easily be added to the class if needed.

The new test is in hlsl.numthreads.comp.
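
As a rough illustration of the map described above, here is a minimal sketch of an enum-keyed attribute map whose operator[] returns a null pointer when an attribute was not given. The names `AttrKind`, `ArgList`, `AttributeMap`, and `exampleAttributes` are hypothetical stand-ins, and a plain vector of integers replaces the TIntermAggregate node; none of this is glslang's actual code.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for TAttributeType; the real enum may differ.
enum class AttrKind { NumThreads, Unroll, Flatten };

// Explicit hash so the enum can key an unordered_map on older compilers.
struct AttrKindHash {
    std::size_t operator()(AttrKind k) const { return static_cast<std::size_t>(k); }
};

// Hypothetical stand-in for the TIntermAggregate holding the attribute's literals.
using ArgList = std::vector<int>;

class AttributeMap {
public:
    // Record an attribute; an empty list means "present, but no parameters".
    void set(AttrKind kind, ArgList args) { attrs[kind] = std::move(args); }

    // Returns the argument list, or nullptr if the attribute was not given.
    const ArgList* operator[](AttrKind kind) const {
        auto it = attrs.find(kind);
        return it == attrs.end() ? nullptr : &it->second;
    }

private:
    std::unordered_map<AttrKind, ArgList, AttrKindHash> attrs;
};

// Builds what a parser might return for "[numthreads(8, 8, 1)] [flatten]".
inline AttributeMap exampleAttributes() {
    AttributeMap m;
    m.set(AttrKind::NumThreads, {8, 8, 1});
    m.set(AttrKind::Flatten, {});
    return m;
}
```

Returning a pointer lets a caller distinguish "attribute absent" (null) from "attribute present without parameters" (non-null, empty list), which covers both uses mentioned in the description.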

johnkslang · 2016-10-29T06:17:41Z

hlsl/hlslGrammar.cpp

+
+    if ((attr = attributes.find("numthreads")) != attributes.end()) {
+        // TODO: handle multiple entry points.  TIntermediate presently only tracks one set of thread counts.
+        if (parseContext.parsingEntryPoint()) {


I considered exposing inEntryPoint before, for knowing if parameters were for functions or entry point, but the grammar doesn't know when that bit gets set, and it is not set until the entry point is partially parsed. That's how remapEntryPointIO() came about.

Given that the grammar can't depend on knowing whether parsingEntryPoint() is valid or not, is there a cleaner way, which keeps policy more cleanly in the parse helper?

E.g., acceptFunctionDefinition in the parse helper is where the decision is made about how to pass on numthreads?

johnkslang · 2016-10-29T06:21:47Z

hlsl/hlslAttributes.h

+namespace glslang {
+    class TIntermAggregate;
+
+    typedef std::unordered_map<TString, TIntermAggregate*> TAttributeMap;


Does the string represent specific things like numthreads and flatten? What about the design alternative of using an enum { numthreads, flatten, ... max } and an array[max], so that string manipulation ends at parsing? Or does that break down?
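
The enum-plus-array alternative raised here could look roughly like the sketch below. The names `AttrId`, `AttrSlot`, and `AttributeTable` are hypothetical and chosen only for illustration; the arguments are modeled as plain integers rather than AST nodes.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical attribute enum; the trailing AttrCount entry sizes the table.
enum AttrId { AttrNumThreads, AttrFlatten, AttrUnroll, AttrCount };

// One slot per attribute: whether it was seen, plus its literal arguments.
struct AttrSlot {
    bool present = false;
    std::vector<int> args;
};

class AttributeTable {
public:
    // Mark an attribute as seen and store its arguments.
    void set(AttrId id, std::vector<int> args) {
        slots[id].present = true;
        slots[id].args = std::move(args);
    }

    // True if the attribute appeared, with or without parameters.
    bool has(AttrId id) const { return slots[id].present; }

    // Arguments of the attribute; empty if absent or parameterless.
    const std::vector<int>& args(AttrId id) const { return slots[id].args; }

private:
    std::array<AttrSlot, AttrCount> slots{};
};
```

With this layout, lookups are plain array indexing and no string comparison happens after the attribute name has been converted to an enum value during parsing.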

ghost · 2016-10-30T18:35:19Z

Thanks for the comments - repushed with these changes:

- No more exposure of inEntryPoint: attribute handling is now done in HlslParseContext::handleFunctionDefinition. The grammar is insulated from this detail.
- There's now an enum for valid attributes, defined in hlslAttributes.h, and a helper function attributeFromName() to accept a string and find the matching enum. The two operator[] functions on the TAttributeMap now accept an enum value. This slightly simplifies the consumer code.
- I added some other attributes to the list of parsed ones.
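
The two changes above can be sketched as follows: a name-to-enum lookup, and a parse-helper routine that applies numthreads only for the entry point. This is a hedged sketch, not the PR's code. The signature and return type of `attributeFromName`, the `AttrKind::None` fallback, and the `LocalSize` and `applyNumThreads` names are all assumptions made for illustration.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical enum of recognized attributes; None marks an unknown name.
enum class AttrKind { None, NumThreads, Unroll, Flatten };

// Assumed shape of the helper: map an attribute name to its enum value.
// Case-sensitive matching is an assumption of this sketch.
inline AttrKind attributeFromName(const std::string& name) {
    if (name == "numthreads") return AttrKind::NumThreads;
    if (name == "unroll")     return AttrKind::Unroll;
    if (name == "flatten")    return AttrKind::Flatten;
    return AttrKind::None;
}

// Hypothetical holder for the compute workgroup size.
struct LocalSize {
    int x = 1;
    int y = 1;
    int z = 1;
};

// Copies the three numthreads literals into the local size, but only when
// the function being defined is the entry point. Returns true if applied.
inline bool applyNumThreads(const std::vector<int>* args, bool inEntryPoint, LocalSize& out) {
    if (args == nullptr || !inEntryPoint || args->size() != 3)
        return false;
    out.x = (*args)[0];
    out.y = (*args)[1];
    out.z = (*args)[2];
    return true;
}
```

Keeping the entry-point check inside the helper mirrors the intent of the change: the grammar only collects attributes, and the decision about what to do with them stays in the parse context.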

johnkslang · 2016-10-31T01:48:54Z

hlsl/hlslParseHelper.cpp

+    if (numThreadliterals != nullptr && inEntryPoint) {
+        const TIntermSequence& sequence = numThreadliterals->getSequence();
+
+        // TODO: handle multiple entry points.  TIntermediate presently only tracks one set of thread counts.


Not sure I follow.

TIntermediate only handles one entry point, and generally the parser/intermediate design is centered around that. If a SPIR-V module were in the future to have multiple entry points, it would be because multiple TIntermediate objects each produced a SPIR-V module and the modules were merged (the most recent thinking on how to do that), rather than because the whole stack learned to handle multiple entry points.

Shader entry functions are different semantics from a regular function at the source level, in the AST, and in SPIR-V. So, I think the best way to handle, say, source with real multiple entry points is to compile it multiple times, once per entry point, and get correct SPIR-V modules for each, and then use a SPIR-V merger.

ghost · 2016-10-31

I see; that was down to a misunderstanding on my part. Will fix the comment.

johnkslang · 2016-10-31T01:51:44Z

I think the code is fine. I only questioned a comment; curious if we're on the same page about it.

ghost · 2016-10-31T15:30:26Z

Removed WIP and re-pushed with a number of comment changes, an updated commit message, and a forward declaration of the TAttributeMap class to avoid #including it from inside other headers.

johnkslang requested changes Oct 29, 2016

johnkslang reviewed Oct 31, 2016

johnkslang approved these changes Oct 31, 2016

ghost changed the title ~~WIP: HLSL: numthreads~~ HLSL: implement numthreads for compute shaders Oct 31, 2016

johnkslang merged commit 89df3c2 into KhronosGroup:master Nov 1, 2016

ghost mentioned this pull request Nov 16, 2016

Complete HLSL -> SPIR-V translator #362
