spvBinaryParse API deficient: client needs to handle endianness; not easily composable #1

dneto0 · 2015-11-14T14:54:00Z

The spvBinaryParse API isn't quite right:

It requires the client to translate endianness when the module being parsed is of a foreign endiannes. (e.g. I'm on a little-endian machine, but the module is big-endian). Frankly, most clients won't bother to write that code since the world seems dominated by little-endian machines. Also, there is some subtlety to the translation since most values need to be endian-translated, but literal strings do not. This is complexity we don't need.
Composability: The API is space-efficient: It does not copy the data in the underlying module. To get
values from instructions, the client must have retained the base pointer to the module, and then index into it via the instruction offset and the operand offset. That's fine as far as it goes, but it isn't very composable. For example, suppose you want to chain multiple transforms together. The consumer at
the end of the pipeline needs a base pointer somewhere to read values out of the instructions. But that means the last transform has to form a pointer relative to some array of words that was meaningful only at the beginning of the pipeline.

I think we can fix both problems in a reasonable way, as follows:
The spv_parsed_instruction_t should gain a "words" member of type "const uint32_t*". It points to native-endian sequence of SPIR-V words for the entire instruction. (So, for example, words[0] contains the combined word count and opcode.) As before, the contract is that the words array is only valid for as long as the callback is excecuting, then may be discarded or overwritten.

In the common case where the original module was in the machine's native endianness, then no copying is required: the words pointer is just computed from the original module's base pointer plus the instruction's offset. In the non-native endian case, there is some copying, but it's done by the parser and not the client. And since the lifetime of the data is restricted to the duration of the callback, we can reuse that space for the next instruction.

With the new design it will become easy to make pipelines of analysis or transform steps in a space efficient way. It should also be generally time efficient since the chain of callbacks should usually be working on the same chunk of instruction storage. It should be hot in the cache.

Add members: - words: a pointer to an array of words in the instruction, in host native endianness. - num_words: sizes the words member Remove member: - offset This simplifies clients of spvBinaryParse, because they don't have to handle endianness translation. Also, it makes the binary parse API more composable, allowing for easy chaining of binary parse clients. A binary parse client is handed the array of words directly instead of having to reference some external array of all the words in the SPIR-V binary. It also allows a binary parse client to mutate the instruction stream before handing off to a downstream consumer. TODO(dneto): Still need to write the unit tests for spvBinaryParse Fixes: KhronosGroup#1

In some shaders there are a lot of very large and deeply nested structures. This creates a lot of work for scalar replacement. Also, since commit ca4457b we have been very aggressive as rewriting variables. This has causes a large increase in compile time in creating and then deleting the instructions. To help low the costs, I want to run a cleanup of some of the easy loads and stores to remove. This reduces the number of symbols sroa has to work on. It also reduces the amount of code the simplifier has to simplify because it was not generated by sroa. To confirm the improvement, I ran numbers on three different sets of shaders: Time to run --legalize-hlsl: Set #1: 55.89s -> 12.0s Set #2: 1m44s -> 1m40.5s Set #3: 6.8s -> 5.7s Time to run -O Set #1: 18.8s -> 10.9s Set #2: 5m44s -> 4m17s Set #3: 7.8s -> 7.8s Contributes to #1328.

`DebugInfoManager::AddDebugValueIfVarDeclIsVisible()` adds OpenCL.DebugInfo.100 DebugValue from DebugDeclare only when the DebugDeclare is visible to the give scope. It helps us correctly handle a reference variable e.g., { // scope KhronosGroup#1. int foo = /* init */; { // scope KhronosGroup#2. int& bar = foo; ... in the above code, we must not propagate DebugValue of `int& bar` for store instructions in the scope KhronosGroup#1 because it is alive only in the scope KhronosGroup#2. We have an exception: If the given DebugDeclare is used for a function parameter, `DebugInfoManager::AddDebugValueIfVarDeclIsVisible()` has to always add DebugValue instruction regardless of the scope. It is because the initializer (store instruction) for the function parameter can be out of the function parameter's scope (the function) in particular when the function was inlined. Without this change, the function parameter value information always disappears whenever we run the function inlining pass and `DebugInfoManager::AddDebugValueIfVarDeclIsVisible()`.

`DebugInfoManager::AddDebugValueIfVarDeclIsVisible()` adds OpenCL.DebugInfo.100 DebugValue from DebugDeclare only when the DebugDeclare is visible to the give scope. It helps us correctly handle a reference variable e.g., { // scope #1. int foo = /* init */; { // scope #2. int& bar = foo; ... in the above code, we must not propagate DebugValue of `int& bar` for store instructions in the scope #1 because it is alive only in the scope #2. We have an exception: If the given DebugDeclare is used for a function parameter, `DebugInfoManager::AddDebugValueIfVarDeclIsVisible()` has to always add DebugValue instruction regardless of the scope. It is because the initializer (store instruction) for the function parameter can be out of the function parameter's scope (the function) in particular when the function was inlined. Without this change, the function parameter value information always disappears whenever we run the function inlining pass and `DebugInfoManager::AddDebugValueIfVarDeclIsVisible()`.

dneto0 self-assigned this Nov 14, 2015

dneto0 mentioned this issue Nov 20, 2015

spv_parsed_instruction_t cleanup #21

Closed

dneto0 closed this as completed in 7bff3eb Nov 23, 2015

dneto0 mentioned this issue Sep 19, 2017

spirv-opt: assert/crash with --eliminate-local-multi-store pass #824

Closed

dnovillo referenced this issue in dnovillo/SPIRV-Tools Sep 21, 2017

Review feedback. Round #1.

baf8287

dnovillo referenced this issue in dnovillo/SPIRV-Tools Sep 21, 2017

Review feedback. Round #1.

27bf580

dnovillo referenced this issue in dnovillo/SPIRV-Tools Sep 21, 2017

Review feedback. Round #1.

eba15f1

dnovillo mentioned this issue Oct 26, 2017

ADCE: Dead if elimination #916

Closed

This was referenced Nov 24, 2017

use-after-free in Instruction::unique_id() #1004

Closed

Change IRContext::KillInst to delete instructions. #1005

Merged

s-perron mentioned this issue Feb 22, 2018

Clean up variables before sroa #1344

Merged

This was referenced Jan 16, 2019

PassManagerTest is leaking an Instruction object #2296

Closed

Parsing of -Oconfig is leaking #2297

Closed

jaebaek mentioned this issue Oct 15, 2020

Add DebugValue for function param regardless of scope #3923

Merged

dneto0 mentioned this issue Jun 24, 2022

spirv-val: Validate zero product workgroup size #4828

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spvBinaryParse API deficient: client needs to handle endianness; not easily composable #1

spvBinaryParse API deficient: client needs to handle endianness; not easily composable #1

dneto0 commented Nov 14, 2015

spvBinaryParse API deficient: client needs to handle endianness; not easily composable #1

spvBinaryParse API deficient: client needs to handle endianness; not easily composable #1

Comments

dneto0 commented Nov 14, 2015