```
1
```

ECE 786, Spring 2023, HW 3

```
Considering the following code in a GPU kernel function:
int index = blockIdx.x*blockDim.x + threadIdx.x;
       if (index > x1) // Section 1 - THEN
       {
                  . . .
        }
        else
                                 //Section 1- ELSE
        {
                 . . .
        }
d_e [index] = d_c [index] + d_d [index] ;
   Assuming that the corresponding assembly code is as follows:
PC
                                    Comment
        Instruction
       MOV R1, c [0x1] [0x100];
*0000*
*0020*
        SSY 0xf0;
                                    SSY Instruction (push stack)
*0028*
        . . .
                                    . . .
       @PO BRA Oxb8;
                                    Branch corresponding to Outer IF Then-Else
*0090*
                                    (push stack if divergent)
*0098* LD.E R3, [R2];
                                    ELSE PART
                                   ELSE PART. Notice ".S" flag. (pop stack)
*00b0* ST.E.S [R6], R2;
*00b8* LD.E R5, [R4];
                                    IF THEN PART
. . .
        . . .
        NOP.S CC.T;
                                    Last Instruction of Outer If-Then. Notice ".S" flag.
*00e8*
                                    (pop stack)
*00f0* LD.E R3, [R10];
                                    Threads Synchronizes at this point.
*00f8* LD.E R4, [R8];
*0100* LD.E R2, [R6];
                                    Go till Exit.
```

1

Brian Park — 200190057

| ECE 786     | Advanced Computer Architecture: | Data Parallel Processors |
|-------------|---------------------------------|--------------------------|
| Spring 2023 | Huiyang Zhou                    | HW 3                     |
|             |                                 |                          |

\*0140\* EXIT;