
MIOpen using wrong ISA for certain convolution operations #117

Closed
89Mods opened this issue Aug 29, 2019 · 6 comments
@89Mods

89Mods commented Aug 29, 2019

I've just updated MIOpen to the latest version (package version 2.0.1.7405), and it now apparently uses the wrong ISA for my GPU. I normally work with PyTorch for all my projects; however, this issue occurs while MIOpen is tuning its performance database, before my Python code even runs. As soon as it tries to do so, it repeatedly runs into errors similar to this:

```
<instantiation>:4:3: error: instruction not supported on this GPU
                s_mul_hi_u32 s[tiles_w], div_const_1_4, s[tiles_w]
                ^
<instantiation>:2:2: note: while in macro instantiation
        _s_div_const_u32_u16 s[tiles_w], s[tiles_w], 4
        ^
<stdin>:796:2: note: while in macro instantiation
        _s_ceil_u32 s[tiles_w], s[S], %xformx_f_size
        ^
<instantiation>:4:3: error: instruction not supported on this GPU
                s_mul_hi_u32 s[tiles_h], div_const_1_4, s[tiles_h]
                ^
<instantiation>:2:2: note: while in macro instantiation
        _s_div_const_u32_u16 s[tiles_h], s[tiles_h], 4
        ^
<stdin>:797:2: note: while in macro instantiation
        _s_ceil_u32 s[tiles_h], s[R], %xformy_f_size
        ^
<instantiation>:8:2: error: instruction not supported on this GPU
        v_sub_u32         v[vtmp+1+3],   0,            v[vtmp+1+1]
        ^
<stdin>:857:2: note: while in macro instantiation
        ceil_2_32_div_u16 v[vtmp], v[vtmp], vtmp+1, stmp
```

This goes on for several pages; the full log is here: https://pastebin.com/dughVHes
I'm running an RX 470, though the error also occurs on an RX 560.
The only reliable way I've found to reproduce this is to run a convolution with exactly 3 output channels and 128 input channels; otherwise the error occurs seemingly at random, as most operations complete successfully and only some trigger it (as seen in the full log).
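
For reference, a minimal sketch of the configuration described above (not from the original report; it assumes a ROCm build of PyTorch, and the batch and spatial sizes are illustrative):

```python
# Hypothetical repro sketch: a convolution with 128 input channels and
# 3 output channels, the shape described above. Assumes a ROCm PyTorch build;
# everything other than the channel counts is illustrative.
import torch
import torch.nn as nn

conv = nn.Conv2d(in_channels=128, out_channels=3, kernel_size=3, padding=1).cuda()
x = torch.randn(1, 128, 64, 64, device="cuda", requires_grad=True)

y = conv(x)
# The Conv3x3AsmWrW kernel named below is a backward-weights kernel, so the
# failure would be expected to surface during the backward pass (or while
# MIOpen tunes its perf db for it).
y.sum().backward()
print(y.shape)
```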

@daniellowell
Contributor

@zjing14 Please take a look.

@zjing14
Contributor

zjing14 commented Aug 29, 2019

@daniellowell The Conv3x3AsmWrW kernel is not supported on the RX 470/560 (i.e., the Fiji/Polaris architectures). We need to disable the kernel on these platforms. Will create a PR.

@daniellowell
Contributor

@89Mods We will fix this in the next release. In the meantime, can you check whether setting the environment variable
MIOPEN_DEBUG_GCN_ASM_KERNELS=0
unblocks you?
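
A minimal usage sketch for applying this from a PyTorch script (not from the comment above; only the variable name comes from it, the rest is illustrative):

```python
# Set the workaround variable before importing torch so that MIOpen sees it
# when the ROCm backend is initialized. Equivalently, export it in the shell
# before launching Python.
import os
os.environ["MIOPEN_DEBUG_GCN_ASM_KERNELS"] = "0"  # disable hand-written GCN asm kernels

import torch  # imported after setting the environment on purpose
```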

@89Mods
Author

89Mods commented Sep 14, 2019

@daniellowell I tried setting the environment variable as you suggested, but while it does hide the errors, nothing works correctly anymore. I also tried downgrading back to MIOpen 2.0.0, but doing that now causes a segfault in PyTorch, so I'm pretty much stuck with the broken 2.0.1 release.

@daniellowell
Contributor

Resolved in 2.1.0

@daniellowell
Contributor

@89Mods Please try the latest release. If your issue is not resolved, please feel free to reopen this issue.
