[TOPI] Fix bug in Winograd on CUDA #4260

comaniac · 2019-11-05T22:31:27Z

Several topics [1, 2, 3] in the discuss mention that the conv2d failed to pass the shape checking in the runtime after the conv2d has been tuned by AutoTVM. This PR investigated the reason and resolved the issue. (thanks the help from @kevinthesun, @Laurawly and @vinx13 with the investigation).

Investigation
All errors happen at the same dimension: the output image height (arg2.shape[2]).
For example in [1], the workload has input (1, 80, 73, 73) with stride=1 and padding=0, so the output shape should be (1, 192, 71, 71). However, it encountered the following error:

TVMError: Check failed: ret == 0 (-1 vs. 0) : Assert fail: (73 == int32(arg2.shape[2])), Argument arg2.shape[2] has an unsatisfied constraint

It means that somehow the output shape is set to 73 instead of 71 during scheduling. After digging into the code, we found that Winograd schedule overrides the strides and padding to be 1 regardless the input workload.

Modification
Accordingly, this PR modified two parts. First, we get the stride and padding directly from the input workload and check if it is valid like previous. This change passed the isolated example that yyding provided in the TVM discuss.

Second, the Winograd unittest uses fallback config, but the Winograd schedule will fallback to the direct template if the config is fallback. It means the alter layer for Winograd schedule is never tested. This PR also forced the fallback to be False to enable the testing, which was suggested by @vinx13.

[1] https://discuss.tvm.ai/t/auto-tune-error-occurs-during-inference-when-using-auto-tuned-schedule
[2] https://discuss.tvm.ai/t/error-float16-for-cuda-with-autotvm
[3] https://discuss.tvm.ai/t/graphruntime-module-run-failed-when-created-with-logfile-from-autotvm-tuning

kevinthesun · 2019-11-05T23:22:34Z

@cbalint13

cbalint13 · 2019-11-06T11:01:38Z

@comaniac ,

Looks good to me (looked at winograd part).

topi/python/topi/cuda/conv2d_winograd.py

Laurawly

LGTM

comaniac · 2019-11-06T22:30:37Z

@vinx13 could you help merge it? Thanks.

vinx13 · 2019-11-06T22:32:35Z

@tqchen seems I don't have write permission

Laurawly · 2019-11-06T23:02:56Z

Thanks @comaniac @vinx13, this is now merged.

tqchen · 2019-11-06T23:10:19Z

@vinx13 you should have permission now, if now, please check if you have linked your github account. I will send you instructions

vinx13 · 2019-11-07T01:15:15Z

@tqchen I still don't have permission. I'm already in the Apache github org, is there anything I'm missing?

* fix winograd * move get padding after kernel transform

fix winograd

a00b5f8

vinx13 reviewed Nov 6, 2019

View reviewed changes

topi/python/topi/cuda/conv2d_winograd.py Outdated Show resolved Hide resolved

tqchen assigned vinx13 Nov 6, 2019

move get padding after kernel transform

f20ba53

vinx13 approved these changes Nov 6, 2019

View reviewed changes

Laurawly approved these changes Nov 6, 2019

View reviewed changes

comaniac unassigned vinx13 Nov 6, 2019

Laurawly merged commit 7211c27 into apache:master Nov 6, 2019

comaniac deleted the fix_winograd_cuda branch November 6, 2019 23:03

comaniac mentioned this pull request Nov 7, 2019

[TOPI][CUDA] Fix Winograd Kernel Size Support #4276

Merged

zxy844288792 pushed a commit to neo-ai/tvm that referenced this pull request Nov 13, 2019

[TOPI] Fix bug in Winograd on CUDA (apache#4260)

976d816

* fix winograd * move get padding after kernel transform

yzhliu mentioned this pull request Nov 16, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TOPI] Fix bug in Winograd on CUDA #4260

[TOPI] Fix bug in Winograd on CUDA #4260

comaniac commented Nov 5, 2019 •

edited

kevinthesun commented Nov 5, 2019

cbalint13 commented Nov 6, 2019

Laurawly left a comment

comaniac commented Nov 6, 2019

vinx13 commented Nov 6, 2019

Laurawly commented Nov 6, 2019

tqchen commented Nov 6, 2019

vinx13 commented Nov 7, 2019 •

edited

[TOPI] Fix bug in Winograd on CUDA #4260

[TOPI] Fix bug in Winograd on CUDA #4260

Conversation

comaniac commented Nov 5, 2019 • edited

kevinthesun commented Nov 5, 2019

cbalint13 commented Nov 6, 2019

Laurawly left a comment

Choose a reason for hiding this comment

comaniac commented Nov 6, 2019

vinx13 commented Nov 6, 2019

Laurawly commented Nov 6, 2019

tqchen commented Nov 6, 2019

vinx13 commented Nov 7, 2019 • edited

comaniac commented Nov 5, 2019 •

edited

vinx13 commented Nov 7, 2019 •

edited