ConvTransposeGrad CUDA Kernel #17201
Thank you for the review @er3x3 :)
@baijumeswani, please take a look at the test error and update the test: https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1122071&view=logs&j=5076e696-f193-5f12-2d8a-703dda41a79b&t=318cbefa-99dd-503e-0fe4-581779019aa9&l=14091
Cherry-pick 1st round for rel-1.16.0 from https://github.com/microsoft/onnxruntime/issues?q=label%3Arelease%3A1.16+label%3Atriage%3Aapproved+is%3Aclosed, except #17201, which caused a UT failure that is not fixed yet. PR list: #16417 #16936 #17000 #17236 #17238 #17240 #17252 #17255 #17258 #17265 #17267 #17277
This pull request adds a CUDA kernel implementation for the convolution transpose gradient operation to ONNX Runtime.
Work here is heavily derived from the previous work by @er3x3 in #8613 and from PyTorch convolution kernels.
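
For readers unfamiliar with the operation, here is a rough reference sketch of what a convolution transpose gradient computes. It is not the CUDA kernel added in this PR. It is a pure-Python, 1-D, single-channel illustration with no padding or dilation, and the function names and the `dY`/`dX`/`dW`/`dB` naming are assumptions chosen for illustration only.

```python
def conv_transpose_1d(x, w, b=0.0, stride=1):
    """Forward pass: each input element scatters a scaled copy of w into y."""
    out_len = (len(x) - 1) * stride + len(w)
    y = [b] * out_len
    for i, xi in enumerate(x):
        for k, wk in enumerate(w):
            y[i * stride + k] += xi * wk
    return y


def conv_transpose_1d_grad(dy, x, w, stride=1):
    """Backward pass: gradients of a scalar loss w.r.t. x, w and the bias.

    dy is the gradient of the loss w.r.t. the forward output y.
    """
    n, kernel = len(x), len(w)
    # dX: gather from dY with the same weights (an ordinary strided convolution).
    dx = [sum(dy[i * stride + k] * w[k] for k in range(kernel)) for i in range(n)]
    # dW: correlate the forward input with dY.
    dw = [sum(x[i] * dy[i * stride + k] for i in range(n)) for k in range(kernel)]
    # dB: the bias is added to every output element, so its gradient is the sum of dY.
    db = sum(dy)
    return dx, dw, db


if __name__ == "__main__":
    x, w = [1, 2, 3], [1, 0, -1]
    print(conv_transpose_1d(x, w, stride=2))
    print(conv_transpose_1d_grad([1, 2, 3, 4, 5, 6, 7], x, w, stride=2))
```

A sketch like this can serve as a small ground truth when comparing a GPU kernel's output against expected values in a unit test, assuming the test uses the same layout and stride conventions.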