
Deconvolutions improve performance of AH-Net #1023

Closed
mmarcinkiewicz opened this issue Sep 11, 2020 · 3 comments · Fixed by #1028
Labels
enhancement New feature or request
mmarcinkiewicz commented Sep 11, 2020

Describe the bug
In the current environment, deconvolutions are significantly faster with mixed precision than trilinear interpolation for upsampling. The current MONAI implementation of AH-Net defaults upsample_mode to "trilinear", which is suboptimal.

To Reproduce
Run training with upsample_mode="trilinear" and with upsample_mode="transpose", then compare throughput.
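
For reference, a minimal timing sketch along these lines; everything other than upsample_mode (the AHNet arguments, input size, and iteration count) is illustrative:

import time
import torch
from monai.networks.nets import AHNet

def benchmark(mode, iters=20):
    net = AHNet(spatial_dims=3, in_channels=1, out_channels=2, upsample_mode=mode).cuda()
    opt = torch.optim.SGD(net.parameters(), lr=1e-3)
    scaler = torch.cuda.amp.GradScaler()
    x = torch.randn(2, 1, 64, 64, 64, device="cuda")
    torch.cuda.synchronize()
    start = time.time()
    for _ in range(iters):
        opt.zero_grad()
        with torch.cuda.amp.autocast():  # mixed precision, as in the report
            loss = net(x).float().mean()
        scaler.scale(loss).backward()
        scaler.step(opt)
        scaler.update()
    torch.cuda.synchronize()
    return (time.time() - start) / iters  # seconds per training step

for mode in ("trilinear", "transpose"):
    print(mode, benchmark(mode))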

Expected behavior
The default upsample_mode should be changed to "transpose".

Screenshots
Throughput measured on V100 and A100 for a 3D workload.
[image: throughput comparison chart, transpose vs. trilinear, on V100 and A100]

Environment (please complete the following information):

  • Ubuntu 18.04
  • Python 3.6
  • MONAI version f1998b72a941d1e5f9578a66dc1c20b01913caab
  • CUDA 11, cuDNN 8.0.3/8.0.4
  • GPUs: V100, A100

Additional context
In cuDNN 8.0.3, deconvolutions gained Tensor Core support, which improves their performance significantly.
There are also issues currently being tracked for PyTorch and cuDNN that might improve performance further. I'll monitor the status and update if needed.
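
As a rough illustration of the gap, a sketch timing just the two upsampling ops under autocast (shapes are illustrative; the channel count is a multiple of 8 so cuDNN can select Tensor Core kernels):

import time
import torch
import torch.nn.functional as F

x = torch.randn(2, 64, 32, 32, 32, device="cuda")
deconv = torch.nn.ConvTranspose3d(64, 64, kernel_size=2, stride=2).cuda()

def timeit(fn, iters=50):
    for _ in range(5):  # warm-up so algorithm selection is excluded
        fn()
    torch.cuda.synchronize()
    start = time.time()
    for _ in range(iters):
        fn()
    torch.cuda.synchronize()
    return (time.time() - start) / iters

with torch.cuda.amp.autocast():
    t_d = timeit(lambda: deconv(x))
    t_i = timeit(lambda: F.interpolate(x, scale_factor=2, mode="trilinear", align_corners=True))
print(f"transpose: {t_d:.5f}s  trilinear: {t_i:.5f}s")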

@wyli wyli added this to To-Do in v0.3.0 via automation Sep 11, 2020
@wyli wyli added the enhancement New feature or request label Sep 11, 2020
Nic-Ma (Contributor) commented Sep 11, 2020

Hi @yiheng-wang-nv,

I remember you fixed a determinism issue with the upsampling in AHNet; is it related to this ticket?

Thanks.

wyli (Contributor) commented Sep 11, 2020

> Hi @yiheng-wang-nv, I remember you fixed a determinism issue with the upsampling in AHNet; is it related to this ticket?

That's noted in the docstring, @Nic-Ma:

upsample_mode: [``"transpose"``, ``"bilinear"``, ``"trilinear"``]
The mode of upsampling manipulations.
Using the last two modes cannot guarantee the model's reproducibility. Defaults to ``trilinear``.

yiheng-wang-nv (Contributor) commented Sep 11, 2020

> Hi @yiheng-wang-nv, I remember you fixed a determinism issue with the upsampling in AHNet; is it related to this ticket?

FYI, the non-determinism issue is explained in an old PR. Generally speaking, torch.nn.functional.interpolate() and torch.nn.MaxPool1d/2d/3d() (when kernel size != stride) use atomicAdd in their backward passes, which introduces non-deterministic results.
This ticket is about the speed comparison between trilinear interpolation and deconvolution. Since our deconvolution mode ensures determinism, the max-pool kernel size was changed (shown below); will that have an impact on speed?

if upsample_mode == "transpose":
    # non-overlapping windows (kernel == stride) keep the transpose path deterministic
    self.maxpool = pool_type(kernel_size=(2, 2, 2)[-spatial_dims:], stride=2)
else:
    # overlapping windows (kernel 3, stride 2) use atomicAdd in backward, hence non-deterministic
    self.maxpool = pool_type(kernel_size=(3, 3, 3)[-spatial_dims:], stride=2, padding=1)
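
As for the speed question, the two pooling configurations can be timed in isolation; a standalone sketch (shapes and iteration count are illustrative):

import time
import torch

x = torch.randn(2, 64, 64, 64, 64, device="cuda", requires_grad=True)
pools = {
    "kernel=2, stride=2 (transpose path)": torch.nn.MaxPool3d(kernel_size=2, stride=2),
    "kernel=3, stride=2, pad=1 (interp path)": torch.nn.MaxPool3d(kernel_size=3, stride=2, padding=1),
}
for name, pool in pools.items():
    torch.cuda.synchronize()
    start = time.time()
    for _ in range(100):
        pool(x).sum().backward()  # include backward, where atomicAdd matters
    torch.cuda.synchronize()
    print(name, (time.time() - start) / 100)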

v0.3.0 automation moved this from To-Do to Done Sep 11, 2020