AMD ROCm: `torch.backends.cudnn.benchmark` should be set to `False` by default on ROCm
#2552
Labels: bug (Something isn't working)
Describe the bug
With AMD HIP, `torch.backends.cudnn.benchmark` defaults to `True`. On CUDA, it defaults to `False`.

While this is an upstream PyTorch decision (or bug), having `benchmark` default to `True` is probably a bad idea in our case, because we have wildly different input shapes. Since CNN kernel benchmarking is performed on a per-shape basis, those slow benchmarks run very often, so training times are worsened rather than improved.

We should probably add a common per-backend quirks file to SB for this kind of change.
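A minimal sketch of what such a quirk could look like. The helper name `rocm_benchmark_default` is hypothetical (not an existing SB or PyTorch API); it only assumes that `torch.version.hip` is non-`None` on ROCm/HIP builds of PyTorch, which is how PyTorch itself distinguishes the two backends:

```python
import warnings


def rocm_benchmark_default(hip_version, current_benchmark):
    """Return the value torch.backends.cudnn.benchmark should take.

    hip_version: torch.version.hip (a version string on ROCm builds,
        None on CUDA/CPU builds).
    current_benchmark: the current value of the benchmark flag.
    """
    if hip_version is not None and current_benchmark:
        # Warn because we are overriding the upstream ROCm default.
        warnings.warn(
            "Overriding torch.backends.cudnn.benchmark=True on ROCm: "
            "per-shape kernel benchmarking hurts training with wildly "
            "varying input shapes."
        )
        return False
    return current_benchmark
```

Applied at startup it would look roughly like `torch.backends.cudnn.benchmark = rocm_benchmark_default(torch.version.hip, torch.backends.cudnn.benchmark)`, leaving CUDA behaviour untouched.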
Expected behaviour
On AMD ROCm, it should be set to `False`, probably with a warning since we break the default.

To Reproduce
No response
Environment Details
No response
Relevant Log Output
No response
Additional Context
No response