Add Hopper support #5538

crtrott · 2022-10-09T21:37:37Z

Addresses issue #5524

dalg24

kokkos/core/src/Cuda/Kokkos_Cuda_BlockSize_Deduction.hpp

Lines 218 to 232 in 61d7db5

    
           switch (compute_capability) { 
        
             case 30: 
        
             case 32: 
        
             case 35: return 16; 
        
             case 37: return 80; 
        
             case 50: 
        
             case 53: 
        
             case 60: 
        
             case 62: return 64; 
        
             case 52: 
        
             case 61: return 96; 
        
             case 70: 
        
             case 80: 
        
             case 86: return 8; 
        
             case 75: return 32;

crtrott · 2022-10-10T05:07:10Z

Added the shared config, and also the printconfig thing. Confirmed in tuning guide that it also can do 8kB shared memory.

masterleinad

What about cmake/compile_tests/cuda_compute_capability.cc?

crtrott · 2022-10-10T13:53:51Z

Fixed: also now actually checked for each use of ARCH_AMPERE and ARCH_VOLTA in our code base and made the appropriate adjustments.

crtrott · 2022-10-10T15:47:23Z

Fixed the logic mistake in the half precision thing.

Add Hopper support

3fe9540

dalg24 requested changes Oct 10, 2022

View reviewed changes

Add config output and shared mem config for Hopper

18cefac

masterleinad reviewed Oct 10, 2022

View reviewed changes

Add hopper to compute_capability detector

ce3014c

masterleinad approved these changes Oct 10, 2022

View reviewed changes

Fix up cases where the arch macro is used for HOPPER

27393a0

crtrott force-pushed the support-hopper branch from 7edac6b to 27393a0 Compare October 10, 2022 15:46

dalg24 approved these changes Oct 10, 2022

View reviewed changes

crtrott merged commit 833da38 into kokkos:develop Oct 10, 2022

crtrott deleted the support-hopper branch October 10, 2022 19:59

This was referenced Oct 12, 2022

CHANGELOG: 4.0 #5439

Closed

Document cmake option and macro for NVIDIA Hopper GPU arch kokkos/kokkos-core-wiki#184

Merged

PhilMiller mentioned this pull request Dec 15, 2022

[3.7.02] Add Hopper support and update nvcc_wrapper to work with CUDA 12 #5693

Merged

PhilMiller added Patch Release CHANGELOG Item to be included in release CHANGELOG Backend - CUDA labels Dec 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Hopper support #5538

Add Hopper support #5538

crtrott commented Oct 9, 2022

dalg24 left a comment

crtrott commented Oct 10, 2022

masterleinad left a comment

crtrott commented Oct 10, 2022

crtrott commented Oct 10, 2022

	switch (compute_capability) {
	case 30:
	case 32:
	case 35: return 16;
	case 37: return 80;
	case 50:
	case 53:
	case 60:
	case 62: return 64;
	case 52:
	case 61: return 96;
	case 70:
	case 80:
	case 86: return 8;
	case 75: return 32;

Add Hopper support #5538

Add Hopper support #5538

Conversation

crtrott commented Oct 9, 2022

dalg24 left a comment

Choose a reason for hiding this comment

crtrott commented Oct 10, 2022

masterleinad left a comment

Choose a reason for hiding this comment

crtrott commented Oct 10, 2022

crtrott commented Oct 10, 2022