Improve guards for arch-specific instructions (#368)
When choosing between a specialized implementation that uses architecture-specific functionality and a generic fallback, it is usually preferable to make the fallback the default. This gives the software the best possible chance of functioning without modification on future hardware. Of course, the library will still need code updates to function optimally on hardware released after the software was written.

rocSPARSE can also be compiled with CXXFLAGS=-DROCSPARSE_USE_MOVE_DPP=0 to force the use of the fallback implementation, or with the value 1 to force the use of the specialized __hip_move_dpp implementation.

This change fixes the compilation error:

    Illegal instruction detected: Invalid dpp_ctrl value: broadcasts are not supported on GFX10+

when building for the unsupported Navi 1x and Navi 2x GPUs, as reported in #250.
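
The guard pattern described above might look roughly like the following. This is a minimal sketch, not the code from this commit: the list of architecture macros is an illustrative assumption, and use_move_dpp() is a hypothetical helper that only makes the selected path visible. Only the macro name ROCSPARSE_USE_MOVE_DPP and its 0/1 override come from the message above.

```cpp
// Honor an explicit -DROCSPARSE_USE_MOVE_DPP=0 or =1 from the build flags.
// Otherwise enable the specialized path only for architectures listed here
// and fall back to the generic implementation everywhere else.
#ifndef ROCSPARSE_USE_MOVE_DPP
// Assumption: this architecture list is illustrative, not the commit's list.
#if defined(__gfx803__) || defined(__gfx900__) || defined(__gfx906__) \
    || defined(__gfx908__) || defined(__gfx90a__)
#define ROCSPARSE_USE_MOVE_DPP 1
#else
#define ROCSPARSE_USE_MOVE_DPP 0 // unknown or future hardware: generic fallback
#endif
#endif

// Hypothetical helper reporting which path was selected at compile time.
// In device code, the true branch would call the __hip_move_dpp-based
// implementation and the false branch the generic one.
constexpr bool use_move_dpp()
{
#if ROCSPARSE_USE_MOVE_DPP
    return true;
#else
    return false;
#endif
}
```

Because the specialized path is opt-in per architecture, a target missing from the list compiles the fallback rather than failing on an unsupported instruction.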