Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Choose the value for the -arch nvcc flag depending on CUDA version #164

Merged

Conversation

sbastrakov
Copy link
Member

The original code has hardcoded 2.0, not supported since CUDA 9.
Closes #162.

The original code has hardcoded 2.0, not supported since CUDA 9.
Closes alpaka-group#162.
@sbastrakov
Copy link
Member Author

Note: the documentation and examples still have 2.0 hardcoded, while with this PR that would be 2.0 for CUDA < 9 and 3.0 for CUDA >= 9. After an offline discussion with @psychocoderHPC we decided to leave it as is for now.

@psychocoderHPC
Copy link
Member

psychocoderHPC commented Sep 24, 2019

This is only a quick fix we will modernize the make instead of doing it half wired. (see #163)

@psychocoderHPC psychocoderHPC self-assigned this Sep 24, 2019
@psychocoderHPC psychocoderHPC merged commit efd20bc into alpaka-group:dev Sep 25, 2019
@psychocoderHPC psychocoderHPC added this to the 2.4.0crp milestone Sep 25, 2019
@sbastrakov sbastrakov deleted the fix-nvccComputeCapability branch September 25, 2019 15:38
@@ -22,7 +22,12 @@ endif()
# CUDA
###############################################################################
find_package(CUDA REQUIRED)
set(CUDA_NVCC_FLAGS "-arch=sm_20;-use_fast_math;")
if(CUDA_VERSION VERSION_LESS 9.0)
Copy link
Member

@ax3l ax3l Sep 25, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add a if(NOT DEFINED COMPUTE_CAPABILITY) around this block, so one can configure it externally.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Hardcoded flag for compute capability 2.0
3 participants