Detect gpu architecture in `build.rs` and pass `--gpu-architecture` flag to nvcc #438

coreylowman · 2023-02-08T21:39:26Z

Unsure how to detect as of now, so part of this issue is to figure that out!

Narsil · 2023-02-19T22:00:17Z

nvidia-smi --query-gpu=compute_cap --format=csv

Or

cudaDeviceProp deviceProp;
cudaGetDeviceProperties(&deviceProp, dev);
std::printf("%d.%d\n", deviceProp.major, deviceProp.minor)

Might be better though (nvidia-smi is not necessarily installed)
https://stackoverflow.com/questions/48283009/nvcc-get-device-compute-capability-in-runtime

JoeOsborn · 2023-02-19T22:55:42Z

Two questions on this:

Is -arch=native what is being asked for here? (Per https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html#options-for-steering-gpu-code-generation )
Would -arch=all or -arch=all-major be a better choice for release builds?

Otherwise you might need to get the device name(s) and check them against this list or something: https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html#gpu-feature-list

Narsil · 2023-02-20T14:22:21Z

Would -arch=all or -arch=all-major be a better choice for release builds?

I think not, one asset of being a compiled framework, is that we can make it highly optimized for the highest possible compute capability making small and efficient binaries for a specific platform it's supposed to work on.

Making this overridable for users that want to create shared binaries would be good. But I don't think it should be the default.

coreylowman · 2023-02-22T14:51:54Z

Oh -arch=native seems exactly what we want! Easy change too, thanks for sharing that

* #438 using --gpu-architecture native for nvcc * Revert adding cuda to default features * Fixing ci-check for cuda

coreylowman mentioned this issue Feb 10, 2023

0.11.0 release #278

Closed

47 tasks

coreylowman added a commit that referenced this issue Feb 22, 2023

#438 using --gpu-architecture native for nvcc

29731d8

coreylowman mentioned this issue Feb 22, 2023

Using --gpu-architecture native with nvcc #474

Merged

coreylowman closed this as completed in #474 Feb 22, 2023

coreylowman added a commit that referenced this issue Feb 22, 2023

Using --gpu-architecture native with nvcc (#474)

c354ca1

* #438 using --gpu-architecture native for nvcc * Revert adding cuda to default features * Fixing ci-check for cuda

coreylowman mentioned this issue Mar 13, 2023

nvcc does not recognize gpu_architecture option native #553

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detect gpu architecture in `build.rs` and pass `--gpu-architecture` flag to nvcc #438

Detect gpu architecture in `build.rs` and pass `--gpu-architecture` flag to nvcc #438

coreylowman commented Feb 8, 2023

Narsil commented Feb 19, 2023

JoeOsborn commented Feb 19, 2023 •

edited

Loading

Narsil commented Feb 20, 2023

coreylowman commented Feb 22, 2023

Detect gpu architecture in build.rs and pass --gpu-architecture flag to nvcc #438

Detect gpu architecture in build.rs and pass --gpu-architecture flag to nvcc #438

Comments

coreylowman commented Feb 8, 2023

Narsil commented Feb 19, 2023

JoeOsborn commented Feb 19, 2023 • edited Loading

Narsil commented Feb 20, 2023

coreylowman commented Feb 22, 2023

Detect gpu architecture in `build.rs` and pass `--gpu-architecture` flag to nvcc #438

Detect gpu architecture in `build.rs` and pass `--gpu-architecture` flag to nvcc #438

JoeOsborn commented Feb 19, 2023 •

edited

Loading