-
Notifications
You must be signed in to change notification settings - Fork 202
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when building relion with many recent commits that changed CMakeLists #1076
Comments
Apologies, obviously an error on our end, we were using a flag -DCUDA_ARCH=12.0.0 erroneously. I wonder why it ever worked? Works after this is taken out. Perhaps a follow-up question - is there an advantage of making separate compilations of relion for nodes having different gpus? We have P100s, V100s, and A100s, all supporting different compute capabilities. We can go for a single install with the lowest common denominator (P100 = sm60), but will we have better performance on A100s if we compile it there with sm80? If yes, would multi-node jobs still work between nodes that have relion compiled with different cuda compute capabilities? |
Theoretically yes but when I tested this before, the difference was less than 5 %. Of course this depends on the card and the task (is the GPU really limiting?). You should test on your hardware and dataset.
Yes. |
Ok, given that we have many people running very different projects and so can't do focused optimization, 5 % potential gain we'll probably skip. Many thanks for your input! |
Can't build relion v5.0 for commits newer than fbf4f71, starting from fb5d7c9. Using CMake via easybuild. Error is in the build part, doesn't seem to be related to easybuild.
Full log attached:
easybuild-relion-test_ja240205_5.0-beta-fb5d7c9e693c6e9ba1deaac5727e487092d82a40-20240205.184401.cZJzJ.log
Relevant part:
Trying to install the latest commit (b75b38c) gives a different error:
Environment:
OS: CentOS7
MPI runtime: OpenMPI 3.1.6
Cmake 3.24.3
CUDA 12.0.0
Please advise
The text was updated successfully, but these errors were encountered: