-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
QUDA with MILC #313
Comments
Well, these two warning messages look QUDA generated. What makes you think you're not running in the GPU? Did you try to log in the node you were running and call nvidia-smi to see the GPU workload? Aldo take into accout that, if there is not cache file, the first time you run it might take a while...
|
Which part of QUDA are you trying to use exactly? Did you enable VERBOSE output? |
The application ks_imp_rhmc seems that trying to use HISQ fermion force and gauge tools,and I just use the configure file in the quda's directory named configure.milc.titan to build up QUDA. |
It might give you some info, but I don’t guarantee anything. To tell you the truth, I’m not familiar at all with MILC code. However, I contributed to gauge tools... What are you using exactly of gauge tools? |
In the MILC code,the readme file in the application ks_imp_rhmc said that measurements include plaquette...,so I add the flag to the configure file.But I'm not sure whether it will work with the executable ks_imp_rhmc generate. |
Which version of MILC and QUDA do you use? Can you share your MILC Makefile and QUDA make.inc ? |
The NUMA affinity message is from QUDA. It is a known issue that might affect performance on some systems. |
I use the latest version,version 7.7.11 of MILC and version 0.7.1 of QUDA. |
Thanks. I will try to have a look later. Did you try to run on of the test input files in the ks_imp_rhmc/test directory? Which binary exactly did you use in the ks_imp_rhmc directory? su3_rhmc_hisq ? |
From my first look in your Makefile: You only offload the gauge force to the GPU. Everything else is kept on the CPU. So that should explain why your GPU is idle most of the time. I assume the code is running on the GPU, it is just only the gauge force. Things you can check to verify it is running on the GPU:
You might want to try to put the inversions also on the GPU by using If you still have troubles feel free to share your output file. To reduce its length you can change line 216 in the MILC Makefile to
|
Yes,I've tried to run the test input file in the ks_imp_rhmc/test directory before,and I used the executable su3_rhmd_hisq with double precision.And then I called nvidia-smi and it showed the GPU information above. So should I put the inversions for all the QUDA Options or just change this option? |
Well, your nvidia-smi output shows that the GPU is used. But with only the gauge force on the GPU the utilization is probably pretty low. That is what you see. If you want to put the inversion on the GPU the |
For the ks_imp_rhmc applications, you will need the full suite of HISQ Perhaps the following example Makefile for ks_imp_rhmc would help http://www.physics.utah.edu/~detar/milc/Makefile-Drink7 This is for a somewhat later version of the MILC code than 7.7.11, but On 7/10/2015 8:48 AM, Drink7 wrote:
Carleton DeTar |
OK. I'll try to set those option to true and see how they change the performance about GPU. Thank you very much for your help! |
Are you part of the student cluster competition? If so, you should use the MILC tar all prepared for the competition. Sent from my iPad On Jul 11, 2015, at 3:35 AM, Drink7 <notifications@github.commailto:notifications@github.com> wrote: OK. I'll try to set those option to true and see how they change the performance about GPU. Thank you very much for your help. — |
Yes,I meet some problem when building QUDA with MILC and trying to ask for help. |
If this is part of the student cluster competition I would prefer to take the further support away from the QUDA bug tracker. I think QUDA performs as expected. @stevengottlieb , @detar : Do you provide the support the student cluster competition? |
Could you please introduce yourself? Are you part of the student cluster competition? On 7/11/2015 8:35 AM, Drink7 wrote:
|
Please use the google group set up for the student cluster competition, Read the instructions on the competition webpage for MILC that were I will no longer respond to github posts on this issue and will On Sat, 2015-07-11 at 14:35 +0000, Drink7 wrote:
|
I should use the google group to ask for help,not the github here. |
Thanks for moving that to the right place. @stevengottlieb , @detar : If anything comes up during the cluster competition that is QUDA related please feed it back here. Also If you want some of QUDA developers to sometimes have a look into the issues popping up in the student competition let us know. |
I have some problem about the quda with MILC.
I'm using the application in the MILC called ks_imp_rhmc,and the Makefile setting is below.
//compiler
CC = mpicc
//Linker flag
LD=mpicxx
//QUDA option
WANTQUDA = true
WANT_GF_GPU = true
Other QUDA options are commented.
But I still cannot run on the GPU,an executable is generated but it seems that it doesn't run on the GPU.
I don't know some warning message mean,I see this two warning message in the log.
WARNING: Failed to determine NUMA affinity for device 0 (possibly not applicable)
WARNING: Cache file not found. All kernels will be re-tuned (if tuning is enabled).
Can someone help me ? Thanks.
(My quda version is 0.7.1)
The text was updated successfully, but these errors were encountered: