-
Notifications
You must be signed in to change notification settings - Fork 138
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrate rocsolver for LU and inversion #3756
Comments
Trying to get straight all the different inversion methods, options, and code paths. They get set up in SlaterDetBuilder::putDeterminant QMC_CUDA - legacy CUDA acceleration in DiracDeterminantCUDA. Rest of this applies if QMC_CUDA is not defined. input options:
The selections:
Looking at the classes to see how the inversion method fits in
Now the other non-batched choice:
Looking to the batched options:
The DET_ENGINE options;
|
Files, classes, and code involved in testing matrix inversion
|
Please ask questions and request for documentation/write up. I think we need to do better documentation. |
Is it expected that QMC_CUDA2HIP needs to be set for this to work? And not just ENABLE_ROCM? |
Do you need any bits of HIP to run a test? Assume yes to allocate memory, then set QMC_CUDA2HIP. |
Is your feature request related to a problem? Please describe.
After #3755 in DelayedUpdateCUDA.h, the LU and inversion solver is ifdef to run on the host on AMD GPU. Due to the fact that rocsolver API differs from cusolver, we will need rocSolverInverter equivalent to cuSolverInverter.
Describe the solution you'd like
Note that rocSolver buffer handling API doesn't look the same as cuSolver.
Also need a unit test to cover both cuSolverInverter and rocSolverInverter.
The text was updated successfully, but these errors were encountered: