Add experimental support of cuQuantum #1400

doichanj · 2021-12-13T09:36:48Z

Summary

This PR adds experimental support for NVIDIA's cuQuantum Beta 2 (version 0.1.0).

Details and comments

cuStateVec APIs can be used in place of Aer's own GPU implementation by setting an option at runtime (see CONTRIBUTING.md for details). cuStateVec support is compiled in when Aer is built with CUSTATEVEC_ROOT set to the path of the cuQuantum installation.
With cuStateVec, simulation is about 2x faster for large circuits (more than 22 qubits), but Aer's implementation is still faster for smaller qubit counts.
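
As a rough illustration of the trade-off described above, the sketch below builds run options that request cuStateVec only for circuits above the 22-qubit mark. The helper name `gpu_run_options` and the threshold constant are hypothetical; the option names `device` and `cuStateVec_enable` are the ones discussed in this PR, and the threshold should be re-measured on your own hardware.

```python
# Illustrative helper, not part of Aer. The 22-qubit threshold is taken from
# the PR description and is an assumption, not a tuned value.
CUSTATEVEC_QUBIT_THRESHOLD = 22


def gpu_run_options(num_qubits, threshold=CUSTATEVEC_QUBIT_THRESHOLD):
    """Build keyword options for a GPU statevector run.

    cuStateVec is requested only for circuits with more than `threshold` qubits.
    """
    return {
        "method": "statevector",
        "device": "GPU",
        "cuStateVec_enable": num_qubits > threshold,
    }


if __name__ == "__main__":
    for n in (10, 22, 30):
        print(n, gpu_run_options(n))
    # With a GPU build of Aer compiled against cuQuantum, the options could
    # then be passed along, for example:
    #   backend = AerSimulator()
    #   result = backend.run(circuit, **gpu_run_options(circuit.num_qubits)).result()
```

The helper only builds a dictionary, so it can be checked without a GPU; whether the options take effect depends on how Aer was built.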

Since cuQuantum is still in beta, there are some limitations:

cuStateVec is not thread safe, so multi-chunk parallelization (cache blocking) is done by a single thread (slow)
Multi-shot parallelization is disabled (single thread, slow)
Multi-shot batched optimization is not supported for cuStateVec

chriseclectic · 2021-12-14T05:10:48Z

I added the on-hold label until the 0.10 release is out.

CONTRIBUTING.md

qiskit/providers/aer/backends/aer_simulator.py

hhorii

I think a new namespace under AER::QV is necessary to distinguish chunk-based code from the rest. For example, GateFuncBase sounds too general; it should be in AER::QV::CHUNK or something similar.

src/controllers/aer_controller.hpp

src/simulators/density_matrix/densitymatrix_state.hpp

src/simulators/state.hpp

src/simulators/statevector/chunk/chunk_container.hpp

src/simulators/statevector/chunk/chunk_manager.hpp

hhorii · 2022-02-01T05:01:23Z

@doichanj a release note is necessary.

jakelishman · 2022-02-02T15:41:47Z

This probably should also wait for Aer 0.11 - it's a big new feature, and patch releases are usually for bugfixes.

hhorii

I have now found that OpenMP does not work well on any device. Let me investigate whether this phenomenon is specific to my configuration or common.

hhorii · 2022-02-03T15:43:57Z

src/controllers/aer_controller.hpp

+  if(cuStateVec_enable_){
+    enable_batch_multi_shots_ = false;    //cuStateVec does not support batch execution of multi-shots
+    parallel_shots_ = 1;    //cuStateVec is currently not thread safe
+    return;


If cuStateVec_enable=True is configured in AerSimulator.run(), parallel_state_update_ is not set. This will cause a performance regression if an application accidentally sets cuStateVec_enable with device='CPU'.
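
The concern above can be sketched in Python as follows: the single-thread restrictions apply only when cuStateVec is enabled together with a GPU device, and every other combination falls through to the ordinary setup. `ParallelSettings`, `configure_parallelism` and the thread-splitting rule are hypothetical stand-ins for illustration, not Aer's controller code.

```python
from dataclasses import dataclass


@dataclass
class ParallelSettings:
    """Hypothetical stand-in for the controller's parallelization fields."""
    enable_batch_multi_shots: bool
    parallel_shots: int
    parallel_state_update: int


def configure_parallelism(device, custatevec_enable, max_threads, shots):
    """Pick parallel settings; restrict them only for cuStateVec on a GPU."""
    if custatevec_enable and device == "GPU":
        # cuStateVec has no batched multi-shot execution and is not thread
        # safe, so everything runs on a single host thread.
        return ParallelSettings(False, 1, 1)

    # Any other combination (including cuStateVec_enable set by accident with
    # device='CPU') falls through to the ordinary setup. The split below is a
    # deliberately simple placeholder, not Aer's real heuristic.
    parallel_shots = max(1, min(shots, max_threads))
    parallel_state_update = max(1, max_threads // parallel_shots)
    return ParallelSettings(device == "GPU", parallel_shots, parallel_state_update)
```

With this guard, a CPU run that carries a stray cuStateVec_enable=True still gets its state-update threads assigned.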

leofang · Feb 3, 2022

Question: when enable_batch_multi_shots_=true, would you create nShots copies of the statevector for parallelization? If so, and IIUC, I think a proper "workaround" is to create multiple cuStateVec handles (or just retain and reuse a pool of handles created at init time to reduce overhead) and use them in parallel.

IMHO, though, it's more than a "workaround": even after we fix the thread safety issue, it is generally still challenging for library handles to be shared by multiple host threads. For example, although cuBLAS supports this usage pattern, its documentation explicitly recommends against it. This is why the handle pool approach is commonly seen in ML/DL frameworks.
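
A minimal sketch of the handle-pool idea described above, written in plain Python with no GPU calls. `FakeHandle` is a hypothetical stand-in for a library handle that must not be used by two threads at once; `HandlePool` and `run_shots` are likewise illustrative names, not cuStateVec or Aer APIs.

```python
import queue
import threading
from concurrent.futures import ThreadPoolExecutor


class FakeHandle:
    """Hypothetical stand-in for a library handle that must not be shared."""

    def __init__(self, ident):
        self.ident = ident
        self._busy = threading.Lock()
        self.uses = 0

    def apply(self, shot):
        # A real handle would launch GPU work here. The non-blocking acquire
        # fails loudly if two threads ever use the same handle at once.
        if not self._busy.acquire(blocking=False):
            raise RuntimeError("handle used by two threads at once")
        try:
            self.uses += 1
            return (self.ident, shot)
        finally:
            self._busy.release()


class HandlePool:
    """Create a fixed set of handles up front and lend each to one thread at a time."""

    def __init__(self, size):
        self._free = queue.Queue()
        for i in range(size):
            self._free.put(FakeHandle(i))

    def run(self, shot):
        handle = self._free.get()   # blocks until a handle is free
        try:
            return handle.apply(shot)
        finally:
            self._free.put(handle)  # return it for reuse


def run_shots(num_shots, pool_size, num_threads):
    """Run every shot on a worker thread, borrowing a handle for each one."""
    pool = HandlePool(pool_size)
    with ThreadPoolExecutor(max_workers=num_threads) as executor:
        return list(executor.map(pool.run, range(num_shots)))
```

Creating the handles once and recycling them through a queue keeps the creation cost out of the per-shot path, and the queue guarantees that a handle is held by only one thread at any time even when there are more threads than handles.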

doichanj · Feb 4, 2022

enable_batch_multi_shots_=true is not currently applicable to cuStateVec. In that mode, multiple state vectors are calculated in a single CUDA kernel and each state vector refers to classical registers to handle branch operations, which is not implemented in cuStateVec.
Multiple cuStateVec handles are required when enable_batch_multi_shots_=false and shot-level parallelization is required. In this case, state vectors are calculated independently using OpenMP threads. (Currently cuStateVec is not thread safe, so we disable OpenMP parallelization.)

leofang · Feb 4, 2022

Thanks for the explanation, @doichanj. I understand better now. So once we fix thread safety, we can unblock you for shot-level parallelization.

hhorii · 2022-02-15T13:51:04Z

test/terra/backends/simulator_test_case.py

+            #'GPU_cuStateVec' is used only inside tests not available in Aer
+            #and this is converted to "device='GPU'" and option "cuStateVec_enalbe = True" is added
+            if cuStateVec:
+                data_args.append((method, 'GPU_cuStateVec'))


@chriseclectic could you review this change? This is a hack to minimize the test changes needed to cover the cuStateVec option. cuStateVec is an option only for device=GPU. The current decorator supported_methods() requires tests to take two arguments, method and device. Adding a new cuStateVec_enable option to all the tests is not productive, I believe.
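
The approach described above can be sketched as follows: tests keep their two-argument (method, device) signature, and a test-only device label is translated into real options before the backend is built. The function names `expand_test_device` and `build_data_args` are hypothetical and only mirror the idea in the quoted diff; they are not the code in simulator_test_case.py.

```python
def expand_test_device(method, device):
    """Translate a test-only device label into real simulator options."""
    options = {"method": method, "device": device}
    if device == "GPU_cuStateVec":
        # Test-only label: run on the GPU with the cuStateVec option switched on.
        options["device"] = "GPU"
        options["cuStateVec_enable"] = True
    return options


def build_data_args(methods, gpu_available, custatevec_available):
    """List the (method, device) pairs that a test should be run with."""
    data_args = []
    for method in methods:
        data_args.append((method, "CPU"))
        if gpu_available:
            data_args.append((method, "GPU"))
            if custatevec_available:
                data_args.append((method, "GPU_cuStateVec"))
    return data_args
```

Because the extra case is encoded in the device string, existing tests pick it up without any change to their signatures.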

hhorii · 2022-02-28T12:39:27Z

I confirmed that this PR does not introduce any regressions.

doichanj added 2 commits December 13, 2021 18:27

add cuStateVec support

309c73d

Merge remote-tracking branch 'upstream/main' into cuStatevec

54dc128

doichanj requested review from chriseclectic, hhorii, mtreinish and vvilpas as code owners December 13, 2021 09:36

delete space

a5bc75e

chriseclectic added the on hold Can not fix yet label Dec 13, 2021

chriseclectic removed the on hold Can not fix yet label Dec 14, 2021

chriseclectic and others added 7 commits December 14, 2021 17:04

Merge branch 'main' into cuStatevec

b1bd96e

disable batched shots optimization for cuStateVec

a40898c

Merge branch 'cuStatevec' of github.com:doichanj/qiskit-aer into cuStatevec

adfc125

Fix cuStateVec test fails

26c4538

Fix qasm_simulator.py

87afff5

update for the latest cuQuantum / added diagonal matrix

f16a35c

resolved conflict

5533b76

chriseclectic assigned hhorii Jan 4, 2022

doichanj added 11 commits January 18, 2022 14:47

add more cuStateVec support / refactor qubitvector_thrust and chunk_container

0c10325

Merge remote-tracking branch 'upstream/main' into cuStatevec

181eb2c

Merge branch 'main' into cuStatevec

54d1a68

Merge branch 'cuStatevec' of github.com:doichanj/qiskit-aer into cuStatevec

4d502ed

Fix norm() for Thrust CPU

eba2594

change cuStateVec from device to option

5a93807

Fix unchanged device=cuStateVec

983773b

Add build option to link cuStateVec statically

5bea04d

removed whitespace

1fb5031

Merge remote-tracking branch 'upstream/main' into cuStatevec

1d01542

Merge branch 'main' into cuStatevec

da0f42d

hhorii reviewed Jan 31, 2022

CONTRIBUTING.md (resolved)

hhorii reviewed Jan 31, 2022

qiskit/providers/aer/backends/aer_simulator.py (outdated, resolved)

hhorii reviewed Jan 31, 2022

doichanj added 2 commits February 1, 2022 17:33

reflecting review comments

c781208

added release note

0f4a93e

hhorii approved these changes Feb 1, 2022

hhorii added this to the Aer 0.10.3 milestone Feb 2, 2022

hhorii modified the milestones: Aer 0.10.3, Aer 0.11.0 Feb 2, 2022

hhorii requested changes Feb 3, 2022

doichanj added 4 commits February 3, 2022 17:07

set cuStateVec_enable to False as default, added test cases for cuStateVec

c509131

Merge remote-tracking branch 'upstream/main' into cuStatevec

5458b7c

Merge branch 'main' into cuStatevec

61083cb

Merge branch 'cuStatevec' of github.com:doichanj/qiskit-aer into cuStatevec

046036d

hhorii reviewed Feb 3, 2022

doichanj added 5 commits February 4, 2022 18:33

Fix omp setting for non-GPU / Fix omp nested loops

3a31cef

Merge branch 'main' into cuStatevec

de4c978

Implemented optimized rotation gates

88d7d95

Merge branch 'cuStatevec' of github.com:doichanj/qiskit-aer into cuStatevec

3ffabcf

Merge branch 'main' into cuStatevec

7cf50ee

hhorii reviewed Feb 15, 2022

chriseclectic self-assigned this Feb 15, 2022

Merge branch 'main' into cuStatevec

879a4ac

hhorii approved these changes Feb 24, 2022

hhorii merged commit db91e7d into Qiskit:main Mar 1, 2022

hhorii mentioned this pull request Mar 29, 2022

Release 0.10.4 #1481

Merged

1 task

doichanj commented Dec 13, 2021 (edited)

chriseclectic commented Dec 14, 2021

hhorii left a comment

hhorii commented Feb 1, 2022

jakelishman commented Feb 2, 2022

hhorii left a comment

hhorii Feb 3, 2022

leofang Feb 3, 2022

doichanj Feb 4, 2022

leofang Feb 4, 2022

hhorii Feb 15, 2022

hhorii commented Feb 28, 2022
