custom initial_state GPU #306
Conversation
Codecov Report
@@ Coverage Diff @@
## custominitop #306 +/- ##
==============================================
Coverage 100.00% 100.00%
==============================================
Files 57 57
Lines 10806 10808 +2
==============================================
+ Hits 10806 10808 +2
@stavros11 could you please have a look? It looks like the local state vectors are not recovered from the devices after applying gates; this may be due to the replacement of `.assign` with `=`.
Thanks for the fix, now it works on GPU. Regarding `.assign`: is there an important reason it should be changed to `=`? I am not sure why the tests fail with `=`, but the reason we were using `tf.Variable`s and `.assign` was that it made it easier to keep track of the device on which each part of the state is stored. The state is stored as pieces represented by `tf.Variable`s, and the pieces are transferred to each GPU for the calculation. I suspect that by changing `state.pieces[i].assign(piece)` to `state.pieces[i] = piece` after each calculation we are no longer transferring the updated piece back to the CPU; instead, `state.pieces[i]` ends up pointing at the GPU tensor, which is not what we want.
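The distinction between in-place assignment and rebinding can be sketched in plain Python, without TensorFlow. The `Variable` class below is a toy stand-in for `tf.Variable`, not qibo code:

```python
class Variable:
    """Toy stand-in for tf.Variable: a mutable holder that keeps its
    identity (and hence its device placement) across updates."""
    def __init__(self, value):
        self.value = value

    def assign(self, new_value):
        # In-place update: the holder object itself is unchanged.
        self.value = new_value
        return self

pieces = [Variable([0.0, 0.0])]  # stand-in for state.pieces on CPU
original = pieces[0]

# .assign mutates the existing variable: pieces[0] is still the same
# object, so the data stays wherever the variable was originally placed.
pieces[0].assign([1.0, 0.0])
assert pieces[0] is original

# = rebinds the list slot to a brand-new object (in qibo's case, the
# EagerTensor produced on the GPU), so the original variable is lost.
pieces[0] = [0.0, 1.0]
assert pieces[0] is not original
```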
I think the simplest solution is to leave all the `.assign` calls as they are and just pass the new initial-state op inside a `tf.Variable` when initializing `state.pieces[0]`. I checked this: all tests pass, and performance is equivalent to master for both single and multi GPU. If you like, I can push my branch with these changes here.
src/qibo/tensorflow/distutils.py
Outdated
    state.pieces[0] = op.initial_state(nqubits=state.nlocal,
                                       dtype=DTYPES.get('DTYPECPX'),
                                       is_matrix=False, omp_num_threads=get_threads())
We can leave all the `.assign` calls as they are and change this to:

    piece = op.initial_state(nqubits=state.nlocal,
                             dtype=DTYPES.get('DTYPECPX'),
                             is_matrix=False, omp_num_threads=get_threads())
    state.pieces[0] = tf.Variable(piece, dtype=piece.dtype)
This way all tests pass and performance is the same as master.
Thanks for spotting this issue. However, I had to change all the `.assign` calls to `=` because, at least on CPU, the code crashes with:

    > state.pieces[i].assign(piece)
    E AttributeError: 'tensorflow.python.framework.ops.EagerTensor' object has no attribute 'assign'

Does this happen for you too?
If I make the change above, `.assign` works on both CPU and GPU. I think changing the initialization as I wrote above, using `tf.Variable(op.initial_state(...))`, will fix this crash because all `.assign` calls will then happen on variables rather than tensors.
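A plain-Python sketch of why the wrap fixes the crash (toy stand-ins only, no TensorFlow; `initial_state` here is a hypothetical placeholder for `op.initial_state(...)`):

```python
class Variable:
    """Toy stand-in for tf.Variable: exposes the .assign method."""
    def __init__(self, value):
        self.value = value

    def assign(self, new_value):
        self.value = new_value

def initial_state():
    # Placeholder for op.initial_state(...): it returns a plain
    # tensor-like object (like an EagerTensor) with no .assign method.
    return (1.0, 0.0)

pieces = [None]

# Without the wrap, a later .assign call fails, as in the reported crash:
pieces[0] = initial_state()
try:
    pieces[0].assign((0.0, 1.0))
    crashed = False
except AttributeError:
    crashed = True
assert crashed

# Wrapping the initial piece in a Variable restores .assign everywhere:
pieces[0] = Variable(initial_state())
pieces[0].assign((0.0, 1.0))
assert pieces[0].value == (0.0, 1.0)
```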
Thanks, let me try reverting all the `.assign` calls after applying your initialization.
I am opening this PR to complement the implementation in #305 for GPU.
The code now compiles and passes several tests on GPU; however, something is wrong with some of the distributed circuit tests.