PyTorch-backed forward simulation #390
base: develop
Conversation
…breakpoints in debugging
…ovms, and gates (as they appear in TorchOpModel._compute_circuit_outcome_probabilities)
…ding attempts to use that class
…lso SimpleMapForwardSimulator ...) that the dict returned by circuit.expand_instruments_and_separate_povm(...) has at most one element.
…s _compute_circuit_outcome_probabilities
…ting any circuit probabilities
…ce for differentiation yet)
…delmembers (needed to construct differentiable torch tensors). Have a new torch_base property of TPState objects. Need such a property for FullTPOp objects. Unclear how to implement for povms, since right now we're bypassing the POVM abstraction and going directly into the effects abstraction of the circuit.
…n, rather than only through ConjugatedStatePOVMEffect objects associated with a SeparatePOVMCircuit
…so it allows requires_grad=True.
…vn't used it to speed up derivative computations yet.
…r before converting to a numpy array and writing to array_to_fill in TorchForwardSimulator._bulk_fill_probs_block.
pygsti/tools/basistools.py
This change is to resolve a deprecation warning.
This change is just to improve readability.
I'm giving a few quick comments here because I think this PR will actually take me some time to get through. My main concern is the choice to extend the ModelMember class as opposed to adjusting the evotypes. The general purpose of the split between modelmembers and evotypes is so that we don't have to go through and implement these abstract methods in all modelmembers; we can make the change in the evotype, and then any modelmember works.
@sserita, regarding
Unfortunately there's no way to do this just through evotypes. Using pytorch's AD capabilities requires knowing the free parameters in a modelmember and how those free parameters map to the common parameterization-agnostic representation (i.e., representations in evotypes).
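To make the discussion concrete, here is a rough sketch of the interface in question. The method names `stateless_data` and `torch_base` come from this PR, but the signatures and docstrings below are hypothetical, not the actual pyGSTi code:

```python
class Torchable:
    """Hypothetical sketch of the abstract interface this PR adds to
    ModelMember subclasses; the real signatures may differ."""

    def stateless_data(self) -> tuple:
        """Return the static, parameter-independent data (e.g., dimensions)
        needed to rebuild this member's dense representation."""
        raise NotImplementedError()

    @staticmethod
    def torch_base(sd: tuple, t_param):
        """Map a leaf tensor of free parameters (with requires_grad=True),
        plus the stateless data sd, to this member's dense representation."""
        raise NotImplementedError()
```

The key point is that `torch_base` is precisely the "free parameters to representation" map that evotypes alone cannot supply.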
pygsti/forwardsims/torchfwdsim.py
```python
# Below: variables for type annotations.
# We have to create variable aliases rather than importing the types
# directly, since importing the types would cause circular imports.
Label = TypeVar('Label')
ExplicitOpModel = TypeVar('ExplicitOpModel')
SeparatePOVMCircuit = TypeVar('SeparatePOVMCircuit')
CircuitOutcomeProbabilityArrayLayout = TypeVar('CircuitOutcomeProbabilityArrayLayout')
```
Try to use `from __future__ import annotations` to avoid the use of TypeVars.
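For illustration: with PEP 563's postponed evaluation, annotations are stored as strings and never evaluated at import time, so a forward reference to a type that would otherwise cause a circular import needs no `TypeVar` alias. (The `probs_for` function below is a made-up example; `SeparatePOVMCircuit` is never actually imported.)

```python
from __future__ import annotations

def probs_for(circuit: SeparatePOVMCircuit) -> dict:
    # The annotation above is never evaluated, so referencing the
    # undefined name SeparatePOVMCircuit is harmless.
    return {}

print(probs_for.__annotations__['circuit'])  # prints: SeparatePOVMCircuit
```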
Notes from today's meeting:
@coreyostrove, @sserita, @enielse: this is ready for review.
```python
first_basis_vec = torch.zeros(size=(1, dim), dtype=torch.double)
first_basis_vec[0,0] = dim ** 0.25
t_param_mat = t_param.reshape((num_effects - 1, dim))
t_func = first_basis_vec - t_param_mat.sum(axis=0, keepdim=True)
t = torch.row_stack((t_param_mat, t_func))
return t
```
@coreyostrove, @sserita: is the `dim ** 0.25` scale appropriate when the underlying vector space is something other than "tensor product of qubit space"?
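Whatever the right scale factor turns out to be, the construction above enforces the TP completeness constraint by design: the effect rows sum to the fixed first basis vector. A quick numeric check of that property, assuming only `torch` (the `dim` and `num_effects` values below are arbitrary):

```python
import torch

dim, num_effects = 4, 2
t_param = torch.randn((num_effects - 1) * dim, dtype=torch.double)

# Same construction as in the snippet above.
first_basis_vec = torch.zeros(size=(1, dim), dtype=torch.double)
first_basis_vec[0, 0] = dim ** 0.25
t_param_mat = t_param.reshape((num_effects - 1, dim))
t_func = first_basis_vec - t_param_mat.sum(axis=0, keepdim=True)
t = torch.row_stack((t_param_mat, t_func))

# TP constraint: the rows of t (the effects) sum to first_basis_vec,
# regardless of the free-parameter values in t_param.
assert torch.allclose(t.sum(axis=0, keepdim=True), first_basis_vec)
```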
```python
def torch_base(sd: Tuple[int], t_param: _Torchable.Tensor) -> _Torchable.Tensor:
    torch = _Torchable.torch_handle
    dim = sd[0]
    t_const = (dim ** -0.25) * torch.ones(1, dtype=torch.double)
```
@coreyostrove, @sserita: same question as above. Is the `dim ** -0.25` scale factor appropriate for general spaces?
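For comparison with the POVM case, here is a minimal sketch of how a full-TP state's dense representation could be assembled from `t_const`, assuming the fixed first entry is simply concatenated with the free parameters. The helper name `tpstate_torch_base` and the concatenation step are my illustration, not the actual pyGSTi code:

```python
import torch

def tpstate_torch_base(dim: int, t_param: torch.Tensor) -> torch.Tensor:
    # Hypothetical sketch: in a full-TP state the first superket entry is
    # fixed (here to dim ** -0.25), so only the remaining dim - 1 entries
    # are free parameters.
    t_const = (dim ** -0.25) * torch.ones(1, dtype=torch.double)
    return torch.cat([t_const, t_param])

t_param = torch.zeros(3, dtype=torch.double, requires_grad=True)
vec = tpstate_torch_base(4, t_param)
# vec has length 4, a fixed first entry, and gradients flow through vec[1:].
```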
This PR introduces `TorchForwardSimulator`, a forward simulator (for computing circuit outcome probabilities) based on PyTorch. It uses automatic differentiation to compute the Jacobian of the map from model parameters to circuit outcome probabilities. In the future we could extend it to do computations on a system's GPU, or to use PyTorch-based optimization algorithms instead of pyGSTi's custom algorithms for MLE.

## Approach
My approach required creating a new ModelMember subclass called `Torchable`. This subclass adds two required functions, called `stateless_data` and `torch_base`. Their meanings are given below:

pyGSTi/pygsti/modelmembers/torchable.py, lines 18 to 43 in 1ec6909
In principle, TorchForwardSimulator can handle any model whose parameterized ModelMembers are all Torchable. So far I've only extended TPState, FullTPOp, and TPPOVM to be Torchable; these are the classes used in "full TP" GST.
The Python file that contains TorchForwardSimulator also defines two helper classes: StatelessCircuit and StatelessModel. I think it's fine to keep these classes as purely internal implementation-specific constructs for now. Depending on future performance optimizations of TorchForwardSimulator we might want to put them elsewhere in pyGSTi.
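To illustrate the autograd mechanics this approach relies on, here is a toy stand-in for the parameters-to-probabilities map. The function body below is invented purely for illustration (the real map is assembled from the circuit's Torchable members); only the use of `torch.autograd.functional.jacobian` reflects the actual mechanism:

```python
import torch

def outcome_probs(t_param: torch.Tensor) -> torch.Tensor:
    # Toy stand-in for the map from free model parameters to the
    # outcome probabilities of a single two-outcome circuit.
    p0 = torch.sigmoid(t_param).prod()
    return torch.stack([p0, 1.0 - p0])

t_param = torch.zeros(3, dtype=torch.double)
jac = torch.autograd.functional.jacobian(outcome_probs, t_param)
print(jac.shape)  # prints: torch.Size([2, 3])
```

Autograd returns the full (num_outcomes x num_params) Jacobian without any hand-written derivative code, which is the capability TorchForwardSimulator exposes to pyGSTi.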
## What should come after this PR
We should compare performance of TorchForwardSimulator to MapForwardSimulator on problems of interest. There's a chance that the former isn't faster than the latter with the current implementation. If that's the case then I should look at possible performance optimizations specifically inside TorchForwardSimulator.
We should add implementations of `stateless_data` and `torch_base` to GST models beyond "Full TP" (in particular, I'd like to try CPTP).

## Incidental changes
My implementation originally interacted with the following evotype classes:

When I write `[_slow]` in the class names above, you can put the empty string or just `_slow`, depending on the default evotype specified in `evotypes.py`.

To my surprise, I found that interacting with evotypes was neither necessary nor sufficient for what I wanted to accomplish. So while I did make changes in `evotypes/densitymx_slow/` to remove unnecessary class inheritances and to add documentation, those changes were only to make life a little easier for future pyGSTi contributors.