
Conversation

@eggerdj (Contributor) commented May 12, 2021

Summary

This PR implements a

  • SVD node and an IQ averaging node for the data processor,
  • a data processor training mechanism,
  • run-time options that can be given to a data processor, and
  • an error propagation mechanism.

Details and comments

The SVD allows the user to project the IQ data onto its main axis. This axis must therefore be determined before the SVD can be used. Moreover, this training is not done on a single datum but on either all of the data or a subset of it (the data-processing calibration data).

DataProcessors can now be trained one node at a time using the train method. Each node has an is_trained property, which defaults to True. Nodes that require training must override this property.
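The training flow described above could be sketched roughly as follows. Everything here is illustrative: MainAxisProjection is a hypothetical stand-in for the SVD node, and the base class is stripped down to the bare minimum rather than the actual DataAction implementation.

```python
from typing import Any, Optional

import numpy as np


class DataAction:
    """Minimal stand-in for the data-processor node base class."""

    @property
    def is_trained(self) -> bool:
        # Nodes are trained by default; trainable nodes override this.
        return True

    def train(self, data: Any) -> None:
        # Most nodes need no training, so this is a no-op by default.
        pass

    def __call__(self, datum: Any) -> Any:
        raise NotImplementedError


class MainAxisProjection(DataAction):
    """Hypothetical SVD-like node: projects 2D points onto a learned axis."""

    def __init__(self) -> None:
        self._axis: Optional[np.ndarray] = None

    @property
    def is_trained(self) -> bool:
        # Trained once the main axis has been determined.
        return self._axis is not None

    def train(self, data: Any) -> None:
        # Determine the main axis from many (I, Q) points at once:
        # the first right-singular vector of the centered data.
        points = np.asarray(data, dtype=float)
        centered = points - points.mean(axis=0)
        _, _, vt = np.linalg.svd(centered, full_matrices=False)
        self._axis = vt[0]

    def __call__(self, datum: Any) -> float:
        if not self.is_trained:
            raise RuntimeError("Node must be trained before processing data.")
        return float(np.dot(np.asarray(datum, dtype=float), self._axis))
```

Note that train consumes many data points while __call__ still acts on a single datum, matching the split described above.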

The error propagation and run-time options are implemented by changing the signature of DataAction calls from

def __call__(self, data: Any) -> Any:

to

def __call__(self, data: Any, error: Optional[Any] = None, **options) -> Tuple[Any, Any]:

and making the required changes in DataProcessor.
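As a rough illustration of how a processor could thread (data, error) pairs through nodes with this new signature, here is a minimal sketch. The Scale and Square nodes and the run_chain helper are hypothetical, not part of the actual DataProcessor API; the error propagation follows standard first-order rules.

```python
from typing import Any, Optional, Tuple


class Scale:
    """Multiply the datum by a constant; the error scales by the same factor."""

    def __init__(self, factor: float) -> None:
        self.factor = factor

    def __call__(self, data: Any, error: Optional[Any] = None, **options) -> Tuple[Any, Any]:
        new_error = None if error is None else abs(self.factor) * error
        return self.factor * data, new_error


class Square:
    """Square the datum; first-order propagation gives sigma_y = |2x| * sigma_x."""

    def __call__(self, data: Any, error: Optional[Any] = None, **options) -> Tuple[Any, Any]:
        new_error = None if error is None else abs(2 * data) * error
        return data ** 2, new_error


def run_chain(nodes, datum, error=None, **options):
    """Sketch of how a processor could thread (data, error) through its nodes."""
    for node in nodes:
        datum, error = node(datum, error, **options)
    return datum, error
```

For example, run_chain([Scale(3.0), Square()], 2.0, 0.1) scales 2.0 to 6.0 with error 0.3, then squares it to 36.0 with error |2 * 6.0| * 0.3 = 3.6.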

TODO:

  • DataProcessor training mechanism
  • IQ averaging node
  • Run-time options
  • Error propagation for SVD

* Added tests.
* Added train method to DataAction.
@eggerdj eggerdj mentioned this pull request May 12, 2021
@eggerdj (Contributor, Author) commented May 12, 2021

This is needed for #31

@eggerdj eggerdj mentioned this pull request May 13, 2021
eggerdj added 7 commits May 13, 2021 21:06
* Added SVD training test.
* Added required dimension to SVD.
* Added error propagation mechanism to the data processor.
* Added optional outcome to Probability.
* Added AverageIDData Node.
* Added averaging and SVD test.
@blakejohnson (Contributor) commented

Can we have an option to "self-train" this from the data source? This is often good enough if your data set has a reasonable mixture of 0s and 1s.

@eggerdj (Contributor, Author) commented May 16, 2021

In #22 we decided to have the data processor apply to an individual datum, i.e. processor(experiment_data.data(index)). It therefore only sees one point of the data at a time. Self-fitting is thus done in the analysis class, see e.g. #46. This also gives us the possibility to train on either all or part of the data. For example, an experiment might include a few ground-state and excited-state calibration circuits (which we can single out for training) before measuring the actual quantity of interest. PR #48 introduces a mechanism that allows nodes and data processors to self-train on the data they are given.

@eggerdj eggerdj changed the title [WIP] SVD data processing SVD data processing May 16, 2021
@nkanazawa1989 (Collaborator) left a comment

Overall this looks good.

@chriseclectic (Collaborator) left a comment

I'm not sure about the changes to the base class here which seem to be very tied to this specific SVD function. Training doesn't seem like it is a general enough property for the base class.

Also do we need to allow runtime options for these objects, or can we take it in a direction of specifically setting options for node and data processors at initialization or using set_options? Passing generic kwargs through the entire processing chain might lead to some issues later on similar to the current issues in experiment classes with the various layers of options.

@eggerdj (Contributor, Author) commented May 18, 2021

Here is some rationale for the current implementation. Consider the case of spectroscopy. When running spectroscopy you have no pi-pulse and cannot run any calibrations. You therefore need to train your SVD (or whatever projection method you use) on the data you measure in the spectroscopy experiment itself, which means the data processor needs to self-train on the data it is given. This would look like this:

import numpy as np

processor = DataProcessor("memory", [SVD()])
processor.train(experiment_data.data())

y_sigmas = np.array([processor(datum) for datum in experiment_data.data()])

@nkanazawa1989 (Collaborator) left a comment

LGTM

@chriseclectic (Collaborator) left a comment

Looks better with the subclasses. One question: does is_trained need to be an abstract method? Or could the object just have an attribute like self._trained = False that this property returns, with subclasses setting it to True as part of their implementation of the abstract train method?
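The attribute-backed pattern suggested here could look roughly like this. TrainableDataAction mirrors the name introduced later in this PR, but the body and the MeanOffset example node are purely illustrative.

```python
from abc import ABC, abstractmethod
from typing import Any


class TrainableDataAction(ABC):
    """Sketch of the attribute-based alternative: is_trained is a plain
    property backed by self._trained, and subclasses flip the flag in train."""

    def __init__(self) -> None:
        self._trained = False

    @property
    def is_trained(self) -> bool:
        return self._trained

    @abstractmethod
    def train(self, data: Any) -> None:
        """Subclasses must set self._trained = True once training is done."""


class MeanOffset(TrainableDataAction):
    """Hypothetical node: learns a mean offset and subtracts it."""

    def __init__(self) -> None:
        super().__init__()
        self._offset = 0.0

    def train(self, data) -> None:
        self._offset = sum(data) / len(data)
        self._trained = True  # the part a subclass author could forget

    def __call__(self, datum: float) -> float:
        return datum - self._offset
```

The trade-off is exactly the one raised in the next comment: the attribute is less boilerplate, but nothing forces a subclass to actually set the flag.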

@eggerdj (Contributor, Author) commented May 19, 2021

One advantage of the abstract property is that it forces whoever implements the node to think carefully about what it means for the node to be trained.

@eggerdj eggerdj merged commit fbe19dd into qiskit-community:main May 19, 2021
@eggerdj eggerdj deleted the svd-processing-node branch May 19, 2021 16:55
@coruscating coruscating added this to the Release 0.1 milestone Jun 16, 2021
paco-ri pushed a commit to paco-ri/qiskit-experiments that referenced this pull request Jul 11, 2022
* * First draft of SVD node.
* Added tests.
* Added train method to DataAction.

* * Added node and data processor training functionality.

* * Fixed bug in _call_internal.
* Added SVD training test.
* Added required dimension to SVD.

* * Added run-time options to data processor
* Added error propagation mechanism to the data processor.
* Added optional outcome to Probability.
* Added AverageIDData Node.
* Added averaging and SVD test.

* * Added error propagation for SVD node and tests.
* Black and lint.

* * Added call to super().setUp()

* * Removed redundant setUp

* * Made the averaging node independent of IQData.

* * Docstring.

* * Black

* * Lint

* * Black.

* * Fix docstring.

* Update qiskit_experiments/data_processing/data_processor.py

Co-authored-by: Naoki Kanazawa <nkanazawa1989@gmail.com>

* * Made means a function to improve readability.

* * Removed RealAvg and ImagAvg.

* * Added TrainableDataAction as a subclass of DataAction.

* * Removed _required_dimension.

* * Removed **options from data processing.

* * Used the property

* * Removed raises in SVD.

* * Made scale default to 1.0

Co-authored-by: Naoki Kanazawa <nkanazawa1989@gmail.com>