[UnitaryHack] Make ansatz class a training hyperparameter #101

Closed
wants to merge 25 commits

Conversation

@neiljdo (Collaborator) commented Jun 10, 2023

This PR adds the ansatz class and the arguments necessary for its instantiation (e.g. ob_map and ansatz-specific kwargs) as hyperparameters of the trainer class. The functions for creating and loading checkpoints have also been updated to accommodate the new ansatz class hyperparameter.

Addresses #86.
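
As a rough illustration of the intended usage (the names ansatz_cls and ansatz_kwargs below are placeholders, not necessarily the PR's final interface):

```python
from lambeq import AtomicType, IQPAnsatz

# Placeholder names (ansatz_cls, ob_map, ansatz_kwargs): a sketch of how the
# ansatz could be passed to the trainer as a hyperparameter instead of being
# applied to the diagrams beforehand. Not the PR's final interface.
N, S = AtomicType.NOUN, AtomicType.SENTENCE

trainer_hyperparams = {
    'ansatz_cls': IQPAnsatz,           # which ansatz class to instantiate
    'ob_map': {N: 1, S: 1},            # object-to-qubit mapping
    'ansatz_kwargs': {'n_layers': 2},  # ansatz-specific keyword arguments
}

# The trainer would re-create the ansatz internally from these components:
ansatz = trainer_hyperparams['ansatz_cls'](
    trainer_hyperparams['ob_map'], **trainer_hyperparams['ansatz_kwargs'])
```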

@ianyfan (Collaborator) commented Jun 12, 2023

Hi Neil, thanks for your submission, it looks quite good already. I've enabled the tests to run on this PR, so would you be able to have a look at where they're failing and see if you could fix them up?

@neiljdo (Collaborator, Author) commented Jun 12, 2023

Hi @ianyfan, thank you for the review. I just pushed a new commit to address the flake8 violations. I'll wait for the workflow run results to see if I missed some.

@neiljdo (Collaborator, Author) commented Jun 12, 2023

@ianyfan just saw the failing jobs related to the notebook tests and type checking - will push a fix to those soon.

@dimkart (Contributor) commented Jun 12, 2023

> @ianyfan just saw the failing jobs related to the notebook tests and type checking - will push a fix to those soon.

@neiljdo Hi, it would be great if you could do that by tomorrow, 13/6, which is the last day of the hackathon.

@nathanshammah commented

@dimkart @neiljdo PRs can also be reviewed shortly after June 13th and still be awarded a bounty in uHACK if accepted.

@dimkart (Contributor) commented Jun 13, 2023

@nathanshammah

> PRs can also be reviewed shortly after June 13th and still be awarded a bounty in uHACK if accepted.

Thanks, noted.

@nikhilkhatri (Collaborator) left a comment

Hi @neiljdo,
Thanks for this PR!
It looks good in general, though I have some comments:

  1. The ansatz is applied to the diagrams at each epoch. This can slow down training a lot, especially for large datasets / diagrams. Instead, it would be better to save the train and val circuits as properties of the Trainer object during init and checkpoint-load (see the sketch at the end of this comment).
  2. It would be good to add tests / manually test that all the ansatze are serialisable through pickle.
  3. It would be much better if the model did not take the test set as input during training, for a clear separation. This would be particularly natural once the UNK feature is completed. (UnitaryHACK: Replace unknown words in diagrams with UNK token #84)
  4. Currently, the ansatz is a property of the Trainer, though it makes conceptual sense for this to be a property of the Model. In this case, it would be possible to make the forward and get_diagram_output functions take a diagram as input, and apply the ansatz internally.

Since this is already a complex task and there are some design decisions involved, it would be great if you could address points 1 and 2 so the issue can be assigned to you for the hack.

The remaining points can be discussed later.
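
A minimal sketch of the caching suggested in point 1, using hypothetical attribute and parameter names (ansatz_cls, train_circuits, val_circuits) rather than the PR's actual code:

```python
class Trainer:
    # Sketch of point 1 only: attribute and parameter names here are
    # hypothetical, not the code in this PR.
    def __init__(self, ansatz_cls, ob_map, ansatz_kwargs,
                 train_diagrams, val_diagrams):
        self.ansatz = ansatz_cls(ob_map, **ansatz_kwargs)
        # Convert diagrams to circuits once, instead of once per epoch.
        self.train_circuits = [self.ansatz(d) for d in train_diagrams]
        self.val_circuits = [self.ansatz(d) for d in val_diagrams]

    def training_step(self, batch_indices):
        # Reuse the cached circuits; no per-epoch diagram-to-circuit conversion.
        batch = [self.train_circuits[i] for i in batch_indices]
        ...
```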

@nikhilkhatri (Collaborator) commented

It would also be great if you could run the clean_notebooks.py script, which removes some metadata from the examples and tutorials.

@neiljdo (Collaborator, Author) commented Jun 13, 2023

@nikhilkhatri,
Thank you for the review. Here are my thoughts on your comments above.

  1. We should save the circuits in order to do the diagram-to-circuit conversion only once. I'll implement a fix for this one right away.
  2. For the ansatz serialization, I only save the components (i.e. the ansatz class, ob_map, and other ansatz-specific kwargs) required to initialize the ansatz during trainer checkpoint creation, and re-initialize the ansatz from them during checkpoint load (sketched at the end of this comment). I assumed the ansatz is deterministic and can be re-instantiated, so I went this route. I would also like to run these checks as part of the testing suite (not manually) - this implies adding new tests that use different ansatzes for the trainers.

For (3) and (4) above, while working on the issue I realized that the model is coupled to the circuits. This became more apparent while updating the demo notebooks, where the Model.from_diagram call also needs to receive the test circuits. Let's discuss these two later once I've resolved the first two points.
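
A rough sketch of the component-based serialisation described in point 2, with hypothetical checkpoint keys and helper names:

```python
# Hypothetical checkpoint keys and helper names; the names used in the PR may differ.
def save_ansatz_components(checkpoint, ansatz_cls, ob_map, ansatz_kwargs):
    # Store only what is needed to re-create the ansatz, not the instance itself.
    checkpoint['ansatz_cls'] = ansatz_cls
    checkpoint['ansatz_ob_map'] = ob_map
    checkpoint['ansatz_kwargs'] = ansatz_kwargs

def load_ansatz_components(checkpoint):
    # Re-instantiate the (deterministic) ansatz from its saved components.
    return checkpoint['ansatz_cls'](checkpoint['ansatz_ob_map'],
                                    **checkpoint['ansatz_kwargs'])
```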

@nikhilkhatri (Collaborator) commented

@neiljdo you are correct that the ansatze are deterministic. My point was to ensure that no exceptions are raised when pickling an ansatz class (for example, if it uses a lambda function).

@nikhilkhatri (Collaborator) commented

W.r.t. the lambda pickling, your code may be alright since it pickles the class, not the instance. It would still be good to make sure.
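
A quick illustration of that distinction with a toy class (not a lambeq ansatz): pickle stores a class by reference, so a lambda inside it is harmless, while pickling an instance that holds a lambda raises an error.

```python
import pickle

class ToyAnsatz:
    """Toy stand-in for an ansatz class; not a lambeq class."""
    def __init__(self, ob_map):
        self.ob_map = ob_map
        self.post_process = lambda x: x  # lambdas cannot be pickled by pickle

pickle.dumps(ToyAnsatz)  # fine: pickling the class only stores a reference to it

try:
    pickle.dumps(ToyAnsatz({'n': 1}))  # fails: the instance holds a lambda
except (pickle.PicklingError, AttributeError) as exc:
    print('cannot pickle the instance:', exc)
```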

@ACE07-Sev (Contributor) commented Jun 14, 2023

May I drop a suggestion here? Based on what I understand from the code:

  1. If the Model is loaded from a checkpoint, you just skip the _init_model_from_datasets method, but still have to recreate the circuits at training_step and validation_step (assuming point 1 is fixed).

  2. If the Model is not loaded from a checkpoint, you create the circuits both in the _init_model_from_datasets method and in training_step and validation_step.

Since the dataset method is not run every time, I think it would be better to create the circuits once, pass the list to the _init_model_from_datasets method, and also use the same list in the training and validation steps (indexed by diagram, of course).

My suggestion is, after fixing point 1, to have _init_model_from_datasets also use the reference from that fix. @nikhilkhatri dear, am I correct? Please correct me if I am wrong.

@nikhilkhatri (Collaborator) commented

Hi @ACE07-Sev,
Thanks for your comments. Indeed, the circuits should only be created once, and then used wherever the ansatz is currently applied. The circuits need to be created wherever the dataset and ansatz parameters are first available. If the dataset is taken as input first during fit, then this is where the ansatz can be applied, and circuits from this can be reused throughout the setup, training and validation.

This allows for direct serialization of the ansatzes.

@neiljdo (Collaborator, Author) commented Jun 14, 2023

Hi @nikhilkhatri,

I've addressed points (1) and (2) from your original comment in my latest update. Specifically,

  1. Circuits are now generated once at the start of Trainer.fit() execution, and are also included in the checkpoint.
  2. I've used dill as a drop-in replacement for pickle for checkpoint generation, as it handles lambda functions well (see the sketch at the end of this comment). I've added tests for all available ansatzes to make sure that each still generates the correct output after a serialization-deserialization round trip. With this, we can simplify the ansatz checkpoint creation, i.e. there is no need to re-instantiate it.

Let me know if there are still issues, thank you very much!

EDIT: I ran the clean_notebooks.py script but it did not update the notebooks.
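
A small sketch of the pickle-to-dill swap mentioned in point 2, using a toy object rather than the actual checkpoint contents:

```python
import pickle

import dill

# Toy stand-in for checkpoint contents that may contain a lambda.
checkpoint_like = {'post_process': lambda x: 2 * x}

try:
    pickle.dumps(checkpoint_like)   # the standard library cannot pickle lambdas
except (pickle.PicklingError, AttributeError) as exc:
    print('pickle failed:', exc)

data = dill.dumps(checkpoint_like)  # dill serialises the lambda by value
restored = dill.loads(data)
assert restored['post_process'](3) == 6
```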

@nikhilkhatri (Collaborator) commented

Hi @neiljdo
Thanks a lot for these changes!

I have a couple of comments:

  1. Saving all the circuits will create a very large checkpoint file, depending on the dataset size and circuit complexity. I don't think this is required. If a trainer is loaded from a checkpoint, the user will likely call fit, at which point they can provide the necessary circuits.
  2. The tests seem to be failing because dill hasn't been added to the requirements.
  3. (Minor) The tests for pickling the ansatze are very verbose. Using something like this can help here (one option is sketched at the end of this comment).

It would be great if you could address issues 1 and 2 for the hack.
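
The example originally linked in point 3 is not preserved in this thread; one common way to condense such tests is pytest parametrisation, sketched below with illustrative constructor arguments:

```python
import pickle

import pytest
from lambeq import AtomicType, IQPAnsatz, Sim14Ansatz, Sim15Ansatz

N, S = AtomicType.NOUN, AtomicType.SENTENCE

# One parametrised test instead of a near-identical test per ansatz class.
# The constructor arguments are kept minimal and may differ from the PR's tests.
@pytest.mark.parametrize('ansatz_cls', [IQPAnsatz, Sim14Ansatz, Sim15Ansatz])
def test_ansatz_class_roundtrips_through_pickle(ansatz_cls):
    restored = pickle.loads(pickle.dumps(ansatz_cls))
    assert restored is ansatz_cls
    # The restored class can still be instantiated as usual.
    assert isinstance(restored({N: 1, S: 1}, n_layers=1), ansatz_cls)
```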

@dimkart linked an issue on Jun 14, 2023 that may be closed by this pull request
@neiljdo (Collaborator, Author) commented Jun 14, 2023

@nikhilkhatri
Noted, thanks. I'll remove the circuits from the checkpoint.

@ACE07-Sev (Contributor) commented Jun 14, 2023

Neil dear:

lambeq/training/checkpoint.py:27: error: Cannot find implementation or library stub for module named "dill" [import]

The test with pytest passed, so I think you just need to fix this import error.
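
For reference, a typical way to resolve this, assuming dill ships without bundled type stubs, is to add it to the requirements and tell mypy to skip the missing stubs (exact file names depend on the repository's setup):

```
# requirements.txt: make sure dill is installed in the CI environment
dill

# setup.cfg or mypy.ini: if dill has no type stubs, tell mypy to ignore them
[mypy-dill.*]
ignore_missing_imports = True
```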

@dimkart (Contributor) commented Jun 14, 2023

@neiljdo Hey Neil, please leave a comment in issue #86 so you can be assigned to it (otherwise your handle cannot be found).

@dimkart (Contributor) left a comment


Thanks, good work! The issue will be assigned to you; however, the PR will be merged a little later.

@dimkart (Contributor) commented Oct 30, 2023

Moved to lambeq's private repository for further development. This feature will be included in one of the upcoming releases.

@dimkart closed this on Oct 30, 2023.
Successfully merging this pull request may close this issue: UnitaryHACK: Serialise ansatz and include in model