
Generative Replay #931

Merged
merged 58 commits into ContinualAI:master on Apr 8, 2022

Conversation

@travela (Contributor) commented Mar 9, 2022

closes #927

This adds a GenerativeReplayPlugin and a GenerativeReplay strategy to the library, with which one can train models according to the vanilla Generative Replay algorithm. It can be applied either to a generator model alone or to a pair consisting of a classifier and a generator.

In particular, I have added two usage examples in which we train two models on the splitMNIST scenario (a minimal usage sketch follows after this list):

  • the SimpleMLP model in examples/generative_replay_splitMNIST.py and
  • a VAE model in examples/generative_replay_MNIST_generator.py
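
For orientation, here is a minimal usage sketch of the classifier path. It assumes the GenerativeReplay strategy name introduced in this PR, the existing SimpleMLP model, and the standard SplitMNIST benchmark; exact constructor arguments and import paths may differ from the merged code.

```python
# Hedged sketch: train a classifier on Split MNIST with generative replay.
# No generator_strategy is passed here; how the generator is then provided
# is discussed later in this thread.
import torch
from torch.nn import CrossEntropyLoss
from torch.optim import Adam

from avalanche.benchmarks.classic import SplitMNIST
from avalanche.models import SimpleMLP
from avalanche.training.supervised import GenerativeReplay

benchmark = SplitMNIST(n_experiences=5, seed=0)
model = SimpleMLP(num_classes=10)

strategy = GenerativeReplay(
    model,
    Adam(model.parameters(), lr=1e-3),
    criterion=CrossEntropyLoss(),
    train_mb_size=100,
    train_epochs=4,
    eval_mb_size=100,
    device=torch.device("cuda" if torch.cuda.is_available() else "cpu"),
)

for experience in benchmark.train_stream:
    strategy.train(experience)
    strategy.eval(benchmark.test_stream)
```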

Commit messages from the PR timeline include:

  • …into init of solver strategy.
  • extend GR plugin to work without generator initialization
  • clean up GR template and make it more modular; rename VAETraining
  • VAETraining can now be trained alone, with or without GR, simply by adding the plugin.
@AntonioCarta (Collaborator)

Hey @travela thanks for your contribution. It looks solid in general, I just left a couple of minor comments.
Did you reproduce the results on Split MNIST? It would be ideal to have a script to add to the reproducible-cl repository, as we do for the other strategies.

@coveralls

Pull Request Test Coverage Report for Build 1958289232

  • 56 of 158 (35.44%) changed or added relevant lines in 5 files are covered.
  • 2 unchanged lines in 1 file lost coverage.
  • Overall coverage decreased (-0.5%) to 77.416%

Changes missing coverage (covered lines / changed or added lines / %):
  • avalanche/training/supervised/strategy_wrappers.py: 8 / 24 (33.33%)
  • avalanche/training/plugins/generative_replay.py: 12 / 46 (26.09%)
  • avalanche/models/generator.py: 34 / 86 (39.53%)

Files with coverage reduction (new missed lines / %):
  • avalanche/benchmarks/scenarios/generic_definitions.py: 2 (85.37%)

Totals:
  • Change from base Build 1958111169: -0.5%
  • Covered Lines: 11682
  • Relevant Lines: 15090

💛 - Coveralls

@travela (Contributor, Author) commented Mar 11, 2022

Hey @travela thanks for your contribution. It looks solid in general, I just left a couple of minor comments. Did you reproduce the results on Split MNIST? It would be ideal to have a script to add to the reproducible-cl repository, as we do for the other strategies.

Hi @AntonioCarta, great thanks! I don't seem to be able to see your comments when looking in the "Files changed" tab. Did you publish them or could it be that they are still pending?

And yes, I reproduced the results on the 10-class splitMNIST scenario in generative_replay_splitMNIST.py. I will look into the reproducible-cl repo and try to convert my example into a test script that can be added.

avalanche/models/generator.py: two review threads (outdated, resolved)
# Sample data from generator
memory = self.generator.generate(
    len(strategy.adapted_dataset) *
    (strategy.experience.current_experience)
).to(strategy.device)
Collaborator:

Can you explain why the generated data has the same length as the original data? Couldn't we generate the data on demand with a dataloader?

Contributor (Author):

Good question! There are two approaches (as mentioned here): either we generate all the data needed for the experience beforehand and then start training, or we save a copy of the "old" generator and generate the data on demand. I chose the former approach, also because this way I could stay in line with the existing Replay strategy, where we update strategy.data_loader once before each experience.

Do you think the other method would be more efficient? What would the implementation of an additional dataloader roughly look like? Since the memory variable is only used temporarily and is then passed to the ReplayDataLoader in the next step, I was hoping that we are already making use of the dataloader properties and are not clogging RAM too much (but I am not entirely sure about this).

Collaborator:

Both are reasonable solutions. If you want to use this one, you should:

  • batch the results of the generator. In general you want minibatch size < dataset size, because the entire dataset may be too large to generate in one step;
  • for the same reason, unless the dataset is small, it's better to keep the generated data on the CPU and move only the current minibatch to the GPU when needed.

If you don't do this you will get an out-of-memory error.
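
To illustrate the batching suggestion, here is a minimal sketch of a hypothetical helper (not part of this PR); it only assumes the generator.generate(n) interface used in the code excerpts of this thread.

```python
import torch

def generate_in_batches(generator, n_samples, batch_size=256):
    """Hypothetical helper: create n_samples replay examples in chunks.

    Only one chunk at a time lives on the generator's device; results are
    moved to the CPU so the full replay set never occupies GPU memory.
    """
    chunks = []
    remaining = n_samples
    while remaining > 0:
        current = min(batch_size, remaining)
        with torch.no_grad():
            samples = generator.generate(current)  # interface used in this PR
        chunks.append(samples.cpu())
        remaining -= current
    return torch.cat(chunks)
```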

Contributor (Author):

I see. How would the first point tie in with the ReplayDataLoader? I could generate the data one mini-batch at a time to make it more robust for larger datasets, but then I would still have to obtain an AvalancheDataset (i.e. concatenate them again?). This is because ReplayDataLoader expects an AvalancheDataset and not some kind of dataloader, if I understand it correctly?

Collaborator:

This is because ReplayDataLoader expects an AvalancheDataset and not some kind of Dataloader, if I understand it correctly?

Exactly. I would make another dataloader similar to ReplayDataLoader that accepts a dataset (the current data) and an iterator (the data generator).
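
As a rough illustration of that suggestion (no such class exists in Avalanche; the name and behavior below are purely hypothetical), such a loader could zip minibatches of the current experience with replay minibatches drawn lazily from a Python generator:

```python
import torch
from torch.utils.data import DataLoader

class GeneratorReplayDataLoader:
    """Hypothetical loader mixing current data with replay batches that are
    produced on demand by an iterator (e.g. sampling from the old generator)."""

    def __init__(self, dataset, replay_batches, batch_size=32, **loader_kwargs):
        self.loader = DataLoader(dataset, batch_size=batch_size, **loader_kwargs)
        self.replay_batches = replay_batches  # iterator yielding (x_replay, y_replay)

    def __len__(self):
        return len(self.loader)

    def __iter__(self):
        for (x, y), (x_r, y_r) in zip(self.loader, self.replay_batches):
            # each minibatch contains both current and generated replay samples
            yield torch.cat([x, x_r]), torch.cat([y, y_r])
```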

Contributor (Author):

I definitely like the idea of using a Python generator. But that would then correspond to the second of the two approaches I mentioned above, since when yielding the next batch we need access to the old version of the generator. I was trying to avoid having to store an additional model, and to stick to the existing ReplayDataLoader, which I understood to be designed for any "rehearsal/replay" strategy.

I am aiming to update the pull request by the beginning of next week.

Contributor (Author):

Hi! I just merged my changes (bba78b9). As mentioned in my last comment, I opted for the "on-demand" replay data generation, where we store the old generator and model and create replay data before each training iteration. This way I got around using the whole ReplayDataLoader machinery and instead simply extend the current mini-batch with newly generated replay data.

Accuracy on splitMNIST stayed the same.
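
For orientation, here is a condensed sketch of that approach for the classifier-plus-generator case, pieced together from the code excerpts quoted in this review. The class name is hypothetical, the body is illustrative rather than a copy of the merged plugin, and the SupervisedPlugin import path may differ between Avalanche versions.

```python
import copy

import torch

from avalanche.core import SupervisedPlugin

class OnDemandGenerativeReplay(SupervisedPlugin):
    """Illustrative plugin: before every training iteration, extend the
    current minibatch with data sampled from a frozen copy of the generator
    trained on the previous experiences."""

    def __init__(self, generator):
        super().__init__()
        self.generator = generator
        self.old_generator = None
        self.old_model = None

    def before_training_iteration(self, strategy, **kwargs):
        if self.old_generator is None:
            return  # first experience: nothing to replay yet
        n_replay = len(strategy.mbatch[0]) * strategy.experience.current_experience
        with torch.no_grad():
            replay_x = self.old_generator.generate(n_replay).to(strategy.device)
            # label the replayed inputs with the previous classifier's predictions
            replay_y = torch.argmax(self.old_model(replay_x), dim=-1)
        # task label 0 assumed (single-task, class-incremental benchmark)
        replay_t = torch.zeros(
            n_replay, dtype=strategy.mbatch[-1].dtype, device=strategy.device
        )
        strategy.mbatch[0] = torch.cat([strategy.mbatch[0], replay_x], dim=0)
        strategy.mbatch[1] = torch.cat([strategy.mbatch[1], replay_y], dim=0)
        strategy.mbatch[-1] = torch.cat([strategy.mbatch[-1], replay_t], dim=0)

    def after_training_exp(self, strategy, **kwargs):
        # freeze copies of the generator and the classifier for the next experience
        self.old_generator = copy.deepcopy(self.generator).eval()
        self.old_model = copy.deepcopy(strategy.model).eval()
```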

Member:

I saw your implementation and IMO it is clearer now.
Just one note:
when you create new data, you create as much replay data as the current minibatch size times the experience counter. This can result in huge minibatches (especially in benchmarks with many experiences) that can easily fill GPU memory when CUDA is used, especially with datasets whose images are bigger than MNIST's.
We should take this into consideration if we want to use this implementation as the starting point for all the generative replay strategies.

Contributor (Author):

Good point. I have actually implemented an alternative solution where we use a weighted loss to gradually reduce the importance of a new experience/class as the total number grows. As mentioned below, this solution yielded a lower accuracy (in my particular example). However, not using either of the two methods yielded much lower accuracy, suggesting that at least one of them is necessary.

When implementing the weighted loss I had to override the criteria of both the model and the generator, which maybe makes it less intuitive, but I guess proper documentation will take care of that.

I was thinking of adding the weighted loss as an option for the user. Maybe I could make it the default option and offer the increasing minibatch size as an alternative (for simple cases like MNIST)?
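
The weighted-loss variant itself isn't shown in this thread, so the following is only a guess at its general shape: a hypothetical criterion that weights the current experience's loss by 1/T (with T the number of experiences seen so far) and the replayed part by 1 - 1/T. The author's exact formulation may differ.

```python
import torch.nn as nn

class WeightedReplayCriterion(nn.Module):
    """Hypothetical weighted loss for generative replay.

    Assumes each minibatch is ordered as [current samples, replay samples]
    and that n_current and n_experiences_seen are updated by the training loop."""

    def __init__(self, base_criterion=None):
        super().__init__()
        self.base_criterion = base_criterion or nn.CrossEntropyLoss()
        self.n_experiences_seen = 1
        self.n_current = None  # number of current-experience samples in the batch

    def forward(self, predictions, targets):
        if self.n_experiences_seen <= 1 or self.n_current is None:
            return self.base_criterion(predictions, targets)
        w = 1.0 / self.n_experiences_seen
        current = self.base_criterion(
            predictions[: self.n_current], targets[: self.n_current]
        )
        replay = self.base_criterion(
            predictions[self.n_current:], targets[self.n_current:]
        )
        return w * current + (1.0 - w) * replay
```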

Collaborator:

Makes sense. Consider that the simple replay in Avalanche does not change the batch size, so it would be better to have consistent behavior here.

avalanche/models/generator.py: review thread (outdated, resolved)
* Extend current mbatch with replay data dynamically before each iteration.

* Update boolean after first experience.

* Fix mbatch[-1] extension

* Put replay_output to device.

* Resolve change requests: class names; VAELoss doc

* Documentation.
@ggraffieti (Member) left a comment

Very nice work!
I added some comments and small requests, but this seems like a great starting point!
I labeled this review as "request changes" only so we can follow the next steps in detail, but the amount of remaining work is fairly small.
As a general comment, many classes are named quite generically (Generator, VAE, VAEEncoder, ...) but lack this generality in the implementation (e.g. there is no CNN VAE). I'd suggest naming them differently (e.g. SimpleVAE or MlpVAE) so that we don't need to rename them in the future, which would break compatibility with old code.

avalanche/models/generator.py: five review threads (outdated, resolved)
avalanche/training/plugins/generative_replay.py: review thread (outdated, resolved)
or we use the strategy's model as the generator.
If the generator is None after initialization
we assume that strategy.model is the generator."""
if not self.generator_strategy:
Member:

This is clear but confusing. If I understand correctly, the plugin needs a generative strategy, which contains the generative model. If no strategy is passed, the "default" strategy is used, and the model defined for that strategy is used as the generative model. In this case, is the generative model the only model used (no classifier)?

Contributor (Author):

Yes, exactly: if no generative strategy is passed, we assume that the strategy our plugin was added to already has a generative model. For example, this allows us to easily train a generator with generative replay by simply adding the plugin; in that case there is no classifier. (Another scenario is that the classifier and generator are combined in a single model, as in this paper about generative replay with feedback connections.)

I agree that it can be confusing (a result of trying to combine all scenarios in a single plugin). I added another line to the docstring (4f5246f) referring to the example of training a generator alone, hoping this makes it clearer. Or do you have another suggestion?
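
To make the generator-only scenario concrete, here is a minimal sketch of training a VAE alone by adding the plugin, using the names introduced in this PR (VAETraining, GenerativeReplayPlugin, and the renamed MlpVAE); import paths and constructor arguments may differ from the merged code.

```python
# Hedged sketch: generative replay applied to a generator alone.
# No generator_strategy is passed to the plugin, so strategy.model
# (the VAE itself) is used as the generator.
import torch
from torch.optim import Adam

from avalanche.benchmarks.classic import SplitMNIST
from avalanche.models import MlpVAE
from avalanche.training.plugins import GenerativeReplayPlugin
from avalanche.training.supervised import VAETraining

benchmark = SplitMNIST(n_experiences=5, seed=0)
model = MlpVAE((1, 28, 28), nhid=2)

strategy = VAETraining(
    model,
    Adam(model.parameters(), lr=1e-3),
    train_mb_size=100,
    train_epochs=4,
    device=torch.device("cuda" if torch.cuda.is_available() else "cpu"),
    plugins=[GenerativeReplayPlugin()],
)

for experience in benchmark.train_stream:
    strategy.train(experience)
```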

Member:

No, it's fine; I read the code sequentially, so I didn't have a complete picture until the end.
I still believe it is a bit confusing, but I don't have a better idea at the moment.
I really like the generality of the strategy, and overall the pros of this implementation greatly outweigh a bit of confusion in this part 😉

Comment on lines 126 to 128
replay = self.old_generator.generate(
    len(strategy.mbatch[0]) * (strategy.experience.current_experience)
).to(strategy.device)
Member:

Why does the amount of generated data per minibatch increase with the number of experiences? Is this a particular implementation used in some paper?

Contributor (Author):

I actually got the inspiration for this from the intro_to_generative_replay notebook in the Avalanche colab repo. In the literature I usually see authors employ a weighted loss instead to achieve a similar effect (e.g. as described here in equation 3). It could be due to the VAE and classifier I use, but in my case the increasing amount of generated data fared a few percentage points higher in accuracy than the weighted loss approach.

avalanche/training/plugins/generative_replay.py: review thread (outdated, resolved)
Comment on lines 303 to 317
    def __init__(
        self,
        model: Module,
        optimizer: Optimizer,
        criterion=CrossEntropyLoss(),
        train_mb_size: int = 1,
        train_epochs: int = 1,
        eval_mb_size: int = None,
        device=None,
        plugins: Optional[List[SupervisedPlugin]] = None,
        evaluator: EvaluationPlugin = default_evaluator,
        eval_every=-1,
        generator_strategy: BaseTemplate = None,
        **base_kwargs
    ):
Member:

I really like the idea of having two different strategies, one for the classifier (this one) and one for the generative model (passed as an argument). This allows for a great deal of generality.
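
To make that wiring concrete, here is a rough sketch based on the signature quoted above. All names come from this PR, but the constructor arguments are illustrative, and whether the GenerativeReplayPlugin must be added to a user-supplied generator strategy explicitly depends on the final implementation.

```python
# Hedged sketch: a separate generator strategy passed into the classifier strategy.
from torch.optim import Adam

from avalanche.models import MlpVAE, SimpleMLP
from avalanche.training.plugins import GenerativeReplayPlugin
from avalanche.training.supervised import GenerativeReplay, VAETraining

# Generator strategy: trains the VAE. The plugin is added here explicitly,
# although the merged code may wire it up automatically.
vae = MlpVAE((1, 28, 28), nhid=2)
generator_strategy = VAETraining(
    vae,
    Adam(vae.parameters(), lr=1e-3),
    train_mb_size=100,
    train_epochs=4,
    plugins=[GenerativeReplayPlugin()],
)

# Classifier strategy: receives the generator strategy as an argument.
classifier = SimpleMLP(num_classes=10)
strategy = GenerativeReplay(
    classifier,
    Adam(classifier.parameters(), lr=1e-3),
    train_mb_size=100,
    train_epochs=4,
    generator_strategy=generator_strategy,
)
```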

@travela (Contributor, Author) commented Mar 27, 2022

Ciao @ggraffieti! Thanks a lot for the detailed review and the helpful suggestions. I added all requested changes just now.

@AntonioCarta (Collaborator)

@ggraffieti are there any other changes that you want on this PR?

@AntonioCarta (Collaborator)

I checked the test manually and it seems to work. I'm merging this, and we can investigate the CI problems separately.

@travela thanks for your contribution and please remember to push the reproducibility script to the reproducibility-repo.

@AntonioCarta AntonioCarta merged commit 26b5cb2 into ContinualAI:master Apr 8, 2022
Linked issue: Implementation of a Generative Replay Strategy (#927)