Fixing Bug with HPO #345

Merged
merged 22 commits into from
Jul 27, 2023

Conversation

@wistuba (Contributor) commented Jul 21, 2023

Hyperparameters defined as part of the components were overwritten by values in the state_dict. To address this issue, Component is no longer an nn.Module.

Moving from torch.tensor to Python floats led to changes for CLS (and therefore Super-ER) at the 8th digit. These errors accumulate and therefore produce slightly different final numbers. To account for this, I've updated the expected values for CLS and Super-ER.
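A minimal sketch of the failure mode described above (class and attribute names are hypothetical, assuming PyTorch): when a hyperparameter is registered on an nn.Module, load_state_dict restores the checkpointed value and silently overrides the newly configured one.

```python
import torch
import torch.nn as nn

class LossComponent(nn.Module):
    """Hypothetical stand-in for a Component that stores its hyperparameter as a buffer."""
    def __init__(self, weight: float):
        super().__init__()
        self.register_buffer("weight", torch.tensor(weight))

# An HPO trial configures a new hyperparameter value...
component = LossComponent(weight=0.5)
# ...but restoring a checkpoint from an earlier run overwrites it:
old_checkpoint = LossComponent(weight=0.1).state_dict()
component.load_state_dict(old_checkpoint)
# component.weight now holds the old 0.1, not the configured 0.5
```

Keeping the hyperparameter off the nn.Module (e.g. as a plain Python float attribute) removes it from the state_dict entirely, so loading a checkpoint can no longer clobber it.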

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@wistuba wistuba marked this pull request as draft July 21, 2023 13:57
@wistuba wistuba marked this pull request as ready for review July 25, 2023 14:25
@github-actions bot commented Jul 25, 2023

Coverage report

The coverage rate went from 85.68% to 86.02% ⬆️

96.07% of new lines are covered.

Diff Coverage details

src/renate/updaters/experimental/er.py

100% of new lines are covered (92.02% of the complete file).

src/renate/updaters/learner_components/reinitialization.py

100% of new lines are covered (95.65% of the complete file).

src/renate/updaters/learner_components/losses.py

92.59% of new lines are covered (62.01% of the complete file).
Missing lines: 154, 194

src/renate/updaters/learner_components/component.py

100% of new lines are covered (95.45% of the complete file).

def on_save_checkpoint(self, checkpoint: Dict[str, Any]) -> None:
    """Add plastic and stable model to checkpoint."""
    super().on_save_checkpoint(checkpoint)
    checkpoint["component-cls-plastic-model"] = self._plastic_model
Contributor

We should save and load self._plastic_model.state_dict() instead of the model object itself.
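The suggested pattern can be sketched as follows (a toy nn.Linear stands in for the plastic model; assuming PyTorch): checkpoint only the tensors, then restore them into a freshly constructed model on load, rather than pickling the module object itself.

```python
import torch
import torch.nn as nn

plastic_model = nn.Linear(4, 2)

# Save the tensors rather than the pickled module object:
checkpoint = {"component-cls-plastic-model": plastic_model.state_dict()}

# On load, build the architecture and restore the weights into it:
restored = nn.Linear(4, 2)
restored.load_state_dict(checkpoint["component-cls-plastic-model"])
assert torch.equal(restored.weight, plastic_model.weight)
```

Saving the state_dict keeps checkpoints robust to refactorings of the module class, which pickled module objects are not.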

Contributor Author

done

@@ -5,6 +5,6 @@
   "dataset": "mnist.json",
   "backend": "local",
   "job_name": "class-incremental-mlp-cls-er",
-  "expected_accuracy_linux": [[0.9839243292808533, 0.9740450382232666]],
+  "expected_accuracy_linux": [[0.9834515452384949, 0.9740450382232666]],
Contributor

Shouldn't the results remain unaffected by the changes?

Contributor Author

See the PR description for more details. In short, working with torch.tensor(1.0) vs. float(1.0) seems to cause small numerical differences.
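A stdlib-only illustration of why this happens: torch.tensor(x) defaults to float32, while a Python float is float64, so the two representations of the same constant can already disagree around the 8th significant digit before any accumulation.

```python
import struct

def as_float32(x: float) -> float:
    """Round a Python float (float64) to float32, as torch.tensor(x) would store it."""
    return struct.unpack("f", struct.pack("f", x))[0]

w = 0.1                # float64, what the code uses after the change
w32 = as_float32(0.1)  # what a float32 tensor held before

diff = abs(w - w32)
print(f"{diff:.2e}")   # ~1.49e-09: a discrepancy at the 8th-9th significant digit of 0.1
```

Each training step mixes such slightly different constants into the loss, so the discrepancy compounds over a run and shifts the final accuracies, as seen in the updated expected values.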

@wistuba wistuba requested a review from lballes July 26, 2023 15:21
@lballes lballes merged commit 2e071d8 into dev Jul 27, 2023
18 checks passed
@lballes lballes deleted the mw-fix-component-loading branch July 27, 2023 13:46
2 participants