Wild Time Benchmarks and Small Memory Hack #363
Conversation
…to keep the buffer smaller
Coverage report
Diff Coverage details:
src/renate/benchmark/experiment_config.py
src/renate/benchmark/datasets/vision_datasets.py
@@ -317,6 +318,8 @@ def _get_normalize_transform(dataset_name):

def train_transform(dataset_name: str) -> Optional[Callable]:
    """Returns a transform function to be used in the training."""
    if dataset_name == "fmow":
        return FMoW.default_transform
There seems to exist a default_transform(dataset_name) in the package. Any reason to go this route?
Good point.
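For context, a minimal sketch of the reviewer's suggestion, assuming the Wild-Time data package exposes a default_transform(dataset_name) helper as described above (the import path, helper signature, and dataset-name tuple are assumptions, not verified against the package):

# Sketch only: delegate to the package-level helper instead of
# special-casing each Wild-Time dataset. The "wild_time_data" import
# path and helper signature are assumed from the comment above.
from typing import Callable, Optional

from wild_time_data import default_transform

WILD_TIME_DATASETS = ("fmow",)  # extend with the remaining Wild-Time dataset names


def train_transform(dataset_name: str) -> Optional[Callable]:
    """Returns a transform function to be used in the training."""
    if dataset_name in WILD_TIME_DATASETS:
        return default_transform(dataset_name)
    return None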
    momentum: float = 0.0,  # TODO: fix problem that occurs when removing this
) -> Callable:
    if optimizer == "AdamW":
        return partial(AdamW, lr=learning_rate, weight_decay=weight_decay)
Can this be written as:
partial(getattr(torch.optim, optimizer), lr=learning_rate, weight_decay=weight_decay)
? Why the specific handling of AdamW?
In principle there is no problem if we make it more general. However, how would that work with SGD + momentum? Right now, this logic is only triggered for AdamW; otherwise we fall back to the standard optimizers (SGD, Adam) that we have in the library.
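One way to reconcile the two comments above, as a hedged sketch rather than the merged implementation: generalize via getattr, but only forward momentum to optimizers whose signature accepts it.

# Sketch of the getattr generalization discussed above; inspecting the
# optimizer's constructor signature means momentum is only passed to
# optimizers that accept it (SGD, RMSprop, ...), not to Adam/AdamW.
import inspect
from functools import partial
from typing import Callable

import torch


def make_optimizer_fn(
    optimizer: str,
    learning_rate: float,
    weight_decay: float,
    momentum: float = 0.0,
) -> Callable:
    cls = getattr(torch.optim, optimizer)  # e.g. "AdamW" -> torch.optim.AdamW
    kwargs = {"lr": learning_rate, "weight_decay": weight_decay}
    if "momentum" in inspect.signature(cls).parameters:
        kwargs["momentum"] = momentum
    return partial(cls, **kwargs)

With this, make_optimizer_fn("SGD", 0.1, 1e-4, momentum=0.9) and make_optimizer_fn("AdamW", 1e-3, 1e-2) would both work without special-casing.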
Add the files required to run experiments with the Wild-Time benchmarks.
Save PIL images rather than tensors in the memory buffer to reduce memory usage (sketched below).
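To illustrate the idea (class and method names are illustrative, not Renate's actual buffer API): a uint8 PIL image is much smaller than the float32 tensor produced by the training transform, so the buffer can store the raw image and apply the transform lazily when a sample is drawn.

# Illustrative sketch only; SmallMemoryBuffer is hypothetical, not Renate
# code. Storing uint8 PIL images instead of transformed float32 tensors
# cuts per-sample buffer memory roughly 4x from the dtype alone, plus
# whatever resizing/normalization the transform would have baked in.
from typing import Callable, List, Tuple

from PIL import Image


class SmallMemoryBuffer:
    def __init__(self, transform: Callable) -> None:
        self._data: List[Tuple[Image.Image, int]] = []
        self._transform = transform  # applied lazily, at sampling time

    def append(self, image: Image.Image, label: int) -> None:
        # Keep the compact PIL image; do NOT store transform(image).
        self._data.append((image, label))

    def __getitem__(self, idx: int):
        image, label = self._data[idx]
        return self._transform(image), label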
Found bug: optimizer_fn doesn't work as intended. If it exists, it must return an optimizer; we would like it to be allowed to exist and return either an optimizer or None. Currently "fixed" by adding all arguments expected for an optimizer to the function.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
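A hedged sketch of the desired contract (create_optimizer and the fallback settings are hypothetical, not Renate code): the caller should tolerate an optimizer_fn that is defined but returns None, falling back to a default optimizer instead of failing.

# Hypothetical sketch of the desired optimizer_fn behavior described
# above: optimizer_fn may be absent, or present but return None; only
# when it returns a factory is that factory used.
from typing import Callable, Optional

import torch


def create_optimizer(
    model: torch.nn.Module, optimizer_fn: Optional[Callable] = None
) -> torch.optim.Optimizer:
    factory = optimizer_fn() if optimizer_fn is not None else None
    if factory is None:
        # Fall back to a standard optimizer instead of raising an error.
        return torch.optim.SGD(model.parameters(), lr=0.01)
    return factory(model.parameters())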