feat: max_batch_size refactoring
#67
Conversation
gsprochette
left a comment
Looks pretty much good to me; I'm looking forward to using this unified batch_size and to having access to data from outside the smash_config :) I only suggested a couple of minor changes, which should take just a minute.
3f598b9 to 27a87e3
begumcig
left a comment
Everything already looks super solid! I just left a small comment regarding batch_size mismatches between the pipeline and the dataloader and how they could affect evaluation.
Great job overall 🥹
Does the batch_size argument here override the calib_data's batch_size?
Yeah, good question. In this case the calib data is the string of all text snippets from the dataset as a whole and doesn't have an inherent batch size... and then GPTQ slices and embeds as necessary.
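For readers unfamiliar with this pattern, here is a minimal sketch of what "slicing a flat calibration string into batches" can look like. This is illustrative only, not pruna's or GPTQ's actual code; the tokenizer choice, batch size, and sequence length are assumptions.

```python
# Illustrative sketch: a flat calibration string is tokenized once and then
# sliced into fixed-size (batch_size, seq_len) blocks, independent of any
# dataloader batch size. Tokenizer and sizes below are assumptions.
from transformers import AutoTokenizer

calib_text = " ".join(["some text snippet"] * 1000)  # dataset joined into one string
tokenizer = AutoTokenizer.from_pretrained("gpt2")

batch_size, seq_len = 4, 128
ids = tokenizer(calib_text, return_tensors="pt").input_ids[0]

# Drop the remainder so the ids reshape cleanly into (num_blocks, batch_size, seq_len).
num_blocks = ids.numel() // (batch_size * seq_len)
blocks = ids[: num_blocks * batch_size * seq_len].view(num_blocks, batch_size, seq_len)

for batch in blocks:  # each batch has shape (batch_size, seq_len)
    ...  # run the model on `batch` to collect calibration statistics
```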
src/pruna/algorithms/batching/ifw.py
Outdated
Okay, just wanted to flag two things here:
1. Since this batch_size can directly impact inference performance (latency, memory, etc.), I'm a bit concerned that changes here (or in any future algorithm that plays with this setting) could unintentionally affect our evaluation metrics, especially when comparing against the base model. Would it make sense to pass a config to the evaluation agent as well, so we're running everything under the same conditions?
2. This is more of a question than a problem: what happens if the data we pass later (e.g., from a DataLoader) is already batched differently? Does this lead to re-batching under the hood? Might be worth double-checking that the batch sizes align, or that we're not unintentionally introducing extra batching logic from the pipeline itself.
After our discussion, I added the following warning:
from pruna.evaluation.evaluation_agent import EvaluationAgent
from pruna.evaluation.task import Task
from pruna.data.pruna_datamodule import PrunaDataModule
import torch
from diffusers import StableDiffusionPipeline
from pruna import SmashConfig, smash
pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4", torch_dtype=torch.float16)
config = SmashConfig() # batch size is 1 by default
config["cacher"] = "deepcache"
smashed_pipe = smash(pipe, config)
task = Task(["gpu_memory"], datamodule=PrunaDataModule.from_string("LAION256", dataloader_args={"batch_size": 3}))
agent = EvaluationAgent(task)
agent.evaluate(smashed_pipe)
This would now output:
INFO - Starting cacher deepcache...
INFO - cacher deepcache was applied successfully.
INFO - Loaded only training, splitting train 80/10/10 into train, validation and test...
INFO - Testing compatibility with functools.partial(<function image_generation_collate at 0x7f6950b86f80>, img_size=512)...
INFO - Creating metrics from names: ['gpu_memory']
INFO - Evaluating a smashed model.
INFO - Detected diffusers model. Using DiffuserHandler with fixed seed.
- The first element of the batch is passed as input.
- The generated outputs are expected to have .images attribute.
WARNING - Batch size mismatch between evaluation datamodule and smashed model's smash config. This may lead to incorrect metric computation due to compression algorithms being batch size specific. Adjust the datamodule creation to match the smashed model's batch size, e.g., datamodule = PrunaDataModule.from_string(dataset_name, dataloader_args={'batch_size': 1})
INFO - Evaluating stateful metrics.
INFO - Evaluating isolated inference metrics.
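Following the hint in the warning text itself, the mismatch goes away if the datamodule's batch size matches the smash config's batch size (1 by default in the example above). Reusing the imports and smashed_pipe from the snippet above:

```python
# Align the dataloader batch size with the smash config's batch size (1 by default
# in the example above) so the mismatch warning is not emitted.
task = Task(
    ["gpu_memory"],
    datamodule=PrunaDataModule.from_string("LAION256", dataloader_args={"batch_size": 1}),
)
agent = EvaluationAgent(task)
agent.evaluate(smashed_pipe)
```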
e849161 to 7710ef3
model.inference_handler.log_model_info()
if (
    "batch_size" in self.task.datamodule.dataloader_args
    and self.task.datamodule.dataloader_args["batch_size"] != model.smash_config.batch_size
This is a step in the right direction! My only concern is that since the smash_config always includes a batch_size attribute by default, we might end up showing this warning every time, even when the model itself isn't changing in a way that would affect inference. Ideally, the warning should only appear if the model will actually run inference with a different batch size internally.
I'm not entirely sure how to reliably detect that. But if this issue only occurs with specific batching algorithms (is this actually the case?), maybe we could check whether the smashing algorithm is a "batcher" instead of just relying on the batch_size in the config, for instance. What do you think?
| "batch_size" in self.task.datamodule.dataloader_args | ||
| and self.task.datamodule.dataloader_args["batch_size"] != model.smash_config.batch_size | ||
| and not is_base | ||
| and model.smash_config.is_batch_size_locked() |
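For context, a hedged sketch of what a check like is_batch_size_locked() could look like, following the suggestion above to key the warning off batch-size-specific algorithms. This is not pruna's actual implementation: the group names, the `config[group] is None` convention, and writing it as a free function (the diff calls it as a SmashConfig method) are all assumptions for illustration.

```python
# Hedged sketch, not pruna's actual code: the batch size is only considered "locked"
# when an algorithm group that is batch-size specific has been configured.
BATCH_SIZE_LOCKING_GROUPS = ("batcher", "cacher")  # assumed group names

def is_batch_size_locked(smash_config) -> bool:
    """Return True if any configured algorithm makes inference batch-size specific."""
    return any(smash_config[group] is not None for group in BATCH_SIZE_LOCKING_GROUPS)
```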
💅💅💅
gsprochette
left a comment
There's only the matter of the deprecation warning message: a one-line change if you agree. Other than that it looks super good to me 💅 Thanks for taking care of this!
src/pruna/config/smash_config.py
Outdated
self.max_batch_size = max_batch_size
if max_batch_size is not None:
    warn(
        "max_batch_size is soon to be deprecated. Please use batch_size instead.",
Why "soon to be" and not "is"?
Yeah, you're completely right 🙈 thanks!
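The agreed one-line change presumably ends up looking something like the sketch below. Only the message wording ("is deprecated" instead of "soon to be deprecated") comes from the discussion above; the _resolve_batch_size helper, the FutureWarning category, and the fallback behaviour are hypothetical.

```python
from warnings import warn

# Hypothetical helper for illustration only; the message wording reflects the change
# agreed above, while FutureWarning and the fallback return are assumptions.
def _resolve_batch_size(batch_size: int = 1, max_batch_size: int | None = None) -> int:
    if max_batch_size is not None:
        warn(
            "max_batch_size is deprecated. Please use batch_size instead.",
            FutureWarning,
        )
        return max_batch_size
    return batch_size
```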
Description
This PR refactors the batch size arguments used by various methods. The user now sets the expected inference batch size once in the SmashConfig, and it is used throughout the algorithms. The batch_size hyperparameters of the individual algorithms are deprecated accordingly. Additionally, I deprecated the name max_batch_size and renamed it to batch_size, as "max" might be misleading.
Related Issue
None.
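As a small illustration of the workflow described above (a hedged sketch: passing batch_size to the SmashConfig constructor is an assumption, mirrored from the deprecated max_batch_size argument shown in the diff earlier in this thread; the rest follows the evaluation example above):

```python
import torch
from diffusers import StableDiffusionPipeline
from pruna import SmashConfig, smash

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4", torch_dtype=torch.float16
)

# The inference batch size is set once on the config and no longer per algorithm.
# The constructor keyword is assumed from the deprecated max_batch_size argument.
config = SmashConfig(batch_size=4)
config["cacher"] = "deepcache"  # algorithms read the batch size from the config

smashed_pipe = smash(pipe, config)
```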
Type of Change
How Has This Been Tested?
Ran tests for all affected methods and tested the deprecation locally - works as intended.
Checklist
Additional Notes
None.