Conversation

@romerojosh
Collaborator

The built-in MLP model currently flattens input tensors from an assumed shape of [batch_size, ...] to [batch_size, -1] before passing them to the first linear layer. For example, if a user passes in an input tensor of shape [8, 32, 32], that input is reshaped to [8, 1024], on the assumption that the first dimension (or the last dimension in Fortran) is the batch size and all other dimensions are features. This behavior is a bit over-prescriptive and not necessarily obvious, especially as it deviates from standard PyTorch broadcasting behavior.
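For reference, a minimal standalone PyTorch sketch of the current flattening behavior (the tensor and layer sizes are illustrative, taken from the example above; this is not the library's actual implementation):

```python
import torch

# Input of shape [batch_size, ...], e.g. [8, 32, 32].
x = torch.randn(8, 32, 32)

# All non-batch dimensions are flattened into a single feature
# dimension, so the first linear layer sees [8, 1024].
x_flat = x.reshape(x.shape[0], -1)

linear = torch.nn.Linear(32 * 32, 16)  # in_features must equal 1024
y = linear(x_flat)                     # shape: [8, 16]
```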

This PR makes this behavior explicit and configurable by adding a new parameter to the MLP config (flatten_non_batch_dims) that enables the automatic flattening. If set to false, no reshaping takes place and standard PyTorch broadcasting rules apply between the inputs and the MLP dimensions. To maintain backwards compatibility, this parameter defaults to true.
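With flattening disabled, inputs follow plain PyTorch broadcasting; a minimal sketch of the resulting semantics (again with illustrative sizes, and omitting the actual config plumbing):

```python
import torch

# With flatten_non_batch_dims set to false, the input is passed through
# unmodified: nn.Linear applies to the last dimension only, and all
# leading dimensions broadcast.
x = torch.randn(8, 32, 32)
linear = torch.nn.Linear(32, 16)  # in_features matches the last dim
y = linear(x)                     # shape: [8, 32, 16]
```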

@romerojosh romerojosh changed the title Add parameter to control flattening behavior of built-in MLP models. Add parameter to control flattening behavior of built-in MLP model. Sep 18, 2025
@romerojosh
Collaborator Author

/build_and_test

@github-actions

🚀 Build workflow triggered! View run

@github-actions

✅ Build workflow passed! View run

@romerojosh
Collaborator Author

/build_and_test

@github-actions

🚀 Build workflow triggered! View run

@github-actions

✅ Build workflow passed! View run

Signed-off-by: Josh Romero <joshr@nvidia.com>
@azrael417 azrael417 merged commit 90c385d into master Sep 30, 2025
4 checks passed
@romerojosh romerojosh deleted the mlp_flatten_option branch October 2, 2025 16:31