Rollup: No-batch-dim support for torch.nn modules #60585
What do the semantics of no-batch-dim look like for a criterion when there is a reduction? Consider:

```python
import torch
import torch.nn as nn

loss_mean = nn.MSELoss(reduction='mean')
input = torch.randn(3, 5, 5, requires_grad=True)
target = torch.randn(3, 5, 5)

torch.isclose(
    loss_mean(input, target),
    # Need to scale the sum down by the number of batches
    sum(loss_mean(input[i], target[i]) for i in range(3)) / 3
)
```
Great question - I'd say that "reduction" only has non-trivial meaning when being applied over a batch. If there's only a single item in an implicit "batch" (i.e. the no-batch-dim case), I don't see a problem with the output being equivalent across all reduction types. Note that you could tweak your example to pass 3 inputs of shape
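To make the scaling in the question concrete: with equal-sized samples, the batched `'mean'` reduction equals the average of the per-sample `'mean'` losses (hence the `/ 3`), while `'sum'` needs no scaling. A small sketch:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
input = torch.randn(3, 5, 5)
target = torch.randn(3, 5, 5)

# 'mean' averages over *all* elements, so the batched result equals the
# average of the per-sample means (the samples are equal-sized here)
loss_mean = nn.MSELoss(reduction='mean')
assert torch.isclose(
    loss_mean(input, target),
    sum(loss_mean(input[i], target[i]) for i in range(3)) / 3,
)

# 'sum' needs no scaling: the batched sum is the sum of per-sample sums
loss_sum = nn.MSELoss(reduction='sum')
assert torch.isclose(
    loss_sum(input, target),
    sum(loss_sum(input[i], target[i]) for i in range(3)),
)
```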
…t no-batch-dims (#61461)
Summary: Towards #60585. This PR does not use `check_sum_reduction` because I wanted to test every reduction option.
Pull Request resolved: #61461
Reviewed By: suo
Differential Revision: D29883744
Pulled By: jbschlosser
fbshipit-source-id: cdad0effb41f0484938caad0d4c9d6d83e2aec07
Oops! Had incorrectly marked the PR (GRU and RNN) with Fixes
Wrongly closed again
Summary: Reference: pytorch/pytorch#60585
TODO:
* [x] Update docs
Pull Request resolved: pytorch/pytorch#71056
Reviewed By: samdow
Differential Revision: D33638643
Pulled By: jbschlosser
fbshipit-source-id: c0949829de8a8e6e7b2873f459a8d7da597a3be3
(cherry picked from commit f94d584)
Background
A previous issue (#47149) requested support for arbitrary batch dimensions across module inputs. This rollup issue details the `torch.nn` module updates required to address a subset of this functionality: specifically, the case of no batch dimensions. This particular case is useful for composability with a future `vmap`.

Module support
- `nn.AdaptiveAvgPool1d` - Should be updated to support 2D inputs
- `nn.AdaptiveAvgPool2d` - Supports 3D inputs (no batch), but docs should make this clear
- `nn.AdaptiveAvgPool3d` - Supports 4D inputs (no batch), but docs should make this clear
- `nn.AdaptiveLogSoftmaxWithLoss` - Should be updated to support 1D input and scalar target / output (@george-qi)
- `nn.AdaptiveMaxPool1d` - Should be updated to support 2D inputs
- `nn.AdaptiveMaxPool2d` - Supports 3D inputs (no batch), but docs should make this clear
- `nn.AdaptiveMaxPool3d` - Supports 4D inputs (no batch), but docs should make this clear
- `nn.AlphaDropout`
- `nn.AvgPool1d` - Should be updated to support 2D inputs
- `nn.AvgPool2d` - Supports 3D inputs (no batch), but docs should make this clear
- `nn.AvgPool3d` - Supports 4D inputs (no batch), but docs should make this clear
- `nn.BCELoss`
- `nn.BCEWithLogitsLoss`
- `nn.Bilinear` - Already supports no batch dims, but docs should make this clear (@george-qi)
- `nn.CELU`
- `nn.CTCLoss` - Should be updated to support a single (source, target) sequence (@george-qi)
- `nn.ConstantPad1d`
- `nn.ConstantPad2d`
- `nn.ConstantPad3d`
- `nn.Conv1d` - Should be updated to support 2D input (@jbschlosser)
- `nn.Conv2d` - Should be updated to support 3D input (@jbschlosser)
- `nn.Conv3d` - Should be updated to support 4D input (@jbschlosser)
- `nn.ConvTranspose1d` - Should be updated to support 2D input (@jbschlosser)
- `nn.ConvTranspose2d` - Should be updated to support 3D input (@jbschlosser)
- `nn.ConvTranspose3d` - Should be updated to support 4D input (@jbschlosser)
- `nn.CosineEmbeddingLoss` - Should be updated to support 1D input with scalar target (@kshitij12345)
- `nn.CosineSimilarity` (pass `dim=0` for no-batch-dim support)
- `nn.CrossEntropyLoss` - Should be updated to support 1D input with scalar target (@kshitij12345)
- `nn.Dropout`
- `nn.Dropout2d`
- `nn.Dropout3d`
- `nn.ELU`
- `nn.Embedding`
- `nn.FeatureAlphaDropout` - Already supports no batch dims, but docs are missing
- `nn.Flatten` (pass `start_dim=0` for no-batch-dim support)
- `nn.Fold` - Should be updated to support 3D inputs (@kshitij12345)
- `nn.FractionalMaxPool2d` - Should be updated to support 3D inputs
- `nn.FractionalMaxPool3d` - Should be updated to support 4D inputs (@george-qi)
- `nn.GELU`
- `nn.GLU`
- `nn.GRU` - Should be updated to support a single input sequence / hidden state; `batch_first` arg is meaningless in this case (@kshitij12345)
- `nn.GRUCell` - Should be updated to support 1D input / hidden state (@kshitij12345)
- `nn.GaussianNLLLoss` - Already supports no batch dims, but docs should make this clear (@george-qi)
- `nn.Hardshrink`
- `nn.Hardsigmoid`
- `nn.Hardswish`
- `nn.Hardtanh`
- `nn.HingeEmbeddingLoss`
- `nn.HuberLoss`
- `nn.Identity`
- `nn.InstanceNorm1d` - Should be updated to support 1D or 2D inputs (@kshitij12345)
- `nn.InstanceNorm2d` - Should be updated to support 2D or 3D inputs (@kshitij12345)
- `nn.InstanceNorm3d` - Should be updated to support 3D or 4D inputs (@kshitij12345)
- `nn.KLDivLoss`
- `nn.L1Loss`
- `nn.LPPool1d` - Should be updated to support 2D inputs
- `nn.LPPool2d`
- `nn.LSTM` - Should be updated to support single input, hidden state, cell state; `batch_first` arg is meaningless in this case (@kshitij12345)
- `nn.LSTMCell` - Should be updated to support 1D input, hidden state, cell state (@kshitij12345)
- `nn.LayerNorm` (pass correctly-set `normalized_shape` for no-batch-dim support)
- `nn.LazyConv1d` - Should be updated to support 2D input (@jbschlosser)
- `nn.LazyConv2d` - Should be updated to support 3D input (@jbschlosser)
- `nn.LazyConv3d` - Should be updated to support 4D input (@jbschlosser)
- `nn.LazyConvTranspose1d` - Should be updated to support 2D input (@jbschlosser)
- `nn.LazyConvTranspose2d` - Should be updated to support 3D input (@jbschlosser)
- `nn.LazyConvTranspose3d` - Should be updated to support 4D input (@jbschlosser)
- `nn.LazyLinear` - Already supports no batch dims, but docs should make this clear
- `nn.LeakyReLU`
- `nn.Linear` - Already supports no batch dims, but docs should make this clear
- `nn.LogSigmoid`
- `nn.LogSoftmax`
- `nn.MSELoss` - Already supports no batch dims, but docs should make this clear
- `nn.MarginRankingLoss` - Should be updated to support scalars (@kshitij12345)
- `nn.MaxPool1d` - Already supports no batch dims when `return_indices=False`, but need to support `return_indices=True`
- `nn.MaxPool2d` - Already supports no batch dims when `return_indices=False`, but need to support `return_indices=True`
- `nn.MaxPool3d` - Already supports no batch dims when `return_indices=False`, but need to support `return_indices=True`
- `nn.MaxUnpool1d` - Should be updated to support 2D input / indices
- `nn.MaxUnpool2d` - Should be updated to support 3D input / indices
- `nn.MaxUnpool3d` - Should be updated to support 4D input / indices
- `nn.Mish`
- `nn.MultiLabelMarginLoss`
- `nn.MultiLabelSoftMarginLoss` - Should be updated to support 1D input / target
- `nn.MultiMarginLoss` - Already supports no batch dims, but docs should make this clear
- `nn.MultiheadAttention` - Should be updated to support unbatched query, key, values, and masks; `batch_first` arg is meaningless in this case
- `nn.NLLLoss` - Should be updated to support 1D input and scalar target
- `nn.PairwiseDistance` - Should be updated to support 1D inputs with scalar output
- `nn.PixelShuffle`
- `nn.PixelUnshuffle`
- `nn.PoissonNLLLoss` - Already supports no batch dims, but docs should make this clear
- `nn.RNN` - Should be updated to support a single input sequence / hidden state; `batch_first` arg is meaningless in this case (@kshitij12345)
- `nn.RNNCell` - Should be updated to support 1D input and hidden state (@kshitij12345)
- `nn.RReLU`
- `nn.ReLU`
- `nn.ReLU6`
- `nn.ReflectionPad1d` - Should be updated to support 2D inputs
- `nn.ReflectionPad2d`
- `nn.ReflectionPad3d`
- `nn.ReplicationPad1d` - Should be updated to support 2D inputs
- `nn.ReplicationPad2d`
- `nn.ReplicationPad3d`
- `nn.SELU`
- `nn.SiLU`
- `nn.Sigmoid`
- `nn.SmoothL1Loss` - Already supports no batch dims, but docs should make this clear
- `nn.SoftMarginLoss`
- `nn.Softmax`
- `nn.Softmax2d` - Should be updated to support 3D input
- `nn.Softmin`
- `nn.Softplus`
- `nn.Softshrink`
- `nn.Softsign`
- `nn.Tanh`
- `nn.Tanhshrink`
- `nn.Threshold`
- `nn.Transformer` - Should be updated to support unbatched source / target sequences and masks (@kshitij12345)
- `nn.TransformerDecoderLayer` - Should be updated to support unbatched inputs (@kshitij12345)
- `nn.TransformerEncoderLayer` - Should be updated to support unbatched inputs (@kshitij12345)
- `nn.TripletMarginLoss` - Should be updated to support 1D anchor, positive, and negative (@kshitij12345)
- `nn.TripletMarginWithDistanceLoss` (support is based on the choice of distance function)
- `nn.Unflatten` - Already supports no batch dims, but docs should make this clear
- `nn.ZeroPad2d`
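The composability with `vmap` mentioned in the Background can be sketched as follows: once a module accepts unbatched inputs, batching can be reintroduced externally. This is a minimal sketch, assuming a PyTorch build that ships `torch.vmap` (2.0+) and using `nn.Linear`, which already supports unbatched inputs:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
linear = nn.Linear(5, 2)

# The computation is written against a single unbatched sample of shape (5,)
single = torch.randn(5)
out_single = linear(single)
assert out_single.shape == (2,)

# vmap lifts the same per-sample computation over a leading batch dim;
# the module's parameters are treated as unbatched constants
batch = torch.randn(8, 5)
out_vmapped = torch.vmap(linear)(batch)
assert out_vmapped.shape == (8, 2)

# Matches the module's own native batched behavior
assert torch.allclose(out_vmapped, linear(batch), atol=1e-6)
```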
Semantically incompatible modules
- `nn.BatchNorm1d` - Only defined over batch
- `nn.BatchNorm2d` - Only defined over batch
- `nn.BatchNorm3d` - Only defined over batch
- `nn.LazyBatchNorm1d` - Only defined over batch
- `nn.LazyBatchNorm2d` - Only defined over batch
- `nn.LazyBatchNorm3d` - Only defined over batch
- `nn.SyncBatchNorm` - Only defined over batch

Modules that require BC-breaking changes
- `nn.ChannelShuffle` - While the docs say that the module accepts shape (*, C, H, W), the implementation assumes shape (N, C, *) with * being 1 or more dimensions; switching to (C, *) is BC-breaking because it would reinterpret dims
- `nn.EmbeddingBag` - Already supports 1D inputs with different semantics
- `nn.GroupNorm` - Supports arbitrary spatial dims; switching to (C, *) instead of (N, C, *) is BC-breaking since it would reinterpret dims
- `nn.LocalResponseNorm` - Supports (N, C, *) shape now; switching to (C, *) is BC-breaking since it would reinterpret dims
- `nn.PReLU` - Always assumes the 2nd dim is the channels dim
- `nn.Unfold` - Supports 4D inputs; switching to (C, *) instead of (N, C, *) is BC-breaking since it would reinterpret dims
- `nn.Upsample` - Supporting unbatched inputs would be BC-breaking because it would reinterpret dims

Irrelevant modules
- `nn.Module`
- `nn.ModuleDict`
- `nn.ModuleList`
- `nn.ParameterDict`
- `nn.ParameterList`
- `nn.Sequential`
- `nn.TransformerDecoder` - see `nn.TransformerDecoderLayer` instead
- `nn.TransformerEncoder` - see `nn.TransformerEncoderLayer` instead

cc @albanD @mruberry @jbschlosser