Expand use_linear to all models and UpsampleInSpace module by TsChala · Pull Request #52 · ORNL/MATEY

TsChala · 2026-04-22T15:20:01Z

Following #48, I expanded the option to use a linear layer instead of conv3D. The proposed changes are:

Expand use_linear to the AViT, SViT and TurbT models as an option (default: False).
In the TurbT model, the UpsampleInSpace module also contains conv3D layers; these are replaced with linear layers as well when use_linear=True.
expand_projections is now compatible with use_linear=True. "Fix smooth layer and expand_projections regressions from #48" (#50) already addressed hMLP_output; I added the necessary parts for UpsampleInSpace as well.
I found a small mismatch between the linear and conv3D versions in the bias of hMLP_output's out_head. The bias has out_chans entries if use_linear=False, but out_chans * kD * kH * kW entries if use_linear=True. I suggest adding the bias after rearranging, so that it has the same size as in the conv3D case. This keeps the parameter count consistent between the linear and conv3D options.
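
The bias suggestion in the last item can be sketched as follows. This is only an illustration, not the code in this PR: the class name `LinearOutHead` is hypothetical, and it assumes the conv version of out_head behaves like a transposed convolution with kernel size equal to stride. The linear layer is created with bias=False, and a per-channel bias of size out_chans is added after the tokens are rearranged back onto the grid.

```python
import torch
import torch.nn as nn


class LinearOutHead(nn.Module):
    """Hypothetical sketch: linear output head with a per-channel bias."""

    def __init__(self, channels, out_chans, kernel):
        super().__init__()
        self.out_chans = out_chans
        self.kernel = kernel
        kD, kH, kW = kernel
        # No bias here: a Linear bias would have out_chans * kD * kH * kW entries.
        self.proj = nn.Linear(channels, out_chans * kD * kH * kW, bias=False)
        # Bias of size out_chans, matching the conv case.
        self.bias = nn.Parameter(torch.zeros(out_chans))

    def forward(self, x):
        # x: (B, C, D, H, W), one token per grid cell
        B, C, D, H, W = x.shape
        kD, kH, kW = self.kernel
        y = self.proj(x.permute(0, 2, 3, 4, 1))  # (B, D, H, W, out*kD*kH*kW)
        y = y.reshape(B, D, H, W, self.out_chans, kD, kH, kW)
        y = y.permute(0, 4, 1, 5, 2, 6, 3, 7)  # (B, out, D, kD, H, kH, W, kW)
        y = y.reshape(B, self.out_chans, D * kD, H * kH, W * kW)
        # Add the bias after rearranging, one value per output channel.
        return y + self.bias.view(1, -1, 1, 1, 1)
```

Under that assumption, the head has as many parameters as `nn.ConvTranspose3d(channels, out_chans, kernel_size=kernel, stride=kernel)`, and it gives the same output once the weights are copied over in matching order.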

pzhanggit

Thanks @TsChala. I have two comments: (1) Could we be explicit about the assumptions under which the linear changes are functionally equivalent? For conv2d/conv3d, we’re assuming the patches are non-overlapping. Could we raise an error when that’s not the case but use_linear is on?
(2) The current nn.Linear implementations of some functions, such as UpsampleConv3d, do not reproduce the same functionality.
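
A minimal sketch of the check requested in (1), together with the equivalence it protects. The helper names `check_non_overlapping`, `patchify` and `linear_from_conv` are hypothetical and not part of the repository. The sketch assumes a plain nn.Conv3d with kernel size equal to stride and zero padding, in which case the convolution is a linear map applied to each flattened patch.

```python
import torch
import torch.nn as nn


def check_non_overlapping(kernel_size, stride, padding=(0, 0, 0)):
    """Raise if a conv with these settings is not a per-patch linear map."""
    if tuple(kernel_size) != tuple(stride) or any(p != 0 for p in padding):
        raise ValueError(
            "use_linear=True needs kernel_size == stride and zero padding, "
            f"got kernel_size={kernel_size}, stride={stride}, padding={padding}"
        )


def patchify(x, kernel):
    """(B, C, D, H, W) -> (B, D/kD, H/kH, W/kW, C*kD*kH*kW)."""
    B, C, D, H, W = x.shape
    kD, kH, kW = kernel
    x = x.reshape(B, C, D // kD, kD, H // kH, kH, W // kW, kW)
    x = x.permute(0, 2, 4, 6, 1, 3, 5, 7)  # (B, d, h, w, C, kD, kH, kW)
    return x.reshape(B, D // kD, H // kH, W // kW, C * kD * kH * kW)


def linear_from_conv(conv):
    """Build an nn.Linear with the same weights as a non-overlapping Conv3d."""
    check_non_overlapping(conv.kernel_size, conv.stride, conv.padding)
    out_ch = conv.out_channels
    linear = nn.Linear(conv.weight[0].numel(), out_ch, bias=conv.bias is not None)
    with torch.no_grad():
        # Conv weight is (out, C, kD, kH, kW); flatten in the patchify order.
        linear.weight.copy_(conv.weight.reshape(out_ch, -1))
        if conv.bias is not None:
            linear.bias.copy_(conv.bias)
    return linear
```

Applying `linear_from_conv(conv)` to `patchify(x, conv.kernel_size)` and moving the channel axis back to position 1 should then match `conv(x)` up to floating-point error, while any overlapping configuration fails early with a ValueError.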

pzhanggit · 2026-04-27T15:13:10Z

+                self.out_proj.append(nn.Linear(channels, channels * kD * kH * kW, bias=False))
+                self.out_proj.append(nn.InstanceNorm3d(channels, affine=True))
+                self.out_proj.append(nn.GELU())
+            # Final head
+            kD, kH, kW = self.ks[0]
+            self.out_head = nn.Linear(channels, channels * kD * kH * kW)


The nn.Linear operations here are not equivalent to UpsampleConv3d, as UpsampleConv3d consists of nn.Upsample and nn.Conv3d.

Good point. Upon further investigation, I don't think it makes sense to offer the use_linear option for the UpsampleConv3d part. We upsample to change the size, then apply the Conv3d block in a way that leaves the size unchanged. We hardcoded stride=1 and padding="same" for exactly this reason. The non-overlapping scenario (kernel size == stride) is therefore not really possible here. I'll remove the use_linear option from the UpsampleConv3d parts.
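
A small experiment can make the difference concrete. The following is a sketch, not the repository's UpsampleConv3d: it assumes nearest-neighbor upsampling by a factor of 2 followed by a 3x3x3 Conv3d with stride=1 and padding="same", and compares it with a per-voxel nn.Linear applied after the same upsampling. It perturbs one input voxel and counts how many output voxels change along each path.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
channels = 4

# Assumed stand-ins for the two variants (hypothetical, for illustration only).
upsample = nn.Upsample(scale_factor=2, mode="nearest")
conv = nn.Conv3d(channels, channels, kernel_size=3, stride=1, padding="same")
pointwise = nn.Linear(channels, channels)


def conv_path(x):
    # Upsample, then a size-preserving convolution that mixes neighbors.
    return conv(upsample(x))


def linear_path(x):
    # Upsample, then a linear layer applied to each voxel independently.
    y = upsample(x).permute(0, 2, 3, 4, 1)
    return pointwise(y).permute(0, 4, 1, 2, 3)


def count_changed_voxels(fn, x, index):
    """Perturb one input voxel and count the output voxels that change."""
    x_perturbed = x.clone()
    x_perturbed[(0, 0) + index] += 1.0
    with torch.no_grad():
        diff = (fn(x_perturbed) - fn(x)).abs().amax(dim=1)
    return int((diff > 1e-6).sum())


x = torch.randn(1, channels, 5, 5, 5)
n_conv = count_changed_voxels(conv_path, x, (2, 2, 2))
n_linear = count_changed_voxels(linear_path, x, (2, 2, 2))
print(n_conv, n_linear)
```

Along the linear path the change stays inside the upsampled copy of the perturbed voxel, while the convolution spreads it to neighboring voxels. A per-voxel linear layer therefore cannot reproduce the overlapping kernel, whatever its weights.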

pzhanggit · 2026-04-27T15:14:47Z

+                kD, kH, kW = self.ks[-(ilayer+1)]
+                # Apply linear, norm, activation
+                x = rearrange(x, 'tb c d h w -> (tb d h w) c')
+                x = self.out_proj[layer_idx](x)  # Linear layer


See my comment above. We will need to fix it to make it consistent.

expand use_linear to all models and UpsampleInSpace module

af35b28

TsChala requested a review from pzhanggit April 22, 2026 15:20

pzhanggit reviewed Apr 27, 2026

TsChala wants to merge 1 commit into ORNL:main from TsChala:Conv3DToLinear

TsChala Apr 28, 2026

Uh oh!

pzhanggit Apr 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

TsChala commented Apr 22, 2026

Uh oh!

pzhanggit left a comment

Choose a reason for hiding this comment

Uh oh!

pzhanggit Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

TsChala Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

pzhanggit Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants