Unsqueeze op #1236
Conversation
Codecov Report

Additional details and impacted files:

@@            Coverage Diff             @@
##             main    #1236      +/-   ##
==========================================
+ Coverage   84.41%   84.49%    +0.07%
==========================================
  Files         549      563       +14
  Lines       61952    63340     +1388
==========================================
+ Hits        52295    53517     +1222
- Misses       9657     9823      +166
==========================================

View full report in Codecov by Sentry.
Thanks for adding this new op!
There are minor issues that need to be cleaned up before merging. I haven't verified whether all cases are covered; it seems the implementation was not trivial.
burn-import/src/onnx/from_onnx.rs
//this is an extremely hacky temporary solution while I figure out how to properly handle this
//situation
Will this be handled in this PR or later? If it isn't handled in this PR, we should replace it with a TODO with instructions.
Also, please explain the context for why this is needed and any potential issues it might cause down the road.
Down the road, the primary issue is build speed for larger graphs. I'm working on a better solution in #1296.
The new function is necessary because the axes to insert at aren't available unless the rhs of an unsqueeze op is constant. When it isn't, the only ways to determine what the arguments of unsqueeze should be are:
- from the output shape, if it's explicit in the graph, though in that case it's better to just remap to a reshape and avoid runtime inference of a shape we already know; or
- at runtime, which isn't covered here because (correct me if I'm wrong) I don't think burn supports runtime inference of shapes yet.
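To illustrate the constant-rhs case: when the axes input is a known constant, the output shape can be computed at import time from the input shape alone. This is a hedged sketch, not burn-import's actual code; the function name `unsqueeze_output_shape` is hypothetical, and it assumes the axes values are unique (as the ONNX spec intends) and normalized against the output rank:

```rust
// Hypothetical sketch: given a constant `axes` input and the input shape,
// compute the unsqueezed output shape following ONNX Unsqueeze semantics.
// Assumes unique axes; negative axes count from the end of the OUTPUT rank.
fn unsqueeze_output_shape(input_shape: &[usize], axes: &[i64]) -> Vec<usize> {
    let out_rank = input_shape.len() + axes.len();
    // Normalize negative axes relative to the output rank, then sort.
    let mut norm: Vec<usize> = axes
        .iter()
        .map(|&a| if a < 0 { (a + out_rank as i64) as usize } else { a as usize })
        .collect();
    norm.sort_unstable();
    let mut out = Vec::with_capacity(out_rank);
    let mut src = input_shape.iter();
    for d in 0..out_rank {
        if norm.binary_search(&d).is_ok() {
            out.push(1); // inserted singleton axis
        } else {
            out.push(*src.next().unwrap()); // carried over from the input
        }
    }
    out
}
```

With the shape known, the op can then be emitted directly as a reshape, which is the remapping described above.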
Added @nathanielsimard as a reviewer for the new tensor op.
Just commented on the added op, which should be simplified.
/// }
/// ```
pub fn unsqueeze_dims<const D2: usize>(self, dims: &[isize]) -> Tensor<B, D2, K> {
    let mut new_dims = [1; D2];
I feel like this can be implemented in a simpler way:
let mut new_dims = [1; D2];
let mut counter = 0;
for i in 0..D2 {
    if !dims.contains(&(i as isize)) {
        new_dims[i] = old_dims[counter];
        counter += 1;
    }
}
tensor.reshape(new_dims);
No allocation, and contains is extremely fast on small slices (fewer than 10 elements).
I don't think that would work, but there might be a way to avoid the allocation.
The reason I say that is that dims can have negative values, which need to be converted to usize.
I'm using a Vec because slices don't carry a compile-time length, and I haven't figured out a way to infer the length otherwise.
I could avoid the allocation by making the second argument &'op mut [isize] and then mutating the new dims in place.
I just realized that the dims can also contain duplicates. The ONNX spec never specified that the values had to be unique, so I wrote it so that there could be duplicates.
Essentially, the thinking was that unsqueeze_dims is equivalent to doing a series of single-dim unsqueezes, and if you executed those unsqueezes, it wouldn't matter in what order the operations happened: two values of -1 would just result in two axes at the end of the resulting tensor.
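The sequential interpretation described here can be sketched on shapes alone. This is a hypothetical illustration (the function name `unsqueeze_shape` is mine, not burn's API): each axis value is normalized against the current, growing rank, so negative and duplicate values are both well-defined:

```rust
// Hypothetical sketch of unsqueeze_dims as a sequence of single-axis
// unsqueezes applied to a shape. Negative axes count from the end of the
// shape as it stands at that step, so duplicate -1 values stack at the end.
fn unsqueeze_shape(shape: &[usize], dims: &[isize]) -> Vec<usize> {
    let mut out: Vec<usize> = shape.to_vec();
    for &d in dims {
        let rank = out.len() as isize;
        // -1 means "append after the current last axis".
        let idx = if d < 0 { d + rank + 1 } else { d } as usize;
        out.insert(idx, 1);
    }
    out
}
```

For example, under this interpretation a shape [2, 3] with dims [-1, -1] becomes [2, 3, 1, 1], matching the "two axes at the end" behavior described above.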
From my end it looks good.
Thank you for the improvements and OP addition. Burn ONNX is getting better!
Pull Request Template
Checklist
The run-checks all script has been executed.
Related Issues/PRs
None, just adding unsqueeze_dims to get us closer to full ONNX support.
Changes
Adds a new function unsqueeze_dims to burn-tensor/src/tensor/api/base, which takes a slice of isizes as the second argument (to make it compatible with the ONNX Unsqueeze op). For burn-import, if the rhs of the op node is constant, the output shape is calculated; if it isn't and the output shape already has an explicit value, the op is replaced with a reshape.
It might be desirable to just remap to reshape in every case where the output shape is explicit, and/or to truncate multiple unsqueezes to a single unsqueeze.
As we were discussing on the Discord a while ago, this implementation doesn't yet support the third case, where the dimensions of the output are symbolic (determined by the outputs of previous steps at runtime), but I'm not sure supporting that would be possible in burn right now for any op.
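The three-way decision described in the changes above can be summarized as a small dispatch. This is a hedged sketch only; the enum and function names are illustrative and are not burn-import's actual API:

```rust
// Hypothetical sketch of the import-time lowering decision for Unsqueeze.
enum UnsqueezeLowering {
    // rhs (axes) is constant: axes are known, keep the unsqueeze op.
    Unsqueeze { axes: Vec<i64> },
    // Axes unknown, but the graph gives an explicit output shape:
    // remap to a reshape with that shape.
    Reshape { shape: Vec<usize> },
    // Symbolic output shape: not supported yet.
    Unsupported,
}

fn lower_unsqueeze(
    const_axes: Option<Vec<i64>>,
    explicit_output_shape: Option<Vec<usize>>,
) -> UnsqueezeLowering {
    match (const_axes, explicit_output_shape) {
        (Some(axes), _) => UnsqueezeLowering::Unsqueeze { axes },
        (None, Some(shape)) => UnsqueezeLowering::Reshape { shape },
        (None, None) => UnsqueezeLowering::Unsupported,
    }
}
```

The unsupported arm corresponds to the symbolic-shape case mentioned above, which would require runtime shape inference.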
Testing
There was a discrepancy between the unsqueeze function for torch and ONNX: torch only supported a single axis argument, while ONNX supported multiple, so I wrote some code to generate the ONNX model directly through the ONNX helper and runtime. We might want to move it into its own directory as a Python module if we want to use it for other operations.
Right now the ONNX model takes a second input argument that is not present in the burn forward function, but I needed the second op to test remapping to a reshape node.
Graph: