
Request for torch.split to accept a tensor for input split_size_or_sections #47479

Closed

edqwerty10 opened this issue Nov 6, 2020 · 8 comments

Labels: enhancement (Not as big of a feature, but technically not a bug. Should be easy to fix), module: ux, module: viewing and reshaping, triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

@edqwerty10

🚀 Feature

For the PyTorch operator torch.split(tensor, split_size_or_sections, dim), we would like the input split_size_or_sections to also accept tensors. Right now it handles a list of ints or a single int.

Motivation

When tracing a model, we currently pass split_size_or_sections=tensor.tolist() when calling torch.split, but tracing can't record this conversion of a tensor to a list of ints, so the traced model fails on other inputs. This is currently blocking a diff, D24595761, that tries to improve runtime performance by using tensor operations.
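
For illustration, a minimal sketch of the failure mode (the module and shapes here are hypothetical, not taken from the diff):

```python
import torch

class SplitModule(torch.nn.Module):
    def forward(self, x, sizes):
        # .tolist() runs eagerly, so the tracer records the section
        # sizes computed from the example inputs as constants.
        return torch.split(x, sizes.tolist(), dim=0)

traced = torch.jit.trace(SplitModule(), (torch.arange(6.0), torch.tensor([2, 4])))
# The trace bakes in [2, 4]: calling it later with a different `sizes`
# tensor still splits as [2, 4] (or errors if the sizes no longer add up).
```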

Pitch

Make torch.split(tensor, tensor, int) work so that tracing can properly record the operations in the traced model.

Alternatives

We currently don't have alternatives (I think). We are migrating to a scripted model (which works in this case), but support for tracing is still requested.

Additional context

Here is a diff, D24595761, that tries to improve performance by using torch operators only but fails for a traced model.

@dzhulgakov
Collaborator

The issue is that there's some shady conversion going on here:

`return super(Tensor, self).split_with_sizes(split_size, dim)`

So today it seems that we treat a 1-element tensor as the "number of splits", not the "sizes of the splits".

What is requested can be safely applied to split_with_sizes, though, because there the argument is unambiguous.
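
To illustrate the ambiguity (a sketch, not taken from the linked diff):

```python
import torch

x = torch.arange(6)

# An int means "chunk size"; a list means "section sizes":
torch.split(x, 3)       # -> (tensor([0, 1, 2]), tensor([3, 4, 5]))
torch.split(x, [2, 4])  # -> (tensor([0, 1]), tensor([2, 3, 4, 5]))

# A 1-element tensor such as torch.tensor([3]) could plausibly mean
# either of the above; split_with_sizes only ever takes section sizes,
# so a tensor argument is unambiguous there.
```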

cc @gchanan @mruberry for opinions

@glaringlee added labels: feature (A request for a proper, new feature.) and triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) on Nov 6, 2020
@ngimel added the label enhancement (Not as big of a feature, but technically not a bug. Should be easy to fix) and removed feature (A request for a proper, new feature.) on Nov 6, 2020
@mruberry
Collaborator

mruberry commented Nov 6, 2020

We also have a NumPy-compatible tensor_split that currently throws an error when passed a tensor. We could avoid a BC concern by documenting this function as treating a CPU tensor as a list.

This raises two more general questions:

  • do we really want CPU tensors to be consistently interpreted as lists?
  • is tracing going to continue to be around, or do we expect changes that better support tracing to be temporary?

I think it'd be OK (from a UX perspective) to consistently interpret CPU tensors as lists when an operand can take a list and as a scalar when an operand can only be a scalar. Device types like XLA will probably want the tensor to be passed as an XLA tensor, however. See #31558 and cc @ailzhang. Unfortunately I don't think we can always match these tensors to the device type that will run the operation because passing a CUDA tensor and converting it to a list would cause cross-device data movement.

For the second question, I suppose tracing will be around long enough that adding this support to a few functions would be OK, especially since, if we used tensor_split, I don't think we'd have any BC concerns.

Do you think tensor_split would work for you, @edqwerty10?

@edqwerty10
Author

Hi @mruberry, this is great, thank you for the breakdown. Yes, tensor_split should work for us as well!

@mruberry
Collaborator

OK. We would accept a PR updating tensor_split to accept a CPU tensor in place of a list. Note this is consistent with NumPy:

```python
import numpy as np

a = np.array((1, 2, 3, 4))

np.array_split(a, np.array((1, 2)))
# [array([1]), array([2]), array([3, 4])]

np.array_split(a, [1, 2])
# [array([1]), array([2]), array([3, 4])]
```
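
For comparison, a sketch of the matching torch behavior once tensor_split accepts a CPU tensor (mirroring the NumPy semantics above; this is what the PR below eventually implemented):

```python
import torch

a = torch.tensor([1, 2, 3, 4])

# A 1-D tensor of indices behaves exactly like the list form:
torch.tensor_split(a, torch.tensor([1, 2]))
# -> (tensor([1]), tensor([2]), tensor([3, 4]))

torch.tensor_split(a, [1, 2])
# -> (tensor([1]), tensor([2]), tensor([3, 4]))
```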

@mruberry
Collaborator

The fastest way would be for you to submit a PR implementing the behavior. If that's a pain, you can ping me internally and we can prioritize the request.

@xw285cornell
Contributor

Any chance we can allow a cuda tensor for indices?

@mruberry
Collaborator

> Any chance we can allow a cuda tensor for indices?

No, because the indices tensor is used to define the tensor outputs, and the metadata for tensors lives on the CPU, so the CPU has to be able to access the data in indices.
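
Concretely, a minimal sketch using the tensor_split behavior added for this issue: the split points determine the output shapes, which are host-side metadata.

```python
import torch

x = torch.arange(6)
idx = torch.tensor([2, 4])  # must live on the CPU

# The output shapes (2, 2, and 2 elements here) are tensor metadata,
# computed on the host before any kernel runs; a CUDA `idx` would force
# a device-to-host copy just to read the split points.
parts = torch.tensor_split(x, idx)
print([p.shape for p in parts])  # [torch.Size([2]), torch.Size([2]), torch.Size([2])]
```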

edqwerty10 pushed a commit to edqwerty10/pytorch that referenced this issue Dec 20, 2020
…rgument (pytorch#49169)

Summary:
Pull Request resolved: pytorch#49169

Trying to address feature request pytorch#47479.
This diff overloads `torch.tensor_split` to also accept a tensor for the `indices_or_sections` argument, which currently accepts a Python list or int. The motivation is to avoid converting a tensor to a list, so that the tensor operations can be recorded when tracing a model/module.

The implementation follows the diff that originally added the `tensor_split` method, D24166164 (pytorch@ef4817f).

Test Plan:
```
buck test caffe2/test:torch -- tensor_split
```
https://www.internalfb.com/intern/testinfra/testconsole/testrun/5910974550563805/

```
buck test caffe2/test:others -- tensor_split
```
https://www.internalfb.com/intern/testinfra/testconsole/testrun/1688849905082678/

Reviewed By: mruberry

Differential Revision: D25440885

fbshipit-source-id: ca5d134cfb91fa0efc3dec5257dbc97532eb2d74
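
With this change merged, a traced module can keep the split points as a tensor end to end (a sketch; TensorSplitModule is a hypothetical module, not taken from the PR):

```python
import torch

class TensorSplitModule(torch.nn.Module):
    def forward(self, x, indices):
        # No .tolist(): the split points stay a tensor, so the tracer
        # records the op instead of baking in constants.
        return torch.tensor_split(x, indices)

traced = torch.jit.trace(TensorSplitModule(), (torch.arange(6.0), torch.tensor([2, 4])))
out = traced(torch.arange(8.0), torch.tensor([1, 5]))  # respects the new indices
```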
facebook-github-bot pushed a commit that referenced this issue Dec 21, 2020
…rgument (#49169)

(Commit message identical to the one above; landed with fbshipit-source-id: 6705dc551279e3a5eb1e5ec1ede2728eab85ffb1.)
@mruberry
Collaborator

mruberry commented Jan 3, 2021

Closing since we resolved this request by implementing the functionality in torch.tensor_split, and the request for this functionality in torch.split is a dupe of #16703.

@mruberry mruberry closed this as completed Jan 3, 2021
hwangdeyu pushed a commit to hwangdeyu/pytorch that referenced this issue Jan 6, 2021
…rgument (pytorch#49169)

(Commit message identical to the one above.)