
torch.cat without copying memory #70600

Closed
defoishugo opened this issue Jan 4, 2022 · 3 comments
Labels
module: numpy (Related to numpy support, and also numpy compatibility of our operators) · module: viewing and reshaping · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments


defoishugo commented Jan 4, 2022

🚀 The feature, motivation and pitch

Principle

Today, concatenation in PyTorch allocates a new tensor.
I would like to know whether it is possible to concatenate contiguous and/or non-contiguous tensors without duplicating memory.

Example 1: contiguous concatenation

The following code allocates a new tensor, concatenated_tensor:

import torch 

tensor1 = torch.rand(2,3,4)
tensor2 = torch.rand(2,3,4)
concatenated_tensor = torch.cat([tensor1, tensor2])

I'd like to enable the same scenario, but have concatenated_tensor as a view of tensor1 and tensor2.
In terms of UX, I don't know what to propose.

Note: since I'm a new PyTorch user, maybe "view" is not the right word. The low-level idea is to treat concatenated_tensor as a list of pointers to the underlying tensors.
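
For reference, here is a minimal sketch (my own illustration, not part of the proposal) using Tensor.data_ptr() to show that torch.cat writes into freshly allocated memory, while an ordinary view shares its input's storage:

import torch

tensor1 = torch.rand(2, 3, 4)
tensor2 = torch.rand(2, 3, 4)

# torch.cat copies both inputs into a new buffer.
concatenated_tensor = torch.cat([tensor1, tensor2])
print(concatenated_tensor.data_ptr() == tensor1.data_ptr())  # False: new allocation

# A slicing view, by contrast, reuses tensor1's storage.
view = tensor1[0]
print(view.data_ptr() == tensor1.data_ptr())  # True: same allocation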

Example 2: non-contiguous concatenation

Next, I would like to enable the following scenario, if possible:

import torch 

tensor1 = torch.rand(1, 3, 4).expand(10, -1, -1)  # expand only broadcasts size-1 dims; the result is a non-contiguous view
tensor2 = torch.rand(1, 3, 4).expand(10, -1, -1)
concatenated_tensor = torch.cat([tensor1, tensor2])

Alternatives

No response

Additional context

Discussed in #70283 with @ejguan.
See discussion/34609.

cc @mruberry @rgommers

@ejguan added the module: numpy, module: viewing and reshaping, and triaged labels Jan 4, 2022
rgommers (Collaborator) commented Jan 4, 2022

I would like to know whether it is possible to concatenate contiguous and/or non-contiguous tensors without duplicating memory.

The answers from @albanD on https://discuss.pytorch.org/t/concatenate-tensors-without-memory-copying/34609/13 explain how difficult this is: it does not fit the strided model of a tensor at all. It would be a ton of work, so I think it's safe to say this won't happen.
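
To make the constraint concrete: a strided tensor is described by a single storage plus a storage offset, sizes, and strides, so any view must address one underlying allocation. A minimal sketch (my own illustration of that point):

import torch

a = torch.rand(4, 6)

# A view is just new sizes/strides/offset over a's single storage.
v = a[:, :3]
print(v.data_ptr() == a.data_ptr())              # True: same allocation
print(v.size(), v.stride(), v.storage_offset())  # the whole view descriptor

b = torch.rand(4, 6)
# a and b live in two unrelated allocations; no single
# (storage, offset, sizes, strides) tuple can span both,
# which is why a zero-copy "cat view" of a and b is not expressible.
print(a.data_ptr() == b.data_ptr())              # False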

I'll let someone else decide, but I propose to close this issue.

defoishugo (Author)

No problem, we'll close this issue.

ngimel (Collaborator) commented Jan 10, 2022

nestedtensor (https://github.com/pytorch/nestedtensor) might do what you want. Also, if you don't need autograd, the _foreach_* ops support operations on lists of tensors, so you don't need to concatenate them in advance.
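
For illustration, a minimal sketch of the second suggestion (note that torch._foreach_* is a private API and may change between releases):

import torch

tensors = [torch.rand(2, 3, 4), torch.rand(2, 3, 4)]

# Applies the op to every tensor in the list at once and returns a list;
# no concatenated buffer is ever allocated.
scaled = torch._foreach_mul(tensors, 2.0)

# In-place variant, mutating each tensor in the list.
torch._foreach_add_(tensors, 1.0)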
