[feature request] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors probably by uplifting/promoting torch.distributed.utils._recursive_to #69431

Open
vadimkantorov opened this issue Dec 5, 2021 · 27 comments
Labels
function request: A request for a new function or the addition of new arguments/modes to an existing function.
needs research: We need to decide whether or not this merits inclusion, based on research world.
triaged: This issue has been looked at by a team member, and triaged and prioritized into an appropriate module.

Comments

@vadimkantorov
Contributor

vadimkantorov commented Dec 5, 2021

It is often necessary to move model results to CPU (or inputs to GPU). Once the data structures get a bit complicated, model results often contain dicts and lists, and we end up rolling a little utility like the one below. If other people have had to write this sort of utility as well, it may make sense to include something like it in core.

import torch

def to(obj, device):
  # Recursively move any tensor nested in dicts/lists/tuples to `device`;
  # everything else is passed through unchanged.
  if torch.is_tensor(obj):
    return obj.to(device)
  if isinstance(obj, dict):
    return {k: to(v, device) for k, v in obj.items()}
  if isinstance(obj, tuple):
    return tuple(to(v, device) for v in obj)
  if isinstance(obj, list):
    return [to(v, device) for v in obj]
  return obj
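
For example (a hypothetical call, assuming a machine with a CUDA device), moving nested model outputs back to CPU:

outputs = {"logits": torch.randn(2, 3, device="cuda"), "aux": [torch.ones(1, device="cuda"), 0.5]}
cpu_outputs = to(outputs, "cpu")  # tensors are moved, the float 0.5 passes through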

cc @albanD @mruberry @jbschlosser @walterddr

@vadimkantorov vadimkantorov changed the title [proposal] [util] torch.to(obj, device) supporting reecursive lists/dicts/tuples of tensors [proposal] [util] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors Dec 6, 2021
@anjali411 anjali411 added module: nn Related to torch.nn triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module function request A request for a new function or the addition of new arguments/modes to an existing function. and removed module: nn Related to torch.nn labels Dec 6, 2021
@albanD albanD added the needs research We need to decide whether or not this merits inclusion, based on research world label Dec 7, 2021
@albanD
Collaborator

albanD commented Dec 7, 2021

Wouldn't this be nicely solved by just using pytree from here, where you can do tree_map(torch.to, your_obj)?

@vadimkantorov
Contributor Author

vadimkantorov commented Dec 7, 2021

Maybe! But I think it's still useful to have this primitive in core, as it appears a lot in user code and very few people know about pytrees (I certainly don't). It might also be better to have a simple manual version, so it's exactly clear what is happening. E.g. a simple version makes it clear that it would be slow-ish for giant lists of Python numbers (as there would be a lot of checks before returning each value); it's not clear what happens with pytree. It also seems to have some flatten/unflatten concept...
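
For illustration, a minimal sketch of that flatten/unflatten idea, assuming the private (undocumented) torch.utils._pytree module:

import torch
from torch.utils._pytree import tree_flatten, tree_unflatten

nested = {"a": [torch.randn(2), 3.0], "b": (torch.zeros(1),)}
leaves, spec = tree_flatten(nested)   # flat list of leaves plus the structure
leaves = [x.to("cpu") if torch.is_tensor(x) else x for x in leaves]
nested_again = tree_unflatten(leaves, spec)  # same nested structure, tensors moved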

also, torch.to doesn't exist now or isn't documented

@albanD
Collaborator

albanD commented Dec 7, 2021

also, torch.to doesn't exist now or isn't documented

Oh, I used that because you mentioned it above :p But indeed, that doesn't even exist.

@mruberry
Collaborator

mruberry commented Dec 8, 2021

The lack of torch.to isn't actually an impediment because it's easy to make the method a function using a lambda.

We do write "crawlers" like this occasionally. assert_close is the most recent example that comes to mind and has one. I think there's also a slightly different version in common_methods_invocations.py. I'm inclined to suggest people write their own crawlers or use a Python utility package for functionality like this. There are a lot of vagaries (like what if the "elements" in the containers aren't other containers or tensors?), and the functionality is more about dealing with Python data structures than it is about dealing with tensors.

NumPy doesn't have any functionality like this, does it?

@vadimkantorov
Contributor Author

vadimkantorov commented Dec 8, 2021

Yeah, I just propose to use the opportunity and introduce torch.to as a slightly more generic version that supports basic Python structures (also supported by TorchScript), be it with pytree utils or not. I'm proposing this on UX grounds, so I can't argue with the point that this isn't an "impediment". However, similar functions get re-rolled in practically every project.

I somewhat agree, but even when implementing custom device-movers for custom structures, having an existing routine that works for the known types would make the code simpler. My code above just passes through non-PyTorch things. And I also think it would be nicer to have more modular/reusable (but still simple enough) "collation" / "conversion" routines.

NumPy doesn't have a concept of device anyway, so it obviously doesn't have this functionality (and the need for other automated conversions, like dtype conversions, is much smaller), but that comparison isn't really relevant IMO.

@vadimkantorov
Contributor Author

vadimkantorov commented Mar 29, 2022

I've since seen more examples of pytree, so I now understand better that pytree traversal can indeed be used to implement this recursive device transfer :) But I would still suggest that this is a good default behavior for a (to-be-introduced) torch.to, without forcing users to write their own pytree'd version, since this recursive mode is often needed.

There might also be optimizations like preserving data sharing, so that if we do torch.to(torch.split(...), ...) the converted tensors are still views over parts of a single tensor. But it might be too special-case.
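
A hypothetical sketch of that optimization (assuming 1D contiguous views as produced by torch.split, and an available CUDA device; the private ._base attribute is used only to illustrate the idea): move the shared base tensor once, then re-create the views on the device.

import torch

x = torch.randn(10)
chunks = torch.split(x, 2)                  # views over x's storage
base = chunks[0]._base                      # the shared base tensor (here: x); private attribute
moved_base = base.to("cuda")                # a single host->device copy
moved_chunks = [moved_base[c.storage_offset():c.storage_offset() + c.numel()]
                for c in chunks]            # still views over one device tensor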

@vadimkantorov
Contributor Author

I also wonder if tree_map is parallel (akin to the _foreach methods). For torch.to(..., non_blocking=True) it probably doesn't matter as much, but still. It could also probably benefit from a single cudaMalloc call (at least for all strided tensors) by flattening all the tensors in the input.
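
A hedged sketch of that single-allocation idea, using the internal helpers torch._utils._flatten_dense_tensors / _unflatten_dense_tensors (private APIs; this assumes all tensors share one dtype and is only meant to illustrate the concept):

import torch
from torch._utils import _flatten_dense_tensors, _unflatten_dense_tensors

def batched_to(tensors, device, non_blocking=True):
  # Pack all tensors into one contiguous buffer, issue a single copy,
  # then carve the result back into per-tensor views over that buffer.
  flat = _flatten_dense_tensors(tensors)
  flat = flat.to(device, non_blocking=non_blocking)
  return _unflatten_dense_tensors(flat, tensors)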

@vadimkantorov
Contributor Author

vadimkantorov commented May 10, 2022

Related PR that actually implements this recursive utility: #77187, but not in a generic namespace :(

@rohan-varma
Member

Found this issue after @vadimkantorov's comment on a related PR. Agreed that such a utility would be quite useful, and PT-D would then not need custom logic to move inputs for DDP / FSDP.

@albanD @jbschlosser @mruberry Do we have any new thoughts on this feature and whether this is something that the core team might be able to address?

@vadimkantorov vadimkantorov changed the title [proposal] [util] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors [proposal] [util] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors probably by uplifting/promoting trorch.distributed.utils._recursive_to May 11, 2022
@vadimkantorov vadimkantorov changed the title [proposal] [util] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors probably by uplifting/promoting trorch.distributed.utils._recursive_to [proposal] [util] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors probably by uplifting/promoting torch.distributed.utils._recursive_to May 11, 2022
@albanD
Collaborator

albanD commented May 11, 2022

Any reason for not using pytree as suggested above in your case?

@mruberry
Collaborator

mruberry commented May 12, 2022

I understand this would be convenient, but we try not to add sugar to core PyTorch operations.

@jbschlosser
Contributor

jbschlosser commented May 12, 2022

To be explicit: this would involve use of pytree.tree_map on the data structure with lambda t: t.to(...).
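
A minimal sketch of that, assuming the private torch.utils._pytree module and an available CUDA device:

import torch
from torch.utils._pytree import tree_map

batch = {"input": torch.randn(2, 3), "lengths": [3, 2], "mask": (torch.ones(2, 3),)}
moved = tree_map(lambda x: x.to("cuda") if torch.is_tensor(x) else x, batch)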

@albanD
Collaborator

albanD commented May 12, 2022

Well, for nn.Module that already works, since you can call .to() on the top-level Module.

@vadimkantorov
Contributor Author

There is some stuff about streams; not sure if tree_map will be able to handle it?

It may also be a good idea to support non_blocking, and to call custom .to methods (if they exist) in addition to tensors.

@vadimkantorov
Contributor Author

vadimkantorov commented Dec 23, 2022

So now in theory, torch.to could just be implemented in terms of torch.utils._pytree.tree_map.

Compared to vanilla tree_map, torch.to could theoretically do all allocations in one batch (asynchronously) / in a single allocation, but I'm not sure what a good design for that would be. Although I'm not even sure whether one large allocation is better than many smaller ones, w.r.t. reuse of already-allocated segments that might not be contiguous.

@vadimkantorov
Contributor Author

Although in another context (sparse tensors), this post describes an optimization that torch.to could also do (minimizing the number of host->device copies): https://pytorch.org/blog/optimizing-production-pytorch-performance-with-graph-transformations/#31-combining-input-sparse-features

@vadimkantorov vadimkantorov changed the title [proposal] [util] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors probably by uplifting/promoting torch.distributed.utils._recursive_to [feature request] torch.to(obj, device) supporting recursive lists/dicts/tuples of tensors probably by uplifting/promoting torch.distributed.utils._recursive_to Jan 2, 2024
@albanD
Collaborator

albanD commented Jan 2, 2024

I guess pytree would be the way to go for this in 2024 :D
new = tree_map_only(torch.Tensor, lambda t: t.to(device), old)
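
Spelled out with the import (tree_map_only lives in the private torch.utils._pytree module; device and old below are just placeholders):

import torch
from torch.utils._pytree import tree_map_only

device = "cuda"                           # placeholder target device
old = {"x": torch.randn(3), "y": [1, 2]}  # placeholder nested object
new = tree_map_only(torch.Tensor, lambda t: t.to(device), old)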

@vadimkantorov
Contributor Author

vadimkantorov commented Jan 2, 2024

On the workability side, yes, it would cut it, but having some built-in option for a separate copy thread, or for efficiently moving a lot of tensors in one allocation, might be interesting - especially if it's already implemented in the distributed context. Otherwise, on the functional programming side, I think implementing torch.to as tree_map_only is a worthy shortcut for users to have in core, and good for consistency between functions/methods :)

@albanD
Collaborator

albanD commented Jan 2, 2024

I actually do not think that changing torch.to() like this would be good for consistency. That would make the behavior significantly different between the function and the method.

@vadimkantorov
Contributor Author

For basic individual tensors, the behavior would be exactly the same anyway? So for all inputs acceptable to the instance .to method, the torch.to function would behave the same, right? I'm proposing to keep them strictly identical for all individual tensors (so both .to and torch.to would continue to exist), and to potentially have more knobs/functionality for more complex inputs, like tensor lists or pytree object hierarchies.
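
A hypothetical sketch of what such a torch.to could look like (the name and behavior here are assumptions, not an existing API), built on tree_map_only so that plain tensors behave exactly like Tensor.to:

import torch
from torch.utils._pytree import tree_map_only

def to(obj, *args, **kwargs):
  # Plain tensors: identical to the Tensor.to method.
  if torch.is_tensor(obj):
    return obj.to(*args, **kwargs)
  # Nested containers: move every tensor leaf, pass everything else through.
  return tree_map_only(torch.Tensor, lambda t: t.to(*args, **kwargs), obj)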

@vadimkantorov
Contributor Author

vadimkantorov commented Apr 16, 2024

Btw _recursive_to got copied over in https://github.com/pytorch/pytorch/blob/main/torch/distributed/optim/zero_redundancy_optimizer.py#L29

So maybe it is the right moment for it to be promoted at least to torch._to?

@albanD
Collaborator

albanD commented Apr 16, 2024

We should remove this code and just use tree_map there instead. It's going to be a lot more reliable...

@vadimkantorov
Contributor Author

just use tree_map there instead

I think it'd also be a great implementation for a publicly advertised shortcut :) Otherwise, this duplicated code exists both in torch/distributed/utils.py and in this zero_redundancy_optimizer.py.

A more custom op could also somehow parallelize the copies (if a large tensor list is passed as input) using multiple CUDA threads, or allocate a single large contiguous memory chunk on the GPU (and maybe do all of this asynchronously, if that makes it possible to schedule the copies on a user-provided background CUDA stream used for copies).
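
A hypothetical sketch of issuing the copies on a user-provided side stream (assumes a CUDA device; for truly asynchronous host->device copies the source tensors would also need to be in pinned memory):

import torch

copy_stream = torch.cuda.Stream()

def to_device_async(tensors, device="cuda"):
  with torch.cuda.stream(copy_stream):
    out = [t.to(device, non_blocking=True) for t in tensors]
  # Make the current stream wait for the copies before the results are used.
  torch.cuda.current_stream().wait_stream(copy_stream)
  return out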

@albanD
Collaborator

albanD commented Apr 17, 2024

I think that if we need to add one more API for our users to learn about, I'd prefer for them to learn about pytree. It will allow them to solve many of their problems with one line, compared to a specialized API that only solves a single problem.

@vadimkantorov
Contributor Author

I think torch.to is a very natural thing to search for if you have already used tensor.to (as most operations exist both as instance methods and as functions). But even if this pytree-to-implement-generic-to idiom is just publicized via HF and PyTorch code examples - that would already be great!

Another advantage of introducing torch.to is that some optimizations could later be made for processing large lists.

@albanD
Collaborator

albanD commented Apr 17, 2024

In theory, nothing prevents Dynamo from tracing pytree work and doing fancy optimization there :D

cc @zou3519 for pytree as a public feature
