AOT Autograd - Contiguous tensors #537

Open
anijain2305 opened this issue Feb 25, 2022 · 7 comments

Comments

@anijain2305
Contributor

anijain2305 commented Feb 25, 2022

AOT Autograd does not handle non-contiguous tensors correctly right now.

One way to handle this is to trace the backward pass while forcing the forward outputs to be contiguous, and then call contiguous on the incoming gradient tensors in the backward pass. This is done in #536. However, this can introduce significant and unnecessary overhead.

The other option is to record the strides of the forward outputs in the forward pass and then restride the incoming gradients accordingly in the backward pass. We have to investigate whether that can be done for all cases.
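For illustration only (hypothetical helper names, not the actual #536 change), the two options might look roughly like this at the backward boundary:

```python
import torch

def force_contiguous(grad_outs):
    # Option 1: unconditionally make the incoming gradients contiguous before
    # feeding them to the traced backward graph. Simple, but pays for a copy
    # even when it isn't needed.
    return [g.contiguous() for g in grad_outs]

def restride_to_forward_outputs(grad_outs, recorded_strides):
    # Option 2: restride each incoming gradient to match the strides recorded
    # for the corresponding forward output, copying only when the layouts differ.
    restrided = []
    for g, strides in zip(grad_outs, recorded_strides):
        if g.stride() == strides:
            restrided.append(g)
        else:
            out = torch.empty_strided(g.shape, strides, dtype=g.dtype, device=g.device)
            out.copy_(g)
            restrided.append(out)
    return restrided
```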

@Chillee
Contributor

Chillee commented Feb 25, 2022

@anijain2305 Can you add an example failure case?

@anijain2305
Contributor Author

anijain2305 commented Feb 25, 2022

When I hit this error, I was running the TorchDynamo + AOT Autograd setup. I was unable to extract a standalone subgraph, using just AOT Autograd, that could expose this issue.

But let me think. Now that I understand the issue better, I might be able to come up with an example.

@ezyang
Contributor

ezyang commented Mar 1, 2022

This feels like it is probably because the tracer isn't replicating strides correctly (which is an easy mistake to make). If so, we should be able to make progress on this.

EDIT: OK well it's not /completely/ busted, see

```python
strides=elem.stride(), storage_offset=elem.storage_offset(),
```
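That line comes from the tracer's tensor wrapper; a minimal sketch of the pattern (assuming the torch.Tensor._make_wrapper_subclass API, with an illustrative class name) is:

```python
import torch

# Sketch of a tracing tensor subclass that mirrors the wrapped tensor's
# layout, so strides and storage offset are preserved on the wrapper.
class TracerTensor(torch.Tensor):
    @staticmethod
    def __new__(cls, elem):
        return torch.Tensor._make_wrapper_subclass(
            cls,
            elem.size(),
            strides=elem.stride(),
            storage_offset=elem.storage_offset(),
            dtype=elem.dtype,
            device=elem.device,
            requires_grad=elem.requires_grad,
        )
```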

@Chillee
Contributor

Chillee commented Mar 1, 2022

@ezyang The problem is sadly ... more fundamental.

The problem is that, given f, we AOT trace out f_out, backwards_function = vjp(f, inputs). Then, even though we don't have the actual input to backwards_function, we know that its shape, dtype, and device must be identical to those of f_out. The problem is that the strides are not necessarily identical, and that's what's causing issues here.

In some sense this is kind of a stupid error. It only happens at the boundaries of the backwards pass, so at worst it can be fixed with a single extra contiguous call there. But... that somewhat pessimizes the performance for small cases, which is why I've resisted doing it :(
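As a small illustration of the mismatch (an assumed example, not taken from the original report): the forward output can be a non-contiguous view, while the gradient that eventually reaches the backward pass has a different layout, since autograd only ties its shape, dtype, and device to the output.

```python
import torch

def f(x):
    # The forward output is a non-contiguous view (a transpose).
    return x.t()

x = torch.randn(3, 4, requires_grad=True)
out = f(x)
print(out.stride())       # (1, 4): non-contiguous

# The incoming gradient only has to match shape/dtype/device, not strides.
grad_out = torch.ones(4, 3)
print(grad_out.stride())  # (3, 1): contiguous
out.backward(grad_out)    # eager mode handles this fine

# A backward graph traced assuming grad_out has out's strides (1, 4) would be
# specialized to the wrong layout when this contiguous gradient shows up.
```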

@ezyang
Contributor

ezyang commented Mar 2, 2022

Maybe we can just take a stride argument to vjp lol

@Chillee
Contributor

Chillee commented Mar 2, 2022

Maybe... I am wondering whether we're going to need to relax the current restrictions with __torch_dispatch__ though... Like, it seems reasonable to me that the inputs to vjp could be an arbitrary tensor type.

In which case we're going to need to change our tracing/caching strategy, more or less.

Currently we trace the forwards + backwards graph upon hitting the forwards pass (and cache on the forwards pass's inputs). Instead, we might need to cache on the inputs to the backwards pass as well, and it's a bit unclear what that would look like.
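Very roughly, and purely as an assumed sketch of the bookkeeping (none of these helpers exist in AOT Autograd), the difference between the two caching strategies is which tensors' metadata is available when the key is built:

```python
import torch

def tensor_key(t: torch.Tensor):
    # Metadata the compiled graph is implicitly specialized on.
    return (tuple(t.shape), t.dtype, t.device, t.stride(), t.requires_grad)

def forward_cache_key(inputs):
    # Today: the forward + backward trace is produced (and cached) when the
    # forward runs, so only the forward inputs are available for the key.
    return tuple(tensor_key(t) for t in inputs)

def backward_cache_key(grad_outputs):
    # Caching on the backward inputs would also need the incoming gradients'
    # metadata, but those tensors don't exist yet at forward-trace time.
    return tuple(tensor_key(g) for g in grad_outputs)
```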

@Chillee
Contributor

Chillee commented Mar 4, 2022

> Maybe we can just take a stride argument to vjp lol

Also, sorry, this doesn't even work. The problem is that, at the time we call vjp, we don't know what the stride of the backwards input is.
