
[FX] intermediate types of empty lists/dicts not preserved during torch.fx tracing #49935

Open
esqu1 opened this issue Dec 29, 2020 · 5 comments
Assignees
Labels
oncall: jit · TSRootCause:DefaultTypes · TSUsability
Projects

Comments


esqu1 commented Dec 29, 2020

🐛 Bug

On occasion we may want to pass, for example, an empty list to a leaf function. In TorchScript, such empty lists are assumed to have type List[torch.Tensor], following TorchScript's Tensor type defaulting behavior. To get around this defaulting, we can either:

  1. Annotate the type of the variable: var1: List[str] = []
  2. Use torch.jit.annotate to notify the TorchScript compiler: var1 = torch.jit.annotate(List[str], [])

However, during torch.fx symbolic tracing, neither of these methods annotates the variables in the resulting GraphModule. Thus, TorchScript will still assume that these are List[Tensor] and may fail to compile the resulting module.
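A minimal, torch-free sketch of why the annotation is lost (the `annotate`, `Proxy`, and `traced_identity` names here are illustrative stand-ins, not the real FX internals): `torch.jit.annotate(T, value)` is a runtime no-op that simply returns `value`, so a tracer that records only the values flowing through calls never sees the type:

```python
class Proxy:
    """Stand-in for an FX Proxy: represents a traced tensor input."""
    def __init__(self, name):
        self.name = name

def annotate(typ, value):
    # torch.jit.annotate is a runtime no-op: it just returns `value`,
    # so by the time the tracer sees the argument, the type is gone.
    return value

recorded_args = []

def traced_identity(x, t):
    recorded_args.append(x)  # the tracer records only the plain value
    return x

out = traced_identity(annotate("List[str]", []), Proxy("x"))
print(recorded_args)  # [[]]  -- no trace of List[str] survives
```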

To Reproduce

Suppose that my_ops::identity is a custom op registered with the following implementation:

c10::List<std::string> identity(c10::List<std::string> x, at::Tensor& t) {
  return x;
}

TORCH_LIBRARY(my_ops, m) {
  m.def("identity", &identity);
}

Then use this in a module and trace it:

import torch
from typing import List
from torch.fx import symbolic_trace

class TestModule(torch.nn.Module):
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out = torch.ops.my_ops.identity(
            torch.jit.annotate(List[str], []),
            x
        )
        return out

graph_module = symbolic_trace(TestModule())
print(graph_module)

yielding:

import torch
def forward(self, x : torch.Tensor):
    identity = torch.ops.my_ops.identity([], x)
    return identity

Finally, run the graph module through TorchScript via torch.jit.script(graph_module).

Expected behavior

TorchScript should ideally compile fine.

Actual behavior

The TorchScript compiler complains about the empty list:

my_ops::identity(str[] x, Tensor t) -> str[]:
Expected a value of type 'List[str]' for argument 'x' but instead found type 'List[Tensor]'.
Empty lists default to List[Tensor]. Add a variable annotation to the assignment to create an empty list of another type (torch.jit.annotate(List[T, []]) where T is the type of elements in the list for Python 2)
:
import torch
def forward(self, x : torch.Tensor):
    identity = torch.ops.my_ops.identity([], x)
               ~~~~~~~~~~~~~~~~~~~~~~~~~ <--- HERE
    return identity


cc @gmagogsfm
@ansley ansley self-assigned this Jan 5, 2021
@ansley ansley added the oncall: jit label and removed the fx label Jan 12, 2021
@github-actions github-actions bot added this to Need triage in JIT Triage Jan 12, 2021

ansley commented Jan 12, 2021

After doing a fair bit of research, it looks like this isn't something we can change from the FX side. However, there is a possible fix on the frontend: we need to make it so that the frontend doesn't automatically type empty lists as List[Tensor] and empty dicts as Dict[str, Tensor]. Let's discuss whether this can (or should) happen now, as it would help unblock some FX users. I believe @penguinwu mentioned this as a project as well.

gmagogsfm (Contributor) commented

Could you fill in some details on why this isn't viable by changing the FX side? I think James mentioned something about preserving annotations; is that infeasible?


ansley commented Jan 19, 2021

@gmagogsfm It's a lot harder than you'd think to preserve existing annotations in this case. In Python, functions, methods, modules, and class objects store some of their annotations, which can be retrieved via __annotations__ or typing.get_type_hints. However, functions/methods only store the annotations for their args and return value. Storing local variable annotations in the function scope was rejected because it's such an expensive operation (link).
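This is easy to check in plain Python: only parameter and return annotations survive on the function object, while local variable annotations (per PEP 526) are not stored anywhere retrievable:

```python
from typing import List

def f(x: int) -> str:
    y: List[str] = []  # local variable annotation: discarded at runtime
    return str(x)

# Only the parameter and return annotations are kept on the function object;
# there is no entry for the local variable 'y'.
print(f.__annotations__)
```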

The first thing I thought of doing was walking back up the stack, getting the calling frame, and examining the context of the code responsible for that frame. I ran into a lot of problems with this, though. It required me to make some uncomfortable assumptions about the code.
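For reference, the kind of frame inspection being described looks roughly like this (the `caller_info` and `some_traced_function` names are hypothetical); the fragile part is going from the frame back to reliable source-level type information:

```python
import inspect

def caller_info():
    # Walk one frame up the stack to the frame of whoever called us.
    frame = inspect.currentframe().f_back
    # We can recover the calling function's name and line number...
    # ...but mapping that back to the annotation on a particular
    # assignment requires re-parsing source, which is where the
    # uncomfortable assumptions come in.
    return frame.f_code.co_name, frame.f_lineno

def some_traced_function():
    return caller_info()

name, lineno = some_traced_function()
print(name)  # some_traced_function
```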

I eventually came up with a solution that involved AST rewrites and a custom tracer. (I can explain my design more if you're interested.) Unfortunately, it had an awful time complexity. James and Zach discussed the issue, and we eventually came to the conclusion that this is not a feature that we should pursue from the FX side.
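As a rough illustration of the AST side (not the actual design discussed above), one can locate `torch.jit.annotate` calls in source and recover the annotated type; doing this for every traced function and threading the result through a custom tracer is what blows up the complexity:

```python
import ast

src = """
def forward(self, x):
    out = torch.ops.my_ops.identity(torch.jit.annotate(List[str], []), x)
    return out
"""

class AnnotateFinder(ast.NodeVisitor):
    """Collect the type expression from each *.annotate(...) call."""
    def __init__(self):
        self.found = []

    def visit_Call(self, node):
        if isinstance(node.func, ast.Attribute) and node.func.attr == "annotate":
            # First argument of torch.jit.annotate is the type expression.
            self.found.append(ast.unparse(node.args[0]))
        self.generic_visit(node)

finder = AnnotateFinder()
finder.visit(ast.parse(src))
print(finder.found)  # ['List[str]']
```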

SplitInfinity commented

A potential solution for this is to improve JIT type inference to make smarter decisions about types of lists and dicts.

gmagogsfm (Contributor) commented

If I remember correctly, @ansley is working on this. Should it be moved out of "in discussion"?

@ansley ansley moved this from In discussion to In progress in JIT Triage Feb 24, 2021