
✨[Feature] Support list and namedtuple input types to forward function #798

Closed

chaoz-dev opened this issue Jan 8, 2022 · 9 comments
Assignees
Labels
feature request New feature or request release: v1.2 Tagged to be included in v1.2

Comments

@chaoz-dev
Contributor

chaoz-dev commented Jan 8, 2022

Is your feature request related to a problem? Please describe.

Currently, the forward function only supports tensor input types when compiling. However, sometimes we wish to supply many tensors to the forward function at once (say, more than 10); this results in a very long forward API call where every tensor must be listed individually. It would be helpful if we could instead pass a single container holding all of these tensors, which makes for a much cleaner API call.

For this specific request, I focus on the list and namedtuple input types first, since these should cover most basic use cases (and should functionally satisfy named tensor key-value pair type inputs).

Describe the solution you'd like

Instead of supporting only the following, where we need to supply torch.Tensors to forward individually:

  import torch
  import torch_tensorrt

  DEVICE = torch.device("cuda:0")
  SHAPE = (1, 1)

  torch.manual_seed(0)

  class Model(torch.nn.Module):
      def __init__(self):
          super().__init__()

      def forward(self, a, b):
          return a - b

  if __name__ == "__main__":
      tensor = torch.randn(SHAPE, dtype=torch.float32, device=DEVICE)

      model = Model().eval().to(DEVICE)
      out = model(tensor, tensor)

      model_trt = torch_tensorrt.compile(
          model,
          inputs=[
              torch_tensorrt.Input(shape=SHAPE),
              torch_tensorrt.Input(shape=SHAPE),
          ],
          enabled_precisions={torch.float},
      )
      out_trt = model_trt(tensor, tensor)

      assert torch.max(torch.abs(out - out_trt)) < 1e-6

also support passing a namedtuple or list into forward:

  from collections import namedtuple

  import torch
  import torch_tensorrt

  DEVICE = torch.device("cuda:0")
  SHAPE = (1, 1)

  torch.manual_seed(0)

  Input = namedtuple('Input', ['t1', 't2'])

  class Model(torch.nn.Module):
      def __init__(self):
          super().__init__()

      def forward(self, input_: Input):
          return input_.t1 - input_.t2

  if __name__ == "__main__":
      tensor = torch.randn(SHAPE, dtype=torch.float32, device=DEVICE)
      input_ = Input(tensor, tensor)

      model = Model().eval().to(DEVICE)
      out = model(input_)

      model_trt = torch_tensorrt.compile(
          model,
          inputs=[
              torch_tensorrt.Input(shape=SHAPE),
              torch_tensorrt.Input(shape=SHAPE),
          ],
          enabled_precisions={torch.float},
      )
      out_trt = model_trt(input_)

      assert torch.max(torch.abs(out - out_trt)) < 1e-6

Describe alternatives you've considered

Currently the only alternative is to supply tensors directly into the forward function; supplying namedtuples will cause the compilation to segfault, and supplying lists will cause the compilation to fail to recognize the input altogether.

Additional context

  • For simplicity, the input containers should contain ONLY tensors (implying that we disallow nested containers). Containers with mixed input types are ignored.
  • Furthermore, there must be a bijection between the tensors in the container and the sizes provided to the compile call; i.e., there must be one Input size for each tensor in the container, and both are taken in the same order.
  • We can mix tensors and containers in the forward call (e.g. forward(x: torch.Tensor, y: List[torch.Tensor], z: namedtuple[torch.Tensor])). Any other input types are treated as they are today.
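The constraints above can be sketched in plain Python. This is a minimal sketch, not part of torch_tensorrt; `validate_container` and `count_expected_inputs` are hypothetical helper names:

```python
def validate_container(container):
    """Hypothetical validation reflecting the constraints above: a container
    is accepted only if every element is a leaf (no nested lists/tuples);
    otherwise it is ignored and left as-is."""
    return all(not isinstance(elem, (list, tuple)) for elem in container)


def count_expected_inputs(args):
    """Count how many torch_tensorrt.Input sizes the compile() call would
    need to supply under the bijection rule above: one per plain tensor,
    plus one per element of each accepted container, in argument order."""
    total = 0
    for arg in args:
        # namedtuples are tuple subclasses, so this covers them too
        if isinstance(arg, (list, tuple)) and validate_container(arg):
            total += len(arg)
        else:
            total += 1
    return total
```

For example, a forward call taking one tensor, a list of two tensors, and a two-field namedtuple would require five Input sizes in the compile call.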
@chaoz-dev chaoz-dev added the feature request New feature or request label Jan 8, 2022
@chaoz-dev
Contributor Author

@narendasan Let me know if the behaviors listed under Additional context make sense. In particular, I believe we currently ignore other input types going into forward... if we allow them at all?

I can try taking a crack at the implementation here later when I get a chance.

@chaoz-dev
Contributor Author

chaoz-dev commented Jan 8, 2022

Ah this might be a duplicate of #428, although this request might be slightly less ambitious.

@narendasan
Collaborator

@chaoz-dev Yeah, this is reasonable. We have been working on a design doc for this sort of feature here: #629. @inocsin has been working on the first steps, with arbitrary mixes of tuples (since they are fixed size) and tensors as inputs and outputs. Need to check with him on whether he has a public dev branch, but help here is greatly appreciated.

@chaoz-dev
Contributor Author

Sounds good, I'll take a look at the design doc and make some suggestions there for review. I had a quick look at the code, and my naive first pass would be to unpack input containers in the compile function in torch_tensorrt/ts/_compiler.py, before it hits the actual compilation step, so that compilation always sees a flat list of tensors. I believe this should satisfy the basic aspects of inputting an iterable container of tensors.
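To illustrate the idea (this is a hypothetical sketch, not the actual torch_tensorrt code; `FlatteningWrapper` is an invented name), a wrapper could unpack one level of list/namedtuple arguments into a flat argument list before delegating to the underlying module:

```python
class FlatteningWrapper:
    """Hypothetical sketch: wrap a (compiled) module so that list and
    namedtuple arguments are unpacked into a flat argument list before
    the underlying forward is called."""

    def __init__(self, compiled_module):
        self.compiled_module = compiled_module

    def __call__(self, *args):
        flat = []
        for arg in args:
            # namedtuples are tuple subclasses, so this covers them too
            if isinstance(arg, (list, tuple)):
                flat.extend(arg)
            else:
                flat.append(arg)
        return self.compiled_module(*flat)
```

With this, callers could keep passing containers to the wrapped module while the compiled module itself only ever sees individual tensors.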

@narendasan
Collaborator

Seems reasonable to take the step of adding support for one collection of inputs of any type. But we need to do this in compiler.cpp, since we need to support both the C++ and Python APIs, and we need to be able to construct a new module with the correct interface; otherwise users cannot reuse the same input formatting code in their applications.

@chaoz-dev
Contributor Author

Yeah that makes sense. I'll take a look at this shortly.

@chaoz-dev
Contributor Author

Deferring to @inocsin in #629 here.

@github-actions

This issue has not seen activity for 90 days. Remove the stale label or add a comment, or this will be closed in 10 days.

@ncomly-nvidia ncomly-nvidia added the release: v1.2 Tagged to be included in v1.2 label Jul 26, 2022
@narendasan
Collaborator

Initial feature support has been merged.
