
Support recursive data type in TorchScript #42487

Open
yf225 opened this issue Aug 3, 2020 · 2 comments
Labels: high priority, oncall: jit, triaged

Comments

@yf225
Contributor

yf225 commented Aug 3, 2020

🚀 Feature

It would be really awesome to support recursive data types in TorchScript. For example:

import torch
from typing import Dict

class TypedDataDict(object):
  def __init__(self):
    self.str_to_dict: Dict[str, 'TypedDataDict'] = {}

  def set_str_to_dict(self, value: Dict[str, 'TypedDataDict']):
    self.str_to_dict = value


class TestModule(torch.nn.Module):
  def __init__(self):
    super().__init__()

  def forward(self, input):
    return TypedDataDict().set_str_to_dict({"123": TypedDataDict()})


m = TestModule()
m_scripted = torch.jit.script(m)
m_scripted(torch.tensor(1.))

'''
Currently throws:

RuntimeError: 
Assignment to attribute 'str_to_dict' cannot be of a type that contains class '__torch__.TypedDataDict'.
Classes that recursively contain instances of themselves are not yet supported:
  File "test_yf225.py", line 6
  def __init__(self):
    self.str_to_dict: Dict[str, 'TypedDataDict'] = {}
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ <--- HERE
'TypedDataDict.__init__' is being compiled since it was called from '__torch__.TypedDataDict'
  File "test_yf225.py", line 17
  def forward(self, input):
    return TypedDataDict().set_str_to_dict({"123": TypedDataDict()})
           ~~~~~~~~~~~~~ <--- HERE
'__torch__.TypedDataDict' is being compiled since it was called from 'TestModule.forward'
  File "test_yf225.py", line 17
  def forward(self, input):
    return TypedDataDict().set_str_to_dict({"123": TypedDataDict()})
           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ <--- HERE
'''

Motivation

Many DPER3 modules take inputs of nested data types like Dict[str, Dict[str, Dict[str, torch.Tensor]]], and it would be really hard to maintain if we had to add support for every variation of those nested data types to TypedDataDict (DPER3's typed dictionary class). But if recursive data types were supported, we would only need Dict[str, TypedDataDict] and Dict[str, torch.Tensor] to cover all nesting possibilities.

Having recursive data type support will greatly speed up the work of moving all PyPer models to 100% TorchScript, demonstrating TorchScript's production readiness for large-scale ranking models.
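To make the maintenance burden concrete, here is a small sketch (using float as a stand-in for torch.Tensor, with hypothetical alias names) of what covering each nesting depth by hand looks like, versus the single self-referential type that recursive support would allow:

```python
from typing import Dict

# Without recursive types, every nesting depth needs its own alias (or its
# own method variant on TypedDataDict) -- this does not scale past a few levels.
Depth1 = Dict[str, float]        # stand-in for Dict[str, torch.Tensor]
Depth2 = Dict[str, Depth1]       # Dict[str, Dict[str, Tensor]]
Depth3 = Dict[str, Depth2]       # Dict[str, Dict[str, Dict[str, Tensor]]]

# With recursive types, one self-referential class would cover any depth:
# class TypedDataDict:
#     str_to_dict: Dict[str, 'TypedDataDict']
#     str_to_tensor: Dict[str, torch.Tensor]

d: Depth3 = {"a": {"b": {"c": 1.0}}}
```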

cc. @wanchaol @suo

cc @ezyang @gchanan @zou3519 @suo @gmagogsfm

@yf225 added the oncall: jit label Aug 3, 2020
@github-actions bot added this to Need triage in JIT Triage Aug 3, 2020
@SplitInfinity moved this from Need triage to HIGH PRIORITY in JIT Triage Aug 3, 2020
@SplitInfinity added the triaged label Aug 3, 2020
@gmagogsfm gmagogsfm assigned suo and unassigned gmagogsfm Aug 11, 2020
@gmagogsfm
Contributor

Discussed with @suo offline, we will need to sync with yf225 more to figure out a solution.

@suo suo removed this from HIGH PRIORITY in JIT Triage Aug 17, 2020
@0xJchen

0xJchen commented Dec 29, 2021

Discussed with @suo offline, we will need to sync with yf225 more to figure out a solution.

Hi, guys. I wonder if there are any workarounds to avoid this problem?

I found that recursive data types are common in many algorithms (as mentioned in PEP 484). Take tree search as an example: we have a Node class, and each node also has children (say, a mapping from index to child nodes), self.children: Dict[int, 'Node'] = {}. I hit the above problem when trying to convert a tree-search algorithm to TorchScript, and I wonder if there is any way to avoid it?

This is an example:

import torch
from typing import Dict, List

@torch.jit.script
class Node(object):

    def __init__(self, prior: float):
        self.prior = prior
        self.children: Dict[int, 'Node'] = {}

    def expanded(self) -> bool:
        return len(self.children) > 0

    def expand(self, priors: List[float]):
        for id, prior in enumerate(priors):
            self.children[id] = Node(prior)

Also, I wonder: if I wrote a Cython script to manage the above tree-search logic (I can directly call those Cython functions after compilation from a normal Python script), could I still reuse them in TorchScript?
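For what it's worth, one workaround for this class of problem (a sketch, not something proposed in this thread; Tree and its methods are hypothetical names) is to store all nodes in a flat table and refer to children by integer index, so that no class ever contains an instance of its own type. Shown in plain Python for brevity; in TorchScript you would decorate the class with @torch.jit.script:

```python
from typing import Dict, List

class Tree:
    """Index-based tree: node i's data lives at position i of each list."""

    def __init__(self):
        self.priors: List[float] = []             # per-node data
        self.children: List[Dict[int, int]] = []  # action id -> child node index

    def add_node(self, prior: float) -> int:
        # Append a new leaf and return its index.
        self.priors.append(prior)
        self.children.append({})
        return len(self.priors) - 1

    def expand(self, node: int, priors: List[float]) -> None:
        # Create one child per action, keyed by action id.
        for action, prior in enumerate(priors):
            self.children[node][action] = self.add_node(prior)

    def expanded(self, node: int) -> bool:
        return len(self.children[node]) > 0
```

The trade-off is that callers pass integer node handles around instead of Node objects, but the types involved (List, Dict, int, float) are all TorchScript-friendly.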

cc @yf225 @suo @gmagogsfm

Projects
None yet
Development

No branches or pull requests

6 participants