Auto forward non method attribute lookups to the user's model and bind custom methods to ORTModule #8798

baijumeswani · 2021-08-20T19:46:51Z

This pull request:

For non methods attributes: auto forwards any call made to the user's original torch.nn.Module through ORTModule by implementing the methods:
- __getattr__: Implemented for attribute lookups that could not be found in ORTModule and so a lookup is performed on the user's torch.nn.Module.
- __setattr__: Implemented so that users can set attributes on their original module with ease. All attributes are set on the user provided module instead of on ORTModule. This is also used as a way to signal to ORTModule that the execution graph has changed and therefore a re-export must be done before the next forward call. This is done automatically by this implementation. The auto re-export can be controlled by enabling the skip check for re-building the graph, re creating the execution agent and so on (export ORTMODULE_SKIPCHECK_POLICY="SKIP_CHECK_BUILD_GRADIENT|SKIP_CHECK_EXECUTION_AGENT").
For methods attributes: copies user defined methods and binds them to the ORTModule instance. This is done in order to prevent the problem where user defined methods invoke the forward on the model thereby calling the PyTorch module implementation of forward as opposed to ORTModule's implementation of forward.

Users can now seamlessly use their training script with ORTModule without needing to change how they invoke user defined methods on their original torch.nn.Module. Here is an example:

# User defined torch.nn.Module 
class UserDefinedMethodsNet(torch.nn.Module): 
    def __init__(self): 
        super(UserDefinedMethodsNet, self).__init__() 

    def forward(self, ...): 
        ... 

    def custom_user_method(self, ...): 
        return some_calculation()  

    def training_step(self, ...):
        out = self(...)
        ...

# Instantiation of ORTModule 
model = UserDefinedMethodsNet() 
model = ORTModule(model) 

# Invoke user defined function 
out = model.custom_user_method() # No AttributeError since ORTModule auto forwards the call to the original module 
model.training_step(...) # ORTModule's forward will be executed and not the user defined forward

In addition, ORTModule checks for any attribute name collisions between the user's model and ORTModule.

SherlockNoMad · 2021-08-20T23:07:10Z

Another thing to consider is that what if there is private variable used in user defined functions?

def custom_functions(): self.state = update

IIUC, current implementation only cover stateless user defined functions?

baijumeswani · 2021-08-20T23:14:06Z

Another thing to consider is that what if there is private variable used in user defined functions?

def custom_functions(): self.state = update

IIUC, current implementation only cover stateless user defined functions?

Do you mean for auto detecting attribute change and marking the model for re-export? Then yes, we cannot always detect that the user has made changes to the model, especially if the changes are being made to the model from a path that cannot be controlled by ORTModule. One way to ensure that the model get's re-exported is by exposing a method (mark_execution_graph_as_stale) to the user that should be explicitly called whenever they made a change to the model.

orttraining/orttraining/python/training/ortmodule/_utils.py