Fix the way imports are done to be more correct for static type checkers #47027

gramster · 2020-10-28T22:43:42Z

🐛 Bug

There are a number of imports in torch of submodules that are implicitly reexported. E.g. you can do (and it is commonly done in examples):

import torch

torch.nn.<something>

In __init__.py, the submodules are imported as (using the nn example):

import torch.nn

But to be properly re-exported for static type checkers, this is not correct. Instead, something like:

import torch.nn as nn

is needed.

cc @ezyang @malfet @rgommers @xuzhao9 @gramster


jakebailey · 2020-10-29T21:31:32Z

For context on this, there's a typing-sig thread here about settling on some common behaviors in py.typed packages: https://mail.python.org/archives/list/typing-sig@python.org/thread/YLJPWECBNPD2K4TRIBRIPISNUZJCRREY/

Which we distilled into a doc here: https://github.com/microsoft/pyright/blob/master/docs/typed-libraries.md

It might be the case that the form should be from torch import nn as nn to fit the PEP 484 pattern.

ezyang · 2020-10-30T15:25:28Z

Thanks for the report, we should definitely fix these.

gramster · 2020-11-17T19:35:32Z

We'd like this fix to get in for Pylance to work better with Pytorch in Visual Studio Code; should we do a PR?

rgommers · 2020-11-17T19:44:43Z

Thanks for asking @gramster, a PR would be great.

nurpax · 2020-12-18T11:11:20Z

@gramster, did you by any chance make a PR for this in pytorch?

@jakebailey Is the conclusion that the correct way to import would be from torch import nn as nn or is import torch.nn as nn ok too with pyright? It feels somewhat of a difficult guideline to follow given that Python developers will most likely encounter this only if they're using a specific static type checking tool that's strict about public imports. For example, it seems like mypy is more relaxed with these imports -- so it's easy to introduce this problem if you don't test with pyright. The harm in being super strict about imports means that a single wrong import line in a library means it's hard/impossible to use pylance/pyright with the problematic library. (EDIT: reading through https://mail.python.org/archives/list/typing-sig@python.org/thread/YLJPWECBNPD2K4TRIBRIPISNUZJCRREY/ it looks like import torch.nn as nn is the way to go.)

If no one's done a PR for this yet and there's a clear obviously standard way to do the imports, I could take a crack at it.

gramster · 2020-12-18T16:03:05Z

I have not done a PR yet. Maybe the simplest solution is to use __all__ but I’d be interested in Jake’s thoughts.


jakebailey · 2020-12-18T17:19:09Z

Doing a quick pass over the pyright code for this, I'd think the form would have to be from torch import nn as nn. I don't think the clarified PEP says anything about this case, and I don't recall it being mentioned in that thread. Maybe @erictraut can give an answer more quickly than me.

I know mypy has implemented this new strictness for stubs for a while, but I don't know if they've started to enforce this for py.typed modules yet. I also don't know if they attempt to emulate this side effect behavior or not. (Things start to get messy if you imagine importing a nested module like torch.foo.bar.baz as baz and asking both what you export and what you expect each module to contain.)

I wouldn't use __all__ here. You'd need to list everything out (and I mean everything), and it will have a runtime effect (unless you do want to enforce what is copied during a * import). There's a specific subset of operations that are supported to help build the list, but I don't think it scales too well.
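To illustrate the runtime effect mentioned above, here is a minimal sketch of how `__all__` changes what a `*` import copies. The module names `demo_without_all` and `demo_with_all` and the helper `star_import_names` are hypothetical and built in memory purely for illustration; nothing here is taken from torch.

```python
import sys
import types


def star_import_names(module_name, source):
    """Create an in-memory module from source and return what `import *` copies."""
    module = types.ModuleType(module_name)
    exec(source, module.__dict__)
    sys.modules[module_name] = module  # make it importable by name
    namespace = {}
    exec(f"from {module_name} import *", namespace)
    # Drop dunder entries such as __builtins__ that exec adds to the namespace.
    return sorted(name for name in namespace if not name.startswith("__"))


# Hypothetical module body: one imported module and two functions.
BODY = "import os\ndef public_fn(): pass\ndef other_fn(): pass\n"

# Without __all__, every name that does not start with an underscore is copied.
without_all = star_import_names("demo_without_all", BODY)

# With __all__, only the listed names are copied.
with_all = star_import_names("demo_with_all", BODY + "__all__ = ['public_fn']\n")

print(without_all)
print(with_all)
```

Adding `__all__` to an existing module can therefore change what downstream `from module import *` users receive, which is why it is more than a typing-only annotation.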

(I'm on break for the next two weeks; I'm not sure if I'll be able to take a look or submit something for it at the moment.)

erictraut · 2020-12-18T17:34:04Z

The rules pyright uses for re-exported imports in a py.typed library can be found here. In summary: you need to use either the redundant form of import (from torch import nn as nn) or use __all__.
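As a rough sketch of the redundant form described above, the following builds a throwaway package on disk and imports it. The package name `mylib` and its submodules `nn` and `helpers` are hypothetical stand-ins, not torch's actual layout. The point it tries to show is that both import forms behave the same at runtime; the redundant alias only signals intent to tooling that follows the py.typed guidance.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# "mylib", "nn", and "helpers" are hypothetical names used only for illustration.
root = Path(tempfile.mkdtemp())
pkg = root / "mylib"
pkg.mkdir()
(pkg / "nn.py").write_text("def layer():\n    return 'layer'\n")
(pkg / "helpers.py").write_text("def helper():\n    return 'helper'\n")
(pkg / "__init__.py").write_text(
    "# Redundant alias: marks nn as an intended re-export for type checkers.\n"
    "from . import nn as nn\n"
    "# Plain import: present at runtime, but not marked as re-exported.\n"
    "from . import helpers\n"
)

sys.path.insert(0, str(root))
mylib = importlib.import_module("mylib")

# At runtime both submodules are attributes of the package; the two forms
# differ only in what a type checker is told about the public interface.
runtime_attrs = [name for name in ("nn", "helpers") if hasattr(mylib, name)]
print(runtime_attrs)
print(mylib.nn.layer())
```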

MHDante · 2021-01-17T17:05:05Z

Is it enough to update the imports in torch's __init__.py, or would we have to recursively apply the same treatment to the __init__.py files in submodules?

If not, I've submitted #50665 for review.

erictraut · 2021-01-17T17:31:03Z

Thanks for the change @MHDante.

The answer is that it depends on which imported symbols you intend to re-export from which submodules. That's for the library authors & maintainers to decide. This mechanism gives you the ability to express that intent to the tooling.

This same change would apply to any imported symbol that you intend to re-export from any submodule. For example, if submodule "torch.a.b.c" imports symbol "foo" from another module and you want to allow consumers to import foo directly from torch.a.b.c (i.e. from torch.a.b.c import foo), then you'd need to apply a similar change to torch/a/b/c/__init__.py. If your intent is not to re-export "foo" in this case, then no change is needed because the tooling assumes no re-export by default.

Hope that makes sense.

MHDante · 2021-01-17T17:51:25Z

Right. I assume it would require a thorough revision by the torch team regarding the intentionality of exporting specific modules.

In the short term, I will update PR #50665 to expose all submodules (and intermediate submodules) that are documented in the API section of the documentation here: https://pytorch.org/docs/stable/index.html, as it is clear that the authors want those to be accessible by consumers. Hopefully with that change, at least the API as listed will work with pylance/pyright in torch v1.7.(2/3).

MHDante · 2021-01-18T04:02:12Z

Hmm. In the process of writing these imports, I came across a few oddities.

I assume calling from foo import bar as bar causes bar/__init__.py to be evaluated.
Some __init__.py files have significant overhead/side-effects at runtime.

As a result, I would hesitate to modify the code to import additional submodules in order to avoid additional overhead or regressions. This can lead to some inconsistencies in the API:

for example, currently these two statements work (in python, not pyright) :

torch.backends.cuda.is_built()
torch.backends.cudnn.is_built()

however, only one of the two is imported.

My key assumption here is that torch.backends.cudnn.__init__.py is evaluated at the time that this line is run: torch.backends.cudnn.is_built(), whereas torch.backends.cuda.__init__.py was evaluated when import torch was run.

If my key assumption is correct, I would have three questions.

For @erictraut (or anyone more familiar with the import system): Is my key assumption correct? Or am I misunderstanding the way that __init__.py works?
For @erictraut : Is there a way in pylance/pyright/mypy to expose submodules without importing them?
For the torch team: Would you be ok importing the rest of the submodules (listed in the docs as accessible from torch) during torch/__init__.py ?

So far, I've updated #50665 to the best of my ability without changing any of the loading orders. If there is no other way, I would like to move forward with those changes.

jakebailey · 2021-01-18T04:08:46Z

Imports are never done at an expression like that (the call succeeding in your example implies the module has already been executed). If the module is there at runtime but doesn't appear to be imported in the code, then it's getting imported as a side effect of another import that eventually imports that module and adds it to backends, i.e. some other file writes import torch.backends.cudnn, and then every other module that has access to backends now sees that symbol on its own backends because they're all the same object.
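The side effect described above can be reproduced with a small throwaway package. This is only a sketch: the package name `sidefx` and its layout are hypothetical and merely mimic the shape of torch.backends.cudnn. Attribute access alone never triggers an import; the submodule shows up on its parent only after some code imports it.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Hypothetical layout: sidefx/backends/cudnn.py, with empty __init__.py files.
root = Path(tempfile.mkdtemp())
backends_dir = root / "sidefx" / "backends"
backends_dir.mkdir(parents=True)
(root / "sidefx" / "__init__.py").write_text("")
(backends_dir / "__init__.py").write_text("")
(backends_dir / "cudnn.py").write_text("def is_built():\n    return True\n")
sys.path.insert(0, str(root))

backends = importlib.import_module("sidefx.backends")

# Nothing has imported the submodule yet, so the attribute does not exist.
before = hasattr(backends, "cudnn")

# Any import of the submodule, anywhere in the process, binds it on the parent.
importlib.import_module("sidefx.backends.cudnn")
after = hasattr(backends, "cudnn")

print(before, after)
print(backends.cudnn.is_built())
```

Because every importer shares the same `backends` module object, the attribute becomes visible everywhere at once, which is what makes code relying on it sensitive to import order.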

You'd have to decide what you really want the API to be (since maybe one day, code gets rewritten and someone who was relying on this gets a nice surprise).

jakebailey · 2021-01-18T04:11:35Z

FWIW back a good year or so ago, I described this in microsoft/pyright#439 (as the old MPLS sort of handled this sort of thing to handle torch, which was the biggest reliant on this behavior; this was a behavior difference between the analyses).

jakebailey · 2021-01-18T04:21:52Z

Is there a way in pylance/pyright/mypy to expose submodules without importing them?

I'm a bit confused by this question; if the module isn't actually imported or accessible, then I wouldn't expect a type checker or editor to accept or suggest it to me, no? Otherwise the code may crash at runtime because the editor or type checker told me I could use it but I couldn't.

erictraut · 2021-01-18T04:40:01Z

As Jake said, torch.backends.cudnn.is_built() will not implicitly execute cudnn/__init__.py. Since this doesn't crash at runtime, it must be relying on the side effect of some other import statement — a dangerous and fragile assumption.

Digging into this further, I think I understand the side effect that makes this work. The main module is importing torch.jit which imports torch.jit._async which imports torch.jit._builtins which imports torch.backends.cudnn. So the code is relying on this very tenuous and fragile assumption. Small refactoring changes could easily break this assumption and result in runtime errors. I'd consider this a bad bug if this were in my code base even though it happens to work at the moment.

The good news is that this is easy to fix, and it won't impact runtime performance. You can simply add an explicit import of torch.backends.cudnn in the module that depends on it. Once imported, the results will be cached, and any subsequent attempt to import it will effectively be "free".
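A minimal sketch of the caching behavior mentioned above: a module body runs once, and later imports return the cached object from `sys.modules`. The module name `cached_mod` and the `runs.log` file are hypothetical and exist only to count executions.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Hypothetical module that records each execution of its body in a log file.
MODULE_SOURCE = '''
from pathlib import Path
# Append one line every time this module body is executed.
with open(Path(__file__).with_name("runs.log"), "a") as log:
    log.write("executed\\n")
'''

root = Path(tempfile.mkdtemp())
(root / "cached_mod.py").write_text(MODULE_SOURCE)
sys.path.insert(0, str(root))

first = importlib.import_module("cached_mod")   # executes the module body
second = importlib.import_module("cached_mod")  # served from sys.modules

run_count = len((root / "runs.log").read_text().splitlines())
print(run_count, first is second)
```

Under this behavior, adding an explicit import of an already-loaded module costs little more than a dictionary lookup.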

ezyang · 2021-01-19T21:27:43Z

Yeah, all of this is a really good reason why we should fix all of this.

My general thinking is that when doing this work, we should work to avoid breaking other people's code. So if there is code that is invisibly causing more modules to come into scope, we should simply pave the cowpaths, rather than try to fix it at the same time. We can always, later, try to reduce the number of imports, but in general that has to be done more deliberately. Of course, fine to add comments to this effect.

jakebailey · 2021-02-16T18:50:18Z

FWIW I have been working on this in an unpushed branch on my fork (based on some of #50665); there are some serious evaluation order issues at play here. Each time I fix one thing, another breaks, as things are very circular and are depending on the current ordering.

I'm considering sending a PR that at least fixes the top-level API of the torch module, e.g. from . import nn as nn and so on, without any semantic changes, and potentially other "safe" ones. Would such a PR be accepted as an incremental improvement? It would at least help out editors to know that nn is intended to be part of the API.

rgommers · 2021-02-16T19:10:59Z

That sounds fine @jakebailey, probably even preferred to break it up rather than a single PR that touched almost every module.

jakebailey · 2021-02-16T19:14:14Z

Sure, I can send a low-risk small one to get the easy ones out of the way.

Out of curiosity, is it too far gone to attempt to get some of this into your next release? I don't quite know what satisfies the requirement for a cherry-pick, and unfortunately I didn't have time to work on this before what appears to be the 1.8.0 branching 😞

rgommers · 2021-02-16T19:33:00Z

Out of curiosity, is it too far gone to attempt to get some of this into your next release? I don't quite know what satisfies the requirement for a cherry-pick, and unfortunately I didn't have time to work on this before what appears to be the 1.8.0 branching

That's for the release team to decide, but I think the chance is quite low. Only critical changes are backported normally I believe.

gchanan · 2021-02-17T22:26:02Z

One thing I'm trying to understand is when you are talking about "type checkers" if you are talking about python-level guidance or pyright-level guidance.

E.g. the import rules https://github.com/microsoft/pyright/blob/master/docs/typed-libraries.md#library-interface list PEP-561 but that doesn't make any claims about how to treat "redundant" import statements like "from A import X as X".

Am I correct that this guidance is pyright specific?

jakebailey · 2021-02-17T22:29:12Z

It's not pyright specific, no, though some type checkers may be more lenient about things (I don't know to what extent mypy is run here on internal modules, or if it even understands some of the nuances of these side effects and what is "intended" to be visible).

The "redundant form" is something that's listed in PEP 484's section on stubbing, and the discussion on the typing-sig (which led to that doc) had general agreement that py.typed libraries with inlined types should behave in this way as well.

#47027 (comment) has a few links to the other sources, I just re-linked the clearer guidance doc here for reference.

erictraut · 2021-02-17T22:31:47Z

We haven't formalized these rules in the form of a PEP, but we have discussed them with the maintainers of other Python type checkers (mypy, pyre, pytype) in the typing-sig, as Jake mentioned. We've incorporated their feedback and have general agreement on this guidance.

The treatment of redundant imports is consistent with PEP 484. Admittedly, PEP 484 was talking specifically about type stubs, not ".py" files that contain inlined types, but this is a natural extension of PEP 484 guidance. Since it's solving the same problem (namely, differentiating re-exports that are intended versus those that are not), it makes sense to use the same consistent mechanism.

erictraut · 2021-02-19T20:52:18Z

The problem is that there's no good way for a type checker to intuit the difference between an imported symbol that is meant to be re-exported and an imported symbol that is not. Only the author of the module knows what was intended, and there needs to be some agreed-upon way to distinguish between these two cases. Such a mechanism was introduced in PEP 484 for stubs. Since it's the same problem faced by "py.typed" (PEP 561) packages that include inlined types, it makes sense to adopt the same approach rather than inventing a new one.

I'm sympathetic to the argument that it makes the code a little more verbose, but I'd argue that humans are probably not typically reading these long lists of import statements anyway.

jakebailey · 2021-02-19T21:15:52Z

FWIW before all of the stubs were deleted and merged in as inline types, this pattern was done in them to document what is re-exported:

pytorch/torch/__init__.pyi.in

Lines 20 to 41 in 6f396e1

    
from ._tensor_str import set_printoptions as set_printoptions
from .functional import *
from .serialization import save as save, load as load
from .autograd import no_grad as no_grad, enable_grad as enable_grad, \
    set_grad_enabled as set_grad_enabled
from ._ops import ops
from ._classes import classes
from . import autograd as autograd
from . import cuda as cuda
from . import optim as optim
from . import nn as nn
from . import multiprocessing as multiprocessing
from . import sparse as sparse
from . import onnx as onnx
from . import jit as jit
from . import hub as hub
from . import random as random
from . import distributions as distributions
from . import testing as testing
from . import quantization as quantization
from . import __config__ as __config__
from . import __future__ as __future__

From my experience, I've found re-exports like this to be pretty rare across projects. I think most libraries would have expected users to have to write import torch.nn themselves if they really wanted to use it (and we had to add hacks into MPLS specifically because torch users were expecting the opposite). I think that instances of this are going to be rare within pytorch, and __init__.py is just a hotspot.

t-vi · 2021-02-19T21:24:54Z

I'm sympathetic to the argument that it makes the code a little more verbose, but I'd argue that humans are probably not typically reading these long lists of import statements anyway.

Heya, thank you for your answer. Clearly, the Zen of Python isn't what it used to be when we put secret handshakes into our code for the benefit of machines that read it. And I would argue that this is a stark contrast between pyi that never were aimed at people much and so the hacks for the sake of the machines had much less impact than in the py files. IMHO a # type: export would signal much clearer what is going on, even if it probably would not be considered a work of art on its own.

Summary: For #47027. Some progress has been made in #50665, but in my testing trying to unwrap the circular dependencies is turning into a neverending quest. This PR explicitly exports things in the top-level torch module without any semantic effect, in accordance with this py.typed library guidance: https://github.com/microsoft/pyright/blob/master/docs/typed-libraries.md#library-interface It may be possible to do some of the other fixes just using `__all__` where needed, but `__all__` has a semantic effect I would like to further review. This PR at least fixes simple completions like `torch.nn` in Pylance/pyright. Pull Request resolved: #52339 Reviewed By: smessmer Differential Revision: D26694909 Pulled By: malfet fbshipit-source-id: 99f2c6d0bf972afd4036df988e3acae857dde3e1


pmeier · 2021-07-01T09:54:58Z

@gramster Is this still relevant for torch>=1.8.1 (especially #53675) and the current pyright==1.1.154? I don't see any failures for

import torch

model = torch.nn.Conv2d(3, 3, 1)

Do you have another example where pyright failed?

jakebailey · 2021-07-01T15:41:38Z

nn works because I sent a PR to do the easy ones, but there are a load of modules that I couldn't do because they were really difficult to fix. If you check the __init__.py you'll see the easy ones changed but some of the more complicated ones left as-is.

pmeier · 2021-07-01T15:48:32Z

@jakebailey Could you give me a simple example to get this started?

jakebailey · 2021-07-01T15:57:23Z

pytorch/torch/__init__.py

Line 714 in ccfdb30

import torch.backends.cuda

Is an example; the init imports this to effectively make backends "export" cuda so you can do torch.backends.cuda. This should really be in backends as "from . import cuda as cuda".

This same thing applied to all of the other stuff in this area is the goal.

albanD added the enhancement, module: typing, and triaged labels Oct 29, 2020

jakebailey mentioned this issue Nov 2, 2020

Autocomplete issues in PyTorch (torch.nn, others) microsoft/pylance-release#484

Closed

MHDante mentioned this issue Jan 17, 2021

Expose submodules in __init__.py for type checking #50665

Closed

jakebailey mentioned this issue Feb 17, 2021

Explicitly export submodules and variables from torch module #52339

Closed

jakebailey mentioned this issue Feb 26, 2021

I am getting "is not a known member of module" errors when using pytorch microsoft/pyright#1542

Closed

malfet mentioned this issue Mar 10, 2021

[1.8.1] Explicitly export submodules and variables from torch module #53675

Merged

pmeier self-assigned this Jul 1, 2021

jakebailey mentioned this issue Aug 27, 2021

Doesn' find the module torch.backends microsoft/pyright#2232

Closed

twoertwein mentioned this issue Sep 19, 2021

TYP: Redundant imports to define the public API pandas-dev/pandas#43664

Closed
