
[BE][Easy] export explicitly imported public submodules #127703

Closed
wants to merge 23 commits into from

Conversation

[ghstack-poisoned]

pytorch-bot bot commented Jun 2, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/127703

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (3 Unrelated Failures)

As of commit d7fd93d with merge base 75b0720:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@XuehaiPan added the better-engineering (Relatively self-contained tasks for better engineering contributors) and module: python frontend (For issues relating to PyTorch's Python frontend) labels Jun 2, 2024
XuehaiPan added a commit to XuehaiPan/pytorch that referenced this pull request Jun 2, 2024
Add top-level submodules `torch.{storage,serialization,functional,amp,utils,overrides,types}`

Resolves pytorch#126401

ghstack-source-id: 4be1063ca731cfbb01266b1ded2e94e5cf43ab9a
Pull Request resolved: pytorch#127703
[ghstack-poisoned]
XuehaiPan added a commit to XuehaiPan/pytorch that referenced this pull request Jun 2, 2024
Add top-level submodules `torch.{storage,serialization,functional,amp,utils,overrides,types}`

Resolves pytorch#126401

ghstack-source-id: 139555ce7eccae5e42562d5759cd4ae79fc00881
Pull Request resolved: pytorch#127703
@Skylion007
Collaborator

Some of these changes seem related to: #127688 and probably should wait on that.

@@ -6,21 +6,21 @@
future.
"""

from . import lr_scheduler, swa_utils
Collaborator

Is ufmt forcing the removal of all these relative imports?

Collaborator Author
@XuehaiPan Jun 2, 2024

No. This is updated by hand.

Absolute imports, or relative imports from siblings, are recommended by PEP8.

Absolute imports are recommended, as they are usually more readable and tend to be better behaved (or at least give better error messages) if the import system is incorrectly configured (such as when a directory inside a package ends up on sys.path):

import mypkg.sibling
from mypkg import sibling
from mypkg.sibling import example

However, explicit relative imports are an acceptable alternative to absolute imports, especially when dealing with complex package layouts where using absolute imports would be unnecessarily verbose:

from . import sibling
from .sibling import example

Standard library code should avoid complex package layouts and always use absolute imports.


We can enforce this by adding the ruff rule TID (flake8-tidy-imports) with:

[tool.ruff.lint.flake8-tidy-imports]
ban-relative-imports = "all"  # defaults to "parents"; see https://docs.astral.sh/ruff/settings/#lintflake8-tidy-imports
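For context, a minimal sketch of what such a rewrite looks like for a hunk like the one shown above in `torch/optim/__init__.py` (illustrative, not the exact diff from this PR):

# Illustrative sketch: inside torch/optim/__init__.py, the relative sibling import
#     from . import lr_scheduler, swa_utils
# becomes the equivalent absolute import:
from torch.optim import lr_scheduler, swa_utils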

Collaborator

Can't this affect import ordering, which can be bad for our Python submodules in torch that do have side effects (or cause cyclic import errors where none existed before)?

Collaborator Author

Changing a relative import to the identical absolute import does not change the side effects of the import statement.

Collaborator

Doesn't it change the import order though? The relative import would actually trigger the import of the file, while the absolute import would expect the partially initialized module to already contain the corresponding entry.

Collaborator Author

This change is based on the following:

  1. The relative import is converted to an absolute import in the bytecode eval loop:
$ echo 'from torch.optim import adam' | python3 -m dis -
  0           0 RESUME                   0

  1           2 LOAD_CONST               0 (0)
              4 LOAD_CONST               1 (('adam',))
              6 IMPORT_NAME              0 (torch.optim)
              8 IMPORT_FROM              1 (adam)
             10 STORE_NAME               1 (adam)
             12 POP_TOP
             14 RETURN_CONST             2 (None)
$ echo 'from . import adam' | python3 -m dis -
  0           0 RESUME                   0

  1           2 LOAD_CONST               0 (1)
              4 LOAD_CONST               1 (('adam',))
              6 IMPORT_NAME              0
              8 IMPORT_FROM              1 (adam)
             10 STORE_NAME               1 (adam)
             12 POP_TOP
             14 RETURN_CONST             2 (None)

https://github.com/python/cpython/blob/a9f2daf1ab182d95b44ee94dc9fb8faec60e34b1/Python/ceval.c#L2513-L2526

`from . import adam` is converted to `__import__(f'{__name__}.adam')` where `__name__ == 'torch.optim'`.

They are semantically identical (see the sketch after this list).

  2. See [BE][Easy] export explicitly imported public submodules #127703 (comment): absolute imports, or relative imports from siblings, are recommended by PEP 8.
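As a quick sanity check (a minimal sketch, assuming torch is installed), both spellings resolve to the same entry in sys.modules:

import importlib
import sys

# Absolute form: what `from torch.optim import adam` imports.
absolute = importlib.import_module("torch.optim.adam")

# Relative form: what `from . import adam` inside torch/optim/__init__.py resolves to;
# the `package` argument plays the role that `__name__` plays at runtime.
relative = importlib.import_module(".adam", package="torch.optim")

assert absolute is relative
assert sys.modules["torch.optim.adam"] is absolute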

Contributor

How about absolute paths when combined with circular dependencies? Example:

`torch/optim/__init__.py` imports `torch.optim._multi_tensor` and
`torch/optim/_multi_tensor/__init__.py` imports `torch.optim`.

This pattern, combined with absolute import paths, is causing unpredictable and hard-to-debug failures.
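A minimal, self-contained reproduction of this failure mode (a sketch using a hypothetical package `mypkg`; the real case involves `torch/optim/__init__.py` and `torch/optim/_multi_tensor/__init__.py`):

import os
import sys
import tempfile
import textwrap

# Build a throwaway package on disk: the parent imports its subpackage with an
# absolute path, and the subpackage imports the parent back.
root = tempfile.mkdtemp()
os.makedirs(os.path.join(root, "mypkg", "sub"))

with open(os.path.join(root, "mypkg", "__init__.py"), "w") as f:
    f.write(textwrap.dedent("""\
        import mypkg.sub          # absolute import of the subpackage...

        def helper():             # ...but this name is defined only AFTER that import
            return 42
    """))

with open(os.path.join(root, "mypkg", "sub", "__init__.py"), "w") as f:
    f.write(textwrap.dedent("""\
        import mypkg              # mypkg is still only partially initialized here
        value = mypkg.helper()    # AttributeError: module 'mypkg' has no attribute 'helper'
    """))

sys.path.insert(0, root)
import mypkg  # raises AttributeError, showing the partially-initialized-module hazard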

Collaborator Author
@XuehaiPan Jun 21, 2024

This pattern, combined with absolute import paths, is causing unpredictable and hard-to-debug failures.

You can add a `raise ImportError` statement at the top of the file and then look at the traceback of `python3 -c 'import torch'`. This can be used to find the first module that imports the submodule.
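For example (a hypothetical marker, not part of this PR), placed temporarily at the top of the submodule you want to trace, e.g. `torch/optim/_multi_tensor/__init__.py`:

# Hypothetical debugging aid: running `python3 -c 'import torch'` now fails here,
# and the traceback shows the chain of imports that reached this module first.
raise ImportError("debug marker: who imports this module first?")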

[ghstack-poisoned]
XuehaiPan added a commit to XuehaiPan/pytorch that referenced this pull request Jun 2, 2024
Add top-level submodules `torch.{storage,serialization,functional,amp,utils,overrides,types}`

Resolves pytorch#126401

ghstack-source-id: 08adbe6e709d2963258855e1a9ca156b4cbd5f88
Pull Request resolved: pytorch#127703
pytorchmergebot pushed a commit that referenced this pull request Jun 12, 2024
----

- Sort imports via `usort`
- Change relative imports `from . import xxx` to absolute imports `from torch import xxx`

Pull Request resolved: #127708
Approved by: https://github.com/ezyang
ghstack dependencies: #127703
pytorchmergebot pushed a commit that referenced this pull request Jun 12, 2024
Pull Request resolved: #127709
Approved by: https://github.com/ezyang
ghstack dependencies: #127703, #127708
pytorchmergebot pushed a commit that referenced this pull request Jun 12, 2024
Pull Request resolved: #127710
Approved by: https://github.com/ezyang
ghstack dependencies: #127703, #127708, #127709
XuehaiPan added a commit to XuehaiPan/pytorch that referenced this pull request Jun 12, 2024
Add top-level submodules `torch.{storage,serialization,functional,amp,utils,overrides,types}`

Resolves pytorch#126401

ghstack-source-id: 95329001fef0a504fd80cf02afc1ea379eecb05e
Pull Request resolved: pytorch#127703
TharinduRusira pushed a commit to TharinduRusira/pytorch that referenced this pull request Jun 14, 2024
Add top-level submodules `torch.{storage,serialization,functional,amp,overrides,types}`

Pull Request resolved: pytorch#127703
Approved by: https://github.com/ezyang
TharinduRusira pushed a commit to TharinduRusira/pytorch that referenced this pull request Jun 14, 2024
----

- Sort imports via `usort`
- Change relative imports `from . import xxx` to absolute imports `from torch import xxx`

Pull Request resolved: pytorch#127708
Approved by: https://github.com/ezyang
ghstack dependencies: pytorch#127703
TharinduRusira pushed a commit to TharinduRusira/pytorch that referenced this pull request Jun 14, 2024
TharinduRusira pushed a commit to TharinduRusira/pytorch that referenced this pull request Jun 14, 2024
ignaciobartol pushed a commit to ignaciobartol/pytorch that referenced this pull request Jun 14, 2024
Add top-level submodules `torch.{storage,serialization,functional,amp,overrides,types}`

Pull Request resolved: pytorch#127703
Approved by: https://github.com/ezyang
ignaciobartol pushed a commit to ignaciobartol/pytorch that referenced this pull request Jun 14, 2024
----

- Sort imports via `usort`
- Change relative imports `from . import xxx` to absolute imports `from torch import xxx`

Pull Request resolved: pytorch#127708
Approved by: https://github.com/ezyang
ghstack dependencies: pytorch#127703
ignaciobartol pushed a commit to ignaciobartol/pytorch that referenced this pull request Jun 14, 2024
ignaciobartol pushed a commit to ignaciobartol/pytorch that referenced this pull request Jun 14, 2024
fbgheith added a commit to fbgheith/pytorch that referenced this pull request Jun 19, 2024
Summary:
- PR pytorch#127703 introduced a circular dependency:
  `torch/optim/__init__.py` imports `torch.optim._multi_tensor` and
  `torch/optim/_multi_tensor/__init__.py` imports `torch.optim`.

  This seemed to work fine (green signals everywhere) but caused some internal test failures after it landed: an infinite recursion during import.

- PR pytorch#128875 attempted to fix this by removing the import from `torch/optim/__init__.py`.

  This also seemed to work fine (green signals everywhere, and the previously failing tests started passing), but a smaller set of tests then started failing because they could no longer import `torch.optim._multi_tensor`.

- This diff re-introduces the import, but only after `torch.optim` is fully initialized.

Test Plan: CI signals

Differential Revision: D58792889
fbgheith added a commit to fbgheith/pytorch that referenced this pull request Jun 19, 2024
Summary:
Pull Request resolved: pytorch#129095

- PR pytorch#127703 introduced a circular dependency:
  `torch/optim/__init__.py` imports `torch.optim._multi_tensor` and
  `torch/optim/_multi_tensor/__init__.py` imports `torch.optim`.

  This seemed to work fine (green signals everywhere) but caused some internal test failures after it landed: an infinite recursion during import.

- PR pytorch#128875 attempted to fix this by removing the import from `torch/optim/__init__.py`.

  This also seemed to work fine (green signals everywhere, and the previously failing tests started passing), but a smaller set of tests then started failing because they could no longer import `torch.optim._multi_tensor`.

- This diff re-introduces the import, but only after `torch.optim` is fully initialized.

Test Plan: CI signals

Differential Revision: D58792889
fbgheith added a commit to fbgheith/pytorch that referenced this pull request Jun 20, 2024
Summary:
This avoids some internal test failures caused by "infinite" recursion during import. Example: P1430048846

A bit of history:

- PR pytorch#127703 introduced a circular dependency:
  `torch/optim/__init__.py` imports `torch.optim._multi_tensor` and
  `torch/optim/_multi_tensor/__init__.py` imports `torch.optim`.

  This seemed to work fine (green signals everywhere) but caused some internal test failures after it landed: an infinite recursion during import.

- PR pytorch#128875 attempted to fix this by removing the import from `torch/optim/__init__.py`.

  This also seemed to work fine (green signals everywhere, and the previously failing tests started passing), but a smaller set of tests then started failing because they could no longer import `torch.optim._multi_tensor`.

- This diff re-introduces the import but avoids the infinite recursion by using relative package paths.

Differential Revision: D58815471
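A minimal sketch of that workaround, assuming it applies inside `torch/optim/__init__.py` (illustrative only; the actual internal diff is not shown in this thread):

# Illustrative only: re-introduce the subpackage import via a relative package
# path rather than the absolute `import torch.optim._multi_tensor`.
from . import _multi_tensor  # noqa: F401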
fbgheith added a commit to fbgheith/pytorch that referenced this pull request Jun 20, 2024
…ytorch#129132)

Summary:
Pull Request resolved: pytorch#129132

This avoids some internal test failures caused by "infinite" recursion during import. Example: P1430048846

A bit of history:

- PR pytorch#127703 introduced a circular dependency:
  `torch/optim/__init__.py` imports `torch.optim._multi_tensor` and
  `torch/optim/_multi_tensor/__init__.py` imports `torch.optim`.

  This seemed to work fine (green signals everywhere) but caused some internal test failures after it landed: an infinite recursion during import.

- PR pytorch#128875 attempted to fix this by removing the import from `torch/optim/__init__.py`.

  This also seemed to work fine (green signals everywhere, and the previously failing tests started passing), but a smaller set of tests then started failing because they could no longer import `torch.optim._multi_tensor`.

- This diff re-introduces the import but avoids the infinite recursion by using relative package paths.

Test Plan:
CI signals

Previously failing tests are passing:

* https://www.internalfb.com/intern/testinfra/testrun/4785074840480058
*  https://www.internalfb.com/intern/testinfra/testrun/281475349103659
* https://www.internalfb.com/intern/testinfra/testrun/11540474083575936

Differential Revision: D58815471
@malfet
Contributor

malfet commented Jun 20, 2024

@fbgheith reports that this broke some lazy import logic (still trying to figure out the details), so I'll be attempting to revert this and a few others that depend on it to restore the original behavior.

@XuehaiPan
Collaborator Author

@fbgheith reports that this broke some lazy import logic (still trying to figure out the details), so I'll be attempting to revert this and a few others that depend on it to restore the original behavior.

I will help with fixing that in PR #129095.

@fbgheith
Contributor

This has been a big source of headaches: depending on the exact ordering of imports in the PyTorch client code, one of the following would happen:

(1) great majority: things work fine
(2a) the `import torch.optim._multi_tensor` in `torch/optim/__init__.py` causes infinite recursion in a small set of use cases
(2b) removing the import above solves (2a) but makes `import torch.optim._multi_tensor` fail for another small set
(2c) replacing the backwards absolute import with relative ones makes both sets happy

This is just empirical data for this pair of dependencies.

@fbgheith
Contributor

fbgheith commented Jun 21, 2024

Please keep in mind that #129095 is just a band-aid to remedy the particular scenario described above without reverting #127703. We also need to come up with an explanation for why we ended up in this state and the proper guidelines for what can go in `__init__.py` files.

@XuehaiPan
Collaborator Author

We also need to come up with an explanation for why we ended up in this state and the proper guidelines for what can go in __init__.py files.

We will be able to do this after we enable ufmt globally and optionally enable ruff's TID rules.

Labels
better-engineering (Relatively self-contained tasks for better engineering contributors), ciflow/inductor, ciflow/trunk (Trigger trunk jobs on your pull request), Merged, module: dynamo, module: python frontend (For issues relating to PyTorch's Python frontend), open source, release notes: optim