
[Bug] (suggested temporary fix) Pytorch >= 2 causes mmrazor.engine to fail #632

Open
elisa-aleman opened this issue Mar 26, 2024 · 4 comments
Labels
bug Something isn't working

Comments


elisa-aleman commented Mar 26, 2024

Describe the bug

When using tools/train.py, I get the following error:

Traceback (most recent call last):
  File "/root/workspace/mmrazor/tools/train.py", line 121, in <module>
    main()
  File "/root/workspace/mmrazor/tools/train.py", line 55, in main
    register_all_modules
  File "/root/.cache/.../site-packages/mmrazor/utils/setup_env.py", line 65, in register_all_modules
    import mmrazor.engine  # noqa: F401,F403
    ^^^^^^^^^^^^^^^^^^^^^
  File "/root/.cache/.../site-packages/mmrazor/engine/__init__.py", line 2, in <module>
    from .hooks import (DMCPSubnetHook, DumpSubnetHook, EstimateResourcesHook,
  File "/root/.cache/.../site-packages/mmrazor/engine/hooks/__init__.py", line 2, in <module>
    from .dmcp_subnet_hook import DMCPSubnetHook
  File "/root/.cache/.../site-packages/mmrazor/engine/hooks/dmcp_subnet_hook.py", line 8, in <module>
    from mmrazor.structures import export_fix_subnet
  File "/root/.cache/.../site-packages/mmrazor/structures/__init__.py", line 2, in <module>
    from .quantization import *  # noqa: F401,F403
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/root/.cache/.../site-packages/mmrazor/structures/quantization/__init__.py", line 2, in <module>
    from .backend_config import *  # noqa: F401,F403
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/root/.cache/.../site-packages/mmrazor/structures/quantization/backend_config/__init__.py", line 2, in <module>
    from .academic import (get_academic_backend_config,
  File "/root/.cache/.../site-packages/mmrazor/structures/quantization/backend_config/academic.py", line 11, in <module>
    from .common_operator_config_utils import (_get_conv_configs,
  File "/root/.cache/.../site-packages/mmrazor/structures/quantization/backend_config/common_operator_config_utils.py", line 54, in <module>
    nn.Conv1d, nn.ConvTranspose1d, nn.BatchNorm1d, nnqr.Conv1d,
                                                   ^^^^^^^^^^^
  File "/root/.cache/.../site-packages/mmrazor/utils/placeholder.py", line 50, in __getattr__
    raise_import_error(string)
  File "/root/.cache/.../site-packages/mmrazor/utils/placeholder.py", line 43, in raise_import_error
    raise ImportError(
ImportError: `torch>=1.13` is not installed properly, plz check

However, I am using torch 2.0.0:

>>> import torch
>>> torch.__version__
2.0.0
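Since the real problem is the torch major version rather than a missing install, an explicit version gate would produce an accurate message instead of the misleading "`torch>=1.13` is not installed properly". A minimal sketch, assuming only that `torch.__version__` is a string like `2.0.0` or `1.13.1+cu117`; `torch_major_version` is a hypothetical helper, not part of mmrazor:

```python
def torch_major_version(version: str) -> int:
    """Parse the major component of a torch version string.

    Hypothetical helper: a gate like this could raise a clear
    'torch 2.x is not yet supported' error at import time.
    """
    # Drop any local suffix such as '+cu117' before splitting on dots.
    return int(version.split('+')[0].split('.')[0])
```

With torch installed, the check would be `torch_major_version(torch.__version__) >= 2`.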

To Reproduce

The command you executed.

python tools/train.py \

configuration redacted.

Additional context

Inspecting mmrazor/structures/quantization/backend_config/common_operator_config_utils.py myself led me to this line:

from torch.ao.quantization.fuser_method_mappings import (
        fuse_conv_bn, fuse_conv_bn_relu, fuse_convtranspose_bn, fuse_linear_bn,
        reverse2, reverse3, reverse_sequential_wrapper2)

Executing that import in my local environment raised a missing-name error, so I checked further.

Looking at the PyTorch repository at version 2.0.0, torch.ao.quantization.fuser_method_mappings.reverse2 is now torch.ao.quantization.fuser_method_mappings._reverse2, and likewise reverse3 -> _reverse3. Furthermore, reverse_sequential_wrapper2 is gone altogether.
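One way to cope with renames like reverse2 -> _reverse2 without pinning torch is a small resolver that tries each candidate name in order. This is a sketch; `resolve_attr` is a hypothetical helper, not mmrazor or torch code:

```python
import importlib


def resolve_attr(module_name: str, *candidates: str):
    """Return the first attribute found among candidate names.

    Intended for public names that later releases made private,
    as with the fuser-method helpers described above.
    """
    mod = importlib.import_module(module_name)
    for name in candidates:
        if hasattr(mod, name):
            return getattr(mod, name)
    raise ImportError(f'none of {candidates} found in {module_name}')


# With torch installed, the failing import could then become, e.g.:
# reverse2 = resolve_attr('torch.ao.quantization.fuser_method_mappings',
#                         'reverse2', '_reverse2')
```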

Other namespaces that disappeared were:

  • torch.ao.quantization.backend_config.BackendPatternConfig._set_overwrite_output_fake_quantize
  • torch.ao.quantization.backend_config.BackendPatternConfig._set_overwrite_output_observer
  • torch.ao.quantization.backend_config.BackendPatternConfig._set_input_output_observed

Monkey-patching all of these (restoring the removed methods and aliasing the renamed ones) and then calling import mmrazor.engine no longer produces errors, but the proper solution needs to be compatible with torch >= 2.0.0 moving forward.

This bug might be related to #615

@elisa-aleman elisa-aleman added the bug Something isn't working label Mar 26, 2024
@elisa-aleman (Author)

Also related to #553

@chenjie04

If you look at the source code, this error does not require torch higher than 1.13, but rather less than or equal to 1.13; downgrading torch will fix the problem.

@elisa-aleman (Author)

> the source code

Then the requirements files need to be updated to follow PEP.

> degrading torch will fix the problem.

Regardless, torch 1.13 is extremely outdated; please update the source code.

@elisa-aleman (Author)

Note: the suggested fix above will not work with fusions, because of the changes to BackendPatternConfig between torch 1 and torch 2. Any model with potential fusions will misbehave under torch 2 unless these BackendPatternConfig definitions are updated to match the new version.
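For the BackendPatternConfig side, one defensive option is to invoke the torch 1.x-only private setters only when they still exist, so the same config-building code runs on both major versions. A sketch with a hypothetical `call_if_present` helper; the setter name in the docstring is one of those listed earlier in this issue, and whether a silent no-op is acceptable for fusion correctness would need to be verified per setter:

```python
def call_if_present(obj, method_name, *args):
    """Call obj.method_name(*args) if it exists; return obj for chaining.

    Lets torch 1.x-only setters such as
    BackendPatternConfig._set_input_output_observed degrade to a
    no-op on releases where they were removed.
    """
    method = getattr(obj, method_name, None)
    if callable(method):
        method(*args)
    return obj
```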
