[autoparallel] Add metainfo support for F.linear #1987
Conversation
Merge ColossalAI
Daily merge
…r30/ColossalAI into feature/metainfo_for_auto_parallel
@@ -65,7 +68,7 @@ def linear_meta_info(*args, **kwargs) -> Tuple[TrainCycleItem, TrainCycleItem, L
     has_bias: bool = False
     input_tensor = next(filter(lambda x: x.type == OperationDataType.ARG, args)).data
     output_tensor = next(filter(lambda x: x.type == OperationDataType.OUTPUT, args)).data
-    weight_tensor = next(filter(lambda x: x.name == 'weight', args)).data
+    weight_tensors = [x.data for x in args if x.type == OperationDataType.PARAM]
Modified this part for more robust code.
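The robustness gain can be shown in isolation: selecting parameters by their `OperationDataType` picks up every parameter (weight and, when present, bias), whereas filtering on the hard-coded name `'weight'` misses the bias tensor. A minimal sketch, using simplified stand-ins for ColossalAI's `OperationData` container and enum (the field names and values here are illustrative, not the real types):

```python
from dataclasses import dataclass
from enum import Enum, auto


class OperationDataType(Enum):
    # Simplified stand-in for the real ColossalAI enum
    ARG = auto()
    OUTPUT = auto()
    PARAM = auto()


@dataclass
class OperationData:
    name: str
    type: OperationDataType
    data: object


args = [
    OperationData('input', OperationDataType.ARG, 'input_tensor'),
    OperationData('output', OperationDataType.OUTPUT, 'output_tensor'),
    OperationData('weight', OperationDataType.PARAM, 'weight_tensor'),
    OperationData('bias', OperationDataType.PARAM, 'bias_tensor'),
]

# Fragile: only finds the tensor literally named 'weight', misses the bias
weight_tensor = next(filter(lambda x: x.name == 'weight', args)).data

# Robust: collects every parameter regardless of its name
weight_tensors = [x.data for x in args if x.type == OperationDataType.PARAM]
print(weight_tensors)  # ['weight_tensor', 'bias_tensor']
```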
    assert meta_register.has(self._target.__class__), f'{self._target.__class__} not found in the meta registry'
    meta_func = meta_register.get(self._target.__class__)
    try:
        # module
Support the case that node.op == "call_function".
@@ -104,8 +106,12 @@ def mem_test_for_node_strategy(rank: int,
     )

     # estimated memory
-    metainfo = MetaInfo(target_node.strategies_vector[strategy_index],
-                        target_node.graph.owning_module.get_submodule(target_node.target))
+    if target_node.op == "call_module":
Modify this part to support node.op == "call_function".
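The branch added above is needed because `torch.fx` resolves the two op kinds differently: for `call_module` the node's `target` is a submodule path that must be looked up on the owning module, while for `call_function` the `target` already is the function object (e.g. `torch.nn.functional.linear`). A hedged sketch with a minimal fake node, not the real `torch.fx.Node` type:

```python
from dataclasses import dataclass


@dataclass
class FakeNode:
    # Minimal stand-in for a torch.fx Node: only the fields we branch on
    op: str
    target: object


def resolve_target(node, submodules):
    """Return the callable/module a node refers to.

    'call_module' targets name a submodule; 'call_function' targets
    are the function object itself. Lookup here is a plain dict as a
    simplified stand-in for Module.get_submodule.
    """
    if node.op == "call_module":
        return submodules[node.target]
    if node.op == "call_function":
        return node.target
    raise ValueError(f"unsupported op: {node.op}")


def fake_linear(x):  # stands in for torch.nn.functional.linear
    return x


submodules = {"fc1": "Linear(4, 4) submodule"}
print(resolve_target(FakeNode("call_module", "fc1"), submodules))
print(resolve_target(FakeNode("call_function", fake_linear), submodules) is fake_linear)
```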
What’s New?
In this PR, I added support for torch.nn.functional.linear in our metainfo generation; the memory estimation results are aligned with torch.nn.Linear (without bias). Although we currently decompose a biased linear into a matmul and a bias add, I have kept the code path that generates metainfo for the biased linear for future use.
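The alignment claim and the motivation for keeping the biased path can be sanity-checked with a rough element count. Without a bias, F.linear and nn.Linear describe exactly the same tensors, so any element-count-based memory estimate agrees; decomposing a biased linear into matmul + bias add, however, materializes the matmul result as an extra intermediate activation. This sketch is purely illustrative (the function names and formulas are assumptions, not ColossalAI's actual estimator, which also tracks forward/backward phases and dtypes):

```python
def fused_linear_mem(batch, fin, fout, bias=True):
    # Elements touched by a fused linear: parameters + one output activation
    params = fin * fout + (fout if bias else 0)
    activations = batch * fout
    return params, activations


def decomposed_linear_mem(batch, fin, fout):
    # Matmul followed by a separate bias add materializes the matmul
    # result as an extra intermediate activation of the output's size
    params = fin * fout + fout
    activations = 2 * batch * fout
    return params, activations


fused = fused_linear_mem(8, 64, 32)       # biased nn.Linear-style estimate
split = decomposed_linear_mem(8, 64, 32)  # matmul + bias-add decomposition
print(fused)   # (2080, 256)
print(split)   # (2080, 512): same params, extra intermediate activation
```

This is why the metainfo path for the fused biased linear is worth keeping: if the decomposition is ever fused back, the cheaper estimate applies directly.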