[fx] PoC of runtime shape consistency application #1607

Merged

Conversation

YuliangLiu0306 (Contributor)

No description provided.

return shape_consistency_manager.apply(*args, **kwargs)


def solution_annotatation_pass(gm: torch.fx.GraphModule, solution: List[int], device_mesh):
Contributor

why is solution a list of int?

Contributor Author

The solver's output is a list of ints; the value of each element is the index of the best strategy for the corresponding node.
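
As a minimal sketch of that idea (a toy model and a hypothetical solver output, not this PR's actual solver API), the solution holds one int per graph node, used to index that node's candidate strategies:

from typing import List

import torch
from torch.fx import symbolic_trace


class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(4, 4)

    def forward(self, x):
        return self.linear(x)


gm = symbolic_trace(TinyModel())
nodes = list(gm.graph.nodes)

# Hypothetical solver output: solution[i] is the index of the best strategy
# for nodes[i]; here every node simply uses its 0-th candidate.
solution: List[int] = [0] * len(nodes)

for node, strategy_index in zip(nodes, solution):
    # A real annotation pass would look up the node's candidate strategies
    # with this index; here we only record the chosen index for illustration.
    setattr(node, 'best_strategy_index', strategy_index)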



def solution_annotatation_pass(gm: torch.fx.GraphModule, solution: List[int], device_mesh):
mod_graph = gm.graph
Contributor

what is a mod_graph?

Contributor Author

It is short for model graph, i.e. gm.graph.

origin_sharding_spec = ShardingSpec(device_mesh, target_module.weight.shape, {})
setattr(target_module.weight, 'sharding_spec', origin_sharding_spec)
target_weight_sharding_spec = node.best_strategy.input_shardings[1]
target_module.weight.data = target_module.weight.data.permute((1, 0, 2, 3))
Contributor

Why permute?

Contributor

If this is because the conv/linear weight is not in the desired shape, I can accept it for now, but we should handle this in NodeHandler.

Contributor Author

Sure, I just found this problem during testing; I will fix it in a future PR.
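
For context, a rough sketch of what the permute does, assuming the mismatch is only the ordering of the first two weight dimensions (the sharding-spec details from this PR are omitted):

import torch

# Conv2d stores its weight as (out_channels, in_channels, kH, kW).
conv = torch.nn.Conv2d(in_channels=8, out_channels=16, kernel_size=3)
print(conv.weight.shape)  # torch.Size([16, 8, 3, 3])

# Swapping the first two dimensions, as in the questioned line, yields an
# (in_channels, out_channels, kH, kW) layout so that the chosen input
# sharding spec can shard along the intended dimension.
conv.weight.data = conv.weight.data.permute((1, 0, 2, 3))
print(conv.weight.shape)  # torch.Size([8, 16, 3, 3])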

with mod_graph.inserting_before(user_node):
shape_consistency_node = mod_graph.create_node('call_function', apply, args=(node, sharding_spec_node))

gm.recompile()
Contributor

recompile should only be called after all passes finish.

Contributor Author

Done
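
A minimal sketch of the agreed pattern, with a stand-in apply_shape_consistency function in place of shape_consistency_manager.apply: each pass only mutates gm.graph, and gm.recompile() is called once after all passes have run.

import torch
from torch.fx import symbolic_trace


def apply_shape_consistency(tensor):
    # Placeholder: a real implementation would convert the tensor between
    # sharding specs; here it simply passes the value through.
    return tensor


def shape_consistency_pass(gm: torch.fx.GraphModule):
    mod_graph = gm.graph
    for node in list(mod_graph.nodes):
        if node.op == 'call_module':
            for user_node in list(node.users):
                # Insert a conversion node right before the consumer.
                with mod_graph.inserting_before(user_node):
                    new_node = mod_graph.create_node(
                        'call_function', apply_shape_consistency, args=(node,))
                # Route the consumer through the inserted node.
                user_node.replace_input_with(node, new_node)
    # Note: no recompile here; it is deferred to the caller.


class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(4, 4)

    def forward(self, x):
        return self.linear(x)


gm = symbolic_trace(TinyModel())
shape_consistency_pass(gm)
gm.recompile()  # called once, after all passes finish
print(gm(torch.randn(2, 4)).shape)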

sharding_spec_dict, origin_spec_dict = solution_annotatation_pass(gm, solution, device_mesh)
shape_consistency_pass(gm)
nodes = [node for node in gm.graph.nodes]
output = gm(input, sharding_spec_dict, origin_spec_dict)
Contributor

Such usage is not very intuitive; I would recommend sticking to gm(input) in the future, but I can let it pass for now. We can annotate it with a TODO tag.

Contributor Author

Done
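
For reference, a toy sketch of why the extra positional arguments appear at the call site (this is the general FX mechanism, not this PR's exact passes): an annotation pass that adds placeholder nodes changes the recompiled forward() signature, which is what the TODO would eventually hide again.

import torch
from torch.fx import symbolic_trace


class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(4, 4)

    def forward(self, x):
        return self.linear(x)


gm = symbolic_trace(TinyModel())

# Append an extra placeholder after the existing ones, mimicking how the
# annotation pass exposes sharding_spec_dict as a graph input.
placeholders = [n for n in gm.graph.nodes if n.op == 'placeholder']
with gm.graph.inserting_after(placeholders[-1]):
    gm.graph.placeholder('sharding_spec_dict')
gm.recompile()

# The generated forward is now forward(self, x, sharding_spec_dict),
# so the caller must pass the extra dict explicitly.
out = gm(torch.randn(2, 4), {})
print(out.shape)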

@FrankLeeeee FrankLeeeee merged commit 7d1bb71 into hpcaitech:main Sep 20, 2022