[tensor] redistribute among different process groups #1247

Merged: 9 commits merged into hpcaitech:main on Jul 12, 2022

Conversation

feifeibear (Contributor)

No description provided.

if pg is not None and pg != self.get_process_group():
    print('here _redistribute')
    # if the pg is not equal, convert the current tensor to replicated
    self._redistribute(ReplicaSpec())
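
For context, the replicate-then-reshard strategy this branch takes can be illustrated in a single process. The sketch below is illustrative only (plain PyTorch, not the ColoTensor API; the helper names are made up): moving a sharded tensor to a differently sized process group amounts to first rebuilding the replicated tensor, then slicing it for the new group.

import torch

def replicate(shards):
    # step 1: gather all shards back into one replicated (full) tensor
    return torch.cat(shards, dim=0)

def reshard(full, world_size):
    # step 2: split the replicated tensor across the new group's ranks
    return list(torch.chunk(full, world_size, dim=0))

old_shards = list(torch.chunk(torch.arange(12.0), 4))  # sharded over a 4-rank group
full = replicate(old_shards)                           # convert to replicated
new_shards = reshard(full, 3)                          # reshard over a 3-rank group
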
@1SAA (Contributor) commented on Jul 11, 2022:

I think the updated operation, redistribute, is always used for non-model data in training.
It is natural that we need to keep redistribute as an autograd operation.
But here, _redistribute is not capable of being used as an autograd operation.
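
To make the distinction concrete: an autograd-capable redistribute must route gradients back through the inverse layout change, which a bare _redistribute skips. A minimal sketch, assuming a hypothetical convert_layout helper (not the ColossalAI API):

import torch

def convert_layout(t, src_spec, dst_spec):
    # placeholder for the real communication (gather/scatter/all-to-all)
    # that moves data from src_spec to dst_spec; an identity copy here
    return t.clone()

class Redistribute(torch.autograd.Function):
    # a redistribute that can be used as an autograd operation:
    # the backward pass applies the inverse layout change to the gradient
    @staticmethod
    def forward(ctx, tensor, old_spec, new_spec):
        ctx.old_spec, ctx.new_spec = old_spec, new_spec
        return convert_layout(tensor, old_spec, new_spec)

    @staticmethod
    def backward(ctx, grad_output):
        # gradients flow in the reverse direction, so the specs swap
        return convert_layout(grad_output, ctx.new_spec, ctx.old_spec), None, None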

@feifeibear (Contributor, Author) replied:

I see. I will discuss this with you offline.
I hope ColoTensor can be used independently; we don't need to assume it is only used for training.

@YuliangLiu0306 merged commit 1aad903 into hpcaitech:main on Jul 12, 2022