consistent_tensor_to_api #5703

clackhan · 2021-08-03T08:34:59Z

ConsistentTensor 支持方法 .to("cpu")：
0号进程演示

>>> import oneflow as flow
>>> import numpy as np
>>> import os
>>> if int(os.environ.get("RANK")) == 0:
...     ndarr = np.asarray([[1, -3, 2], [-3, 7, -4]])
... else:
...     ndarr = np.asarray([[6, -5, 9], [-4, 8, -3]])
...
>>> a = flow.Tensor(ndarr)
>>> b = a.to("cuda")
>>> p = flow.placement("cuda", {0:range(2)})
>>> c = b.to_consistent(p, flow.sbp.split(0))
>>> print(c.to_local())
tensor([[ 1., -3.,  2.],
        [-3.,  7., -4.]], device='cuda:0', dtype=oneflow.float32)
>>> d = c.to("cpu")
>>> print(d.to_local())
tensor([[ 1., -3.,  2.],
        [-3.,  7., -4.]], dtype=oneflow.float32)
>>>

1号进程演示：

>>> import oneflow as flow
>>> import numpy as np
>>> import os
>>> if int(os.environ.get("RANK")) == 0:
...     ndarr = np.asarray([[1, -3, 2], [-3, 7, -4]])
... else:
...     ndarr = np.asarray([[6, -5, 9], [-4, 8, -3]])
...
>>> a = flow.Tensor(ndarr)
>>> b = a.to("cuda")
>>> p = flow.placement("cuda", {0:range(2)})
>>> c = b.to_consistent(p, flow.sbp.split(0))
>>> print(c.to_local())
tensor([[ 6., -5.,  9.],
        [-4.,  8., -3.]], device='cuda:1', dtype=oneflow.float32)
>>> d = c.to("cpu")
>>> print(d.to_local())
tensor([[ 6., -5.,  9.],
        [-4.,  8., -3.]], dtype=oneflow.float32)
>>>

github-actions · 2021-08-03T11:51:36Z

Speed stats:

GPU Name: GeForce GTX 1080 

PyTorch resnet50 time: 146.1ms (= 7305.8ms / 50, input_shape=[16, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 126.4ms (= 6319.4ms / 50, input_shape=[16, 3, 224, 224], backward is enabled)
Relative speed: 1.16 (= 146.1ms / 126.4ms)

PyTorch resnet50 time: 83.7ms (= 4187.5ms / 50, input_shape=[8, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 74.1ms (= 3703.8ms / 50, input_shape=[8, 3, 224, 224], backward is enabled)
Relative speed: 1.13 (= 83.7ms / 74.1ms)

PyTorch resnet50 time: 56.1ms (= 2802.9ms / 50, input_shape=[4, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 47.7ms (= 2383.3ms / 50, input_shape=[4, 3, 224, 224], backward is enabled)
Relative speed: 1.18 (= 56.1ms / 47.7ms)

PyTorch resnet50 time: 48.1ms (= 2403.9ms / 50, input_shape=[2, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 42.7ms (= 2136.5ms / 50, input_shape=[2, 3, 224, 224], backward is enabled)
Relative speed: 1.13 (= 48.1ms / 42.7ms)

PyTorch resnet50 time: 45.0ms (= 2250.0ms / 50, input_shape=[1, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 42.2ms (= 2109.6ms / 50, input_shape=[1, 3, 224, 224], backward is enabled)
Relative speed: 1.07 (= 45.0ms / 42.2ms)

daquexian · 2021-08-04T01:21:56Z

python/oneflow/nn/modules/to.py

+    input_local_tesnor = input.to_local()
+    device = flow.device(device)
+    output_local_tesnor = To(False)(input_local_tesnor, device, None)
+    return output_local_tesnor.to_consistent(out_placement, sbp)


input_local_tesnor 和 output_local_tesnor 有 typo

daquexian · 2021-08-04T01:29:58Z

python/oneflow/nn/modules/to.py

+        "cuda",
+        "cpu",
+    ), 'consistent tensor only support to("cuda") or to("cpu")'
+    if (device == "cuda" and input.is_cuda) or (device == "cpu" and not input.is_cuda):


这里直接判断 device == input.placement.device_type 吧，后续如果加了新设备类型也不用改这里了

daquexian · 2021-08-04T01:36:34Z

一号进程的输出是不是贴错了

clackhan · 2021-08-04T02:13:35Z

一号进程的输出是不是贴错了

是的，粘错了，已更新

github-actions · 2021-08-04T07:07:02Z

Speed stats:

GPU Name: GeForce GTX 1080 

PyTorch resnet50 time: 138.9ms (= 6947.4ms / 50, input_shape=[16, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 128.1ms (= 6402.7ms / 50, input_shape=[16, 3, 224, 224], backward is enabled)
Relative speed: 1.09 (= 138.9ms / 128.1ms)

PyTorch resnet50 time: 83.8ms (= 4191.3ms / 50, input_shape=[8, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 74.4ms (= 3721.9ms / 50, input_shape=[8, 3, 224, 224], backward is enabled)
Relative speed: 1.13 (= 83.8ms / 74.4ms)

PyTorch resnet50 time: 58.2ms (= 2910.3ms / 50, input_shape=[4, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 47.9ms (= 2394.5ms / 50, input_shape=[4, 3, 224, 224], backward is enabled)
Relative speed: 1.22 (= 58.2ms / 47.9ms)

PyTorch resnet50 time: 48.9ms (= 2446.6ms / 50, input_shape=[2, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 45.7ms (= 2284.0ms / 50, input_shape=[2, 3, 224, 224], backward is enabled)
Relative speed: 1.07 (= 48.9ms / 45.7ms)

PyTorch resnet50 time: 43.9ms (= 2196.8ms / 50, input_shape=[1, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 41.6ms (= 2080.0ms / 50, input_shape=[1, 3, 224, 224], backward is enabled)
Relative speed: 1.06 (= 43.9ms / 41.6ms)

clackhan added 2 commits August 3, 2021 16:23

consistent_tensor_to_api

96d3db7

add check_meta_consistency

1278c91

clackhan requested a review from lixinqi August 3, 2021 08:35

clackhan added api automerge python labels Aug 3, 2021

Update to.py

f42c524

clackhan added feature enhancement and removed feature labels Aug 3, 2021

refine

8b67d3f

clackhan requested a review from oneflow-ci-bot August 3, 2021 10:36

Merge branch 'master' into consistent_tensor_to_api

4175c80

clackhan requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 3, 2021 10:36

oneflow-ci-bot removed their request for review August 3, 2021 11:58

clackhan requested a review from daquexian August 4, 2021 01:01

Merge branch 'master' into consistent_tensor_to_api

c142e27

daquexian approved these changes Aug 4, 2021

View reviewed changes

daquexian reviewed Aug 4, 2021

View reviewed changes

Merge branch 'master' into consistent_tensor_to_api

29eeb4c

oneflow-ci-bot self-requested a review August 4, 2021 01:51

daquexian removed the automerge label Aug 4, 2021

refine

305837f

clackhan requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 4, 2021 02:13

clackhan added the automerge label Aug 4, 2021

oneflow-ci-bot removed their request for review August 4, 2021 03:40

Merge branch 'master' into consistent_tensor_to_api

1c0924b

oneflow-ci-bot self-requested a review August 4, 2021 03:40

Merge branch 'master' into consistent_tensor_to_api

cd19d88

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 4, 2021 05:06

oneflow-ci-bot merged commit 0149c14 into master Aug 4, 2021

oneflow-ci-bot deleted the consistent_tensor_to_api branch August 4, 2021 07:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

consistent_tensor_to_api #5703

consistent_tensor_to_api #5703

clackhan commented Aug 3, 2021 •

edited

Loading

github-actions bot commented Aug 3, 2021

daquexian Aug 4, 2021

clackhan Aug 4, 2021

daquexian Aug 4, 2021

clackhan Aug 4, 2021

daquexian commented Aug 4, 2021

clackhan commented Aug 4, 2021

github-actions bot commented Aug 4, 2021

consistent_tensor_to_api #5703

consistent_tensor_to_api #5703

Conversation

clackhan commented Aug 3, 2021 • edited Loading

github-actions bot commented Aug 3, 2021

daquexian Aug 4, 2021

Choose a reason for hiding this comment

clackhan Aug 4, 2021

Choose a reason for hiding this comment

daquexian Aug 4, 2021

Choose a reason for hiding this comment

clackhan Aug 4, 2021

Choose a reason for hiding this comment

daquexian commented Aug 4, 2021

clackhan commented Aug 4, 2021

github-actions bot commented Aug 4, 2021

clackhan commented Aug 3, 2021 •

edited

Loading