
[Feature] Add gather points op from mmdet3d #1338

Merged (8 commits) on Oct 14, 2021

Conversation

DCNSW
Copy link
Contributor

@DCNSW DCNSW commented Sep 15, 2021

Motivation

Add the gather_points CUDA operation from mmdet3d (branch: v1.0.0.dev0).

Modification

Several files in the mmcv/ops folder.

BC-breaking (Optional)

No.

Use cases (Optional)

from mmcv.ops import gather_points
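The op gathers feature columns by index. As a reference for the expected semantics, here is a NumPy sketch (not the mmcv API itself; the shapes `(B, C, N)` for features and `(B, M)` for indices are assumed from the mmdet3d usage):

```python
import numpy as np

def gather_points_ref(features, idx):
    """Reference semantics: pick columns of `features` given by `idx`.

    features: (B, C, N) array
    idx:      (B, M) integer array
    returns:  (B, C, M) array with out[b, c, m] = features[b, c, idx[b, m]]
    """
    batch_idx = np.arange(features.shape[0])[:, None, None]  # (B, 1, 1)
    chan_idx = np.arange(features.shape[1])[None, :, None]   # (1, C, 1)
    # the three index arrays broadcast together to (B, C, M)
    return features[batch_idx, chan_idx, idx[:, None, :]]
```

The CUDA op is expected to produce the same result for GPU tensors, just faster.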

@codecov
Copy link

codecov bot commented Sep 15, 2021

Codecov Report

Merging #1338 (d18d416) into master (8cac7c2) will decrease coverage by 0.08%.
The diff coverage is 40.00%.

❗ Current head d18d416 differs from pull request most recent head d4757ba. Consider uploading reports for the commit d4757ba to get more accurate results

@@            Coverage Diff             @@
##           master    #1338      +/-   ##
==========================================
- Coverage   68.59%   68.51%   -0.09%     
==========================================
  Files         164      165       +1     
  Lines       10891    10922      +31     
  Branches     1991     1993       +2     
==========================================
+ Hits         7471     7483      +12     
- Misses       3030     3047      +17     
- Partials      390      392       +2     
Flag Coverage Δ
unittests 68.51% <40.00%> (-0.09%) ⬇️


Impacted Files Coverage Δ
mmcv/ops/gather_points.py 37.50% <37.50%> (ø)
mmcv/ops/__init__.py 100.00% <100.00%> (ø)
mmcv/runner/dist_utils.py 50.00% <0.00%> (-1.07%) ⬇️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@zhouzaida zhouzaida added the CUDA label Sep 21, 2021
@zhouzaida zhouzaida mentioned this pull request Sep 24, 2021
@grimoire
Copy link
Member

grimoire commented Sep 24, 2021

Hi, I think this op can be implemented with PyTorch only:

import torch

# method0: fancy indexing; the final permute might take extra computation
def gather_point_pytorch0(features, idx):
    # features: (B, C, N), idx: (B, M) -> output: (B, C, M)
    batch_size = features.shape[0]
    batch_idx = torch.arange(batch_size, device=features.device).unsqueeze(-1)
    permute_output = features[batch_idx, :, idx]  # (B, M, C)
    return permute_output.permute(0, 2, 1)

# method1: repeat might need more memory
def gather_point_pytorch1(features, idx):
    # expand idx over the channel dimension instead of hardcoding 3 channels
    new_idx = idx.unsqueeze(1).repeat(1, features.shape[1], 1)
    return features.gather(2, new_idx)

@DCNSW
Copy link
Contributor Author

DCNSW commented Sep 29, 2021

> Hi, I think this op can be implemented with PyTorch only: (two PyTorch implementations quoted above)

In our tests, the running time of the PyTorch version is 17.3 times that of the CUDA version, so it is necessary to implement it in CUDA.

@grimoire
Copy link
Member

> We tested that the running time of the PyTorch version is 17.3 times that of the CUDA version, so it is necessary to implement it in CUDA.

Here is my timer:

import torch

def timer(func, num_test, num_warmup, *args, **kwargs):
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)

    # warmup runs are excluded from the measurement
    for _ in range(num_warmup):
        func(*args, **kwargs)

    torch.cuda.synchronize()

    start.record()
    for _ in range(num_test):
        output = func(*args, **kwargs)
    end.record()

    torch.cuda.synchronize()
    # Event.elapsed_time returns milliseconds
    print(f'{func} takes {start.elapsed_time(end) / num_test:.3f} ms.')

    return output

With num_warmup=5 and num_test=10, method0 is 3 times slower and method1 is 2 times slower than the CUDA op.
I believe a custom CUDA implementation should be faster, but 17.3 times seems too much.
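The timer above depends on CUDA events and a GPU. For a CPU-only sanity check of the same harness pattern, a hypothetical `time.perf_counter` variant (not from this thread) could look like:

```python
import time

def timer_cpu(func, num_test, num_warmup, *args, **kwargs):
    # warmup runs are excluded from the measurement
    for _ in range(num_warmup):
        func(*args, **kwargs)

    start = time.perf_counter()
    for _ in range(num_test):
        output = func(*args, **kwargs)
    elapsed_ms = (time.perf_counter() - start) * 1000 / num_test
    print(f'{func} takes {elapsed_ms:.3f} ms per call.')

    return output
```

Unlike the CUDA-event version, this measures wall-clock time and would undercount asynchronous GPU work, so it is only meaningful for CPU code paths.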

@ZwwWayne
Copy link
Collaborator

Need to resolve the conflicts and clean up DIVUP before merging.

@ZwwWayne ZwwWayne merged commit 1cd01db into open-mmlab:master Oct 14, 2021