
Replace numpy transpose with torch permute to speed-up #9533

Merged
merged 12 commits into from Jan 4, 2023

Conversation

Min-Sheng
Copy link

Motivation

numpy.transpose() is much slower than torch.permute() according to my benchmarks in a Jupyter notebook:

  1. If the input image size is (1366, 800, 3), the typical size of a COCO dataset image:
import numpy as np
import torch
img = np.random.randn(1366, 800, 3)
%%timeit
img_np = np.ascontiguousarray(img.transpose(2, 0, 1))
input = torch.from_numpy(img_np)

Output: 7.69 ms ± 314 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

%%timeit
input = torch.from_numpy(img).permute(2, 0, 1).contiguous()

Output: 1.65 ms ± 123 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

  2. If the input image is large, especially when running inference on a large image ((3648, 5472, 3) in my case):
img = np.random.randn(3648, 5472, 3)
%%timeit
img_np = np.ascontiguousarray(img.transpose(2, 0, 1))
input = torch.from_numpy(img_np)

Output: 327 ms ± 1.13 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%%timeit
input = torch.from_numpy(img).permute(2, 0, 1).contiguous()

Output: 93.8 ms ± 4.77 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
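The two benchmarks above can be reproduced outside a notebook with a small self-contained script using the standard-library timeit module instead of the %%timeit magic (a sketch; the absolute timings will vary by machine, but the relative ordering is what matters):

```python
# Self-contained version of the benchmarks above, runnable as a plain script.
import timeit

import numpy as np
import torch

img = np.random.randn(1366, 800, 3)

def via_numpy():
    # transpose HWC -> CHW in numpy, then copy into a contiguous buffer
    img_np = np.ascontiguousarray(img.transpose(2, 0, 1))
    return torch.from_numpy(img_np)

def via_torch():
    # wrap the array first, then permute and make contiguous in torch
    return torch.from_numpy(img).permute(2, 0, 1).contiguous()

t_np = timeit.timeit(via_numpy, number=50)
t_torch = timeit.timeit(via_torch, number=50)
print(f"numpy path: {t_np / 50 * 1e3:.2f} ms/loop")
print(f"torch path: {t_torch / 50 * 1e3:.2f} ms/loop")

# Both paths must produce identical CHW tensors.
assert torch.equal(via_numpy(), via_torch())
```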

Modification

Replace the transpose operation numpy.transpose(2, 0, 1) with torch.permute(2, 0, 1) in ImageToTensor and DefaultFormatBundle to speed up the process.
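In essence (a sketch, not the exact mmdet code), the change swaps which library performs the HWC-to-CHW rearrangement:

```python
# Sketch of the change described above: the old path transposes in numpy,
# the new path permutes in torch. Function names are illustrative only.
import numpy as np
import torch

def image_to_tensor_old(img: np.ndarray) -> torch.Tensor:
    # before: HWC -> CHW via numpy.transpose, copying on the numpy side
    return torch.from_numpy(np.ascontiguousarray(img.transpose(2, 0, 1)))

def image_to_tensor_new(img: np.ndarray) -> torch.Tensor:
    # after: wrap first, then HWC -> CHW via torch.permute
    return torch.from_numpy(img).permute(2, 0, 1).contiguous()

img = np.random.randn(32, 48, 3)
assert torch.equal(image_to_tensor_old(img), image_to_tensor_new(img))
```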

@CLAassistant
Copy link

CLAassistant commented Dec 26, 2022

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ Min-Sheng
❌ vincentwu1


vincentwu1 does not appear to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

@ZwwWayne
Copy link
Collaborator

Hi @Min-Sheng ,
Thanks for your kind PR. Would you like to also update the docstring or code to indicate this issue?

@Min-Sheng
Copy link
Author

Hi @Min-Sheng , Thanks for your kind PR. Would you like to also update the docstring or code to indicate this issue?

I have updated both the docstring and code.
By the way, I found that if the input numpy array is non-contiguous,

img = np.random.randn(1366, 800, 3)
img = img[..., ::-1]

use

input = torch.from_numpy(np.ascontiguousarray(img.transpose(2, 0, 1)))

Output: 7.58 ms ± 118 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

is faster than

input = torch.from_numpy(np.ascontiguousarray(img)).permute(2, 0, 1).contiguous()

Output: 14.5 ms ± 669 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

So, I use numpy's C_CONTIGUOUS flag to check array contiguity and switch the order of the transpose and to_tensor operations accordingly.
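The contiguity-aware switch described above can be sketched as follows (to_tensor_fast is a hypothetical helper name, not the actual mmdet function): contiguous arrays take the torch permute path, while non-contiguous ones, such as BGR-to-RGB channel-reversed views, take the numpy transpose path.

```python
# Sketch of the contiguity check: choose the faster conversion path
# depending on whether the input array is C-contiguous.
import numpy as np
import torch

def to_tensor_fast(img: np.ndarray) -> torch.Tensor:
    if img.flags.c_contiguous:
        # contiguous input: wrap first, then permute in torch
        return torch.from_numpy(img).permute(2, 0, 1).contiguous()
    # non-contiguous input (e.g. a channel-reversed view): copy once
    # in numpy, since torch.from_numpy cannot wrap negative strides
    return torch.from_numpy(np.ascontiguousarray(img.transpose(2, 0, 1)))

contig = np.random.randn(32, 48, 3)
flipped = contig[..., ::-1]  # non-contiguous channel-reversed view
assert torch.equal(to_tensor_fast(contig),
                   torch.from_numpy(contig).permute(2, 0, 1))
```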

@ZwwWayne
Copy link
Collaborator

Hi @Min-Sheng

Thanks for your kind PR. It seems that the CLA is not signed. Could you sign the CLA so that we can merge this PR after review? You can check the contents and follow the instructions in the communication box shown below.

@Min-Sheng
Copy link
Author

Hi @Min-Sheng

Thanks for your kind PR. It seems that the CLA is not signed. Could you sign the CLA so that we can merge this PR after review? You can check the contents and follow the instructions in the communication box shown below.

Everything is ready for merging.

@ZwwWayne ZwwWayne requested a review from RangiLyu January 3, 2023 03:04
@ZwwWayne ZwwWayne changed the base branch from master to dev January 3, 2023 03:30
@ZwwWayne ZwwWayne added this to the 2.28.0 milestone Jan 3, 2023
@codecov
Copy link

codecov bot commented Jan 3, 2023

Codecov Report

Base: 64.15% // Head: 64.14% // Decreases project coverage by -0.01% ⚠️

Coverage data is based on head (679284e) compared to base (31c8495).
Patch coverage: 100.00% of modified lines in pull request are covered.

❗ Current head 679284e differs from pull request most recent head 25c6efa. Consider uploading reports for the commit 25c6efa to get more accurate results

Additional details and impacted files
@@            Coverage Diff             @@
##              dev    #9533      +/-   ##
==========================================
- Coverage   64.15%   64.14%   -0.02%     
==========================================
  Files         361      361              
  Lines       29583    29586       +3     
  Branches     5033     5034       +1     
==========================================
- Hits        18980    18978       -2     
- Misses       9599     9601       +2     
- Partials     1004     1007       +3     
Flag Coverage Δ
unittests 64.12% <100.00%> (-0.01%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
mmdet/datasets/pipelines/formatting.py 68.54% <100.00%> (+0.77%) ⬆️
mmdet/core/bbox/samplers/random_sampler.py 75.00% <0.00%> (-5.56%) ⬇️
mmdet/models/roi_heads/mask_heads/maskiou_head.py 87.35% <0.00%> (-2.30%) ⬇️
mmdet/utils/misc.py 62.22% <0.00%> (-2.23%) ⬇️

☔ View full report at Codecov.

@ZwwWayne ZwwWayne merged commit cf43a1b into open-mmlab:dev Jan 4, 2023
MeowZheng added a commit to open-mmlab/mmsegmentation that referenced this pull request Feb 15, 2023
… speed-up (#2604)

## Motivation

Original motivation was after [MMDetection PR
#9533](open-mmlab/mmdetection#9533)

With several experiments I found that if an ndarray is contiguous,
numpy.transpose + torch.contiguous performs better, while if not,
numpy.ascontiguousarray + numpy.transpose is faster.

## Modification

Replace numpy.ascontiguousarray with torch.contiguous in
[PackSegInputs](https://github.com/open-mmlab/mmsegmentation/blob/1.x/mmseg/datasets/transforms/formatting.py)

Co-authored-by: MeowZheng <meowzheng@outlook.com>
@OpenMMLab-Assistant001
Copy link

Hi @Min-Sheng! First of all, we want to express our gratitude for your significant PR in the MMDet project. Your contribution is highly appreciated, and we are grateful for your efforts in helping improve this open-source project during your personal time. We believe that many developers will benefit from your PR.

We would also like to invite you to join our Special Interest Group (SIG) private channel on Discord, where you can share your experiences and ideas and build connections with like-minded peers. To join the SIG channel, simply message the moderator OpenMMLab on Discord, or briefly share your open-source contributions in the #introductions channel and we will assist you. We look forward to seeing you there! Join us: https://discord.gg/UjgXkPWNqA

If you have a WeChat account, welcome to join our community on WeChat. You can add our assistant: openmmlabwx. Please add "mmsig + GitHub ID" as a remark when adding friends :)
Thank you again for your contribution!❤

thmegy pushed a commit to thmegy/mmdetection that referenced this pull request May 5, 2023