[proto] Speed up adjust color ops #6784

vfdev-5 · 2022-10-18T10:25:58Z

Description:

Rewritten _blend op in prototype space
Simplified adjust_brightness
Optimized _rgb_to_gray
Removed redundant checks
Follow-up PR to [NOMERGE] Add optimized _blend #6765

Results:

[------------------ Adjust_brightness cpu torch.uint8 ------------------]
                     |  adjust_brightness stable  |  adjust_brightness v2
1 threads: --------------------------------------------------------------
      (3, 400, 400)  |            1080            |          483         

Times are in microseconds (us).

[----------------- Adjust_contrast cpu torch.uint8 -----------------]
                     |  adjust_contrast stable  |  adjust_contrast v2
1 threads: ----------------------------------------------------------
      (3, 400, 400)  |           1150           |         926        

Times are in microseconds (us).

[------------------ Adjust_saturation cpu torch.uint8 ------------------]
                     |  adjust_saturation stable  |  adjust_saturation v2
1 threads: --------------------------------------------------------------
      (3, 400, 400)  |            1200            |          935         

Times are in microseconds (us).

[------------------ Adjust_sharpness cpu torch.uint8 -----------------]
                     |  adjust_sharpness stable  |  adjust_sharpness v2
1 threads: ------------------------------------------------------------
      (3, 400, 400)  |            4.06           |          3.64       

Times are in milliseconds (ms).

[----------------- Adjust_brightness cpu torch.float32 -----------------]
                     |  adjust_brightness stable  |  adjust_brightness v2
1 threads: --------------------------------------------------------------
      (3, 400, 400)  |            665             |          206         
6 threads: --------------------------------------------------------------
      (3, 400, 400)  |            167             |           57         

Times are in microseconds (us).

[---------------- Adjust_contrast cpu torch.float32 ----------------]
                     |  adjust_contrast stable  |  adjust_contrast v2
1 threads: ----------------------------------------------------------
      (3, 400, 400)  |           691            |         455        
6 threads: ----------------------------------------------------------
      (3, 400, 400)  |           248            |         162        

Times are in microseconds (us).

[----------------- Adjust_saturation cpu torch.float32 -----------------]
                     |  adjust_saturation stable  |  adjust_saturation v2
1 threads: --------------------------------------------------------------
      (3, 400, 400)  |            728             |          466         

Times are in microseconds (us).

[----------------- Adjust_sharpness cpu torch.float32 ----------------]
                     |  adjust_sharpness stable  |  adjust_sharpness v2
1 threads: ------------------------------------------------------------
      (3, 400, 400)  |            3100           |          2860       

Times are in microseconds (us).

More results in logs, code

Logs with #6765 code + simplified adjust brightness : https://github.com/vfdev-5/tvapiv2_benchmarks/blob/ba35a27f14fd77123ea96f0c54ac9e2fb22f0f15/output/20221018-115341-output-adjust-color-ops_datumbox.log

…lend-op

test/test_prototype_transforms_consistency.py

torchvision/prototype/transforms/functional/_color.py

torchvision/prototype/transforms/functional/_meta.py

pmeier

One more perf improvement. Otherwise LGTM if CI is green. Thanks Victor!

pmeier · 2022-10-20T09:42:25Z

torchvision/prototype/transforms/functional/_color.py

+    grayscale_image = _rgb_to_gray(image) if c == 3 else image
+    mean = torch.mean(grayscale_image.to(dtype), dim=(-3, -2, -1), keepdim=True)


Suggested change

grayscale_image = _rgb_to_gray(image) if c == 3 else image

mean = torch.mean(grayscale_image.to(dtype), dim=(-3, -2, -1), keepdim=True)

grayscale_image = _rgb_to_gray(image.to(dtype)) if c == 3 else image.to(dtype)

mean = torch.mean(grayscale_image, dim=(-3, -2, -1), keepdim=True)

This saves one conversion in _rgb_to_gray in case the input is uint8: _rgb_to_gray would convert the output of its floating point computation back to uint8 just for the result being converted back to floating point in before the torch.mean call.

@pmeier actually, doing so the output of mean is not the same for uint8 image.
As we originally applied uint8 cast in the end of _rgb_to_gray we get rid of all floating point values. This is not the case if image is casted to float before _rgb_to_gray.
Here is an example of difference for mean:

tensor([[[125.9521]]]) # original implementation # vs tensor([[[126.4607]]]) # cast to float before `_rgb_to_gray`.

So, finally consistency tests report for example:

Mismatched elements: 256 / 2772 (9.2%) Greatest absolute difference: 3 at index (1, 2, 4, 21) (up to 1e-05 allowed) Greatest relative difference: 0.25 at index (2, 0, 6, 21) (up to 1e-05 allowed)

and this is a real failure, IMO.

I agree that the behavior changes, but IMO repeatedly converting to uint8 in the computation and thus eliminating intermediate values sounds more like a missed opportunity in the original kernel than a bug now. Thus, I would consider this more like a "bug fix" rather than a BC breaking change. On the other hand, that is not a strong opinion. Not going to block over this.

datumbox

LGTM, looks great!

datumbox · 2022-10-20T10:41:28Z

torchvision/prototype/transforms/functional/_color.py

+    ratio = float(ratio)
+    fp = image1.is_floating_point()
+    bound = 1.0 if fp else 255.0
+    output = image1.mul(ratio).add_(image2, alpha=(1.0 - ratio)).clamp_(0, bound)


I like this! Supersedes the work at #6765

vfdev-5 · 2022-10-20T13:21:23Z

This implementation is not consistent with previous behaviour. Valid time measurements are reported in the description.

~~With the latest commit we reduce runtime for adjust_contrast:~~

This reverts commit a82cf8c.

…to proto-improve-blend-op

…lend-op

Summary: * WIP * _blend optim v1 * _blend and color ops optims: v2 * updated a/r tol and configs to make tests pass * Loose a/r tolerance in AA tests * Use custom rgb_to_grayscale * Renamed img -> image * nit code update * PR review * adjust_contrast convert to float32 earlier * Revert "adjust_contrast convert to float32 earlier" This reverts commit a82cf8c. Reviewed By: YosuaMichael Differential Revision: D40588170 fbshipit-source-id: b87fbbecb7490c222d990ef5c3e620d9ffe457ab

MiChatz · 2023-02-17T07:40:48Z

torchvision/prototype/transforms/functional/_color.py

+
+    fp = image.is_floating_point()
+    bound = 1.0 if fp else 255.0
+    output = image.mul(brightness_factor).clamp_(0, bound)


Issue:
This clamp_ implementation is enforcing the histogram to take values between 0 and 1 in my case is not working the way I will expect since my images are normalized to values around 0 (ex: -0.2 to 0.8) so the multiplication with the factor of 1 will not return an identical image.

Suggestion:
output = image.mul(brightness_factor).clamp_(image.min(), image.max())

@MiChatz thanks for the feedback. There is (yet unwritten) assumption for color transformations on float images that image range is between [0, 1].

vfdev-5 added 3 commits October 17, 2022 20:58

WIP

94e918c

_blend optim v1

fc4f237

_blend and color ops optims: v2

58eec29

facebook-github-bot added the cla signed label Oct 18, 2022

Merge branch 'main' into proto-improve-blend-op

ffc5c4f

vfdev-5 requested a review from datumbox October 18, 2022 10:30

vfdev-5 added module: transforms Perf For performance improvements prototype labels Oct 18, 2022

vfdev-5 added 3 commits October 18, 2022 12:38

updated a/r tol and configs to make tests pass

b7b5178

Merge branch 'main' of github.com:pytorch/vision into proto-improve-b…

4f3491a

…lend-op

Loose a/r tolerance in AA tests

2a5e4d8

vfdev-5 requested a review from pmeier October 19, 2022 12:34

pmeier reviewed Oct 19, 2022

View reviewed changes

vfdev-5 added 4 commits October 19, 2022 14:00

Use custom rgb_to_grayscale

a170513

Renamed img -> image

0b55072

Merge branch 'main' into proto-improve-blend-op

a99d6ad

nit code update

b7fdd39

vfdev-5 requested a review from pmeier October 19, 2022 21:04

pmeier reviewed Oct 20, 2022

View reviewed changes

torchvision/prototype/transforms/functional/_meta.py Outdated Show resolved Hide resolved

PR review

4117957

vfdev-5 requested a review from pmeier October 20, 2022 09:31

pmeier approved these changes Oct 20, 2022

View reviewed changes

vfdev-5 marked this pull request as draft October 20, 2022 10:24

adjust_contrast convert to float32 earlier

a82cf8c

datumbox approved these changes Oct 20, 2022

View reviewed changes

datumbox mentioned this pull request Oct 20, 2022

[NOMERGE] Add optimized _blend #6765

Closed

vfdev-5 marked this pull request as ready for review October 20, 2022 13:29

Merge branch 'main' into proto-improve-blend-op

247ed7d

vfdev-5 marked this pull request as draft October 20, 2022 14:29

vfdev-5 added 3 commits October 20, 2022 19:59

Revert "adjust_contrast convert to float32 earlier"

f19edc9

This reverts commit a82cf8c.

Merge branch 'proto-improve-blend-op' of github.com:vfdev-5/vision in…

eff6c6f

…to proto-improve-blend-op

Merge branch 'main' of github.com:pytorch/vision into proto-improve-b…

2397725

…lend-op

vfdev-5 marked this pull request as ready for review October 20, 2022 20:02

Merge branch 'main' into proto-improve-blend-op

f364efc

vfdev-5 merged commit 9f024a6 into pytorch:main Oct 21, 2022

vfdev-5 deleted the proto-improve-blend-op branch October 21, 2022 10:04

pmeier mentioned this pull request Oct 24, 2022

Performance improvements for transforms v2 vs. v1 #6818

Closed

31 tasks

MiChatz reviewed Feb 17, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[proto] Speed up adjust color ops #6784

[proto] Speed up adjust color ops #6784

vfdev-5 commented Oct 18, 2022 •

edited

Loading

pmeier left a comment

pmeier Oct 20, 2022

vfdev-5 Oct 20, 2022 •

edited

Loading

pmeier Oct 20, 2022

datumbox left a comment

datumbox Oct 20, 2022

vfdev-5 commented Oct 20, 2022 •

edited

Loading

MiChatz Feb 17, 2023 •

edited

Loading

vfdev-5 Feb 17, 2023

		grayscale_image = _rgb_to_gray(image) if c == 3 else image
		mean = torch.mean(grayscale_image.to(dtype), dim=(-3, -2, -1), keepdim=True)

[proto] Speed up adjust color ops #6784

[proto] Speed up adjust color ops #6784

Conversation

vfdev-5 commented Oct 18, 2022 • edited Loading

pmeier left a comment

Choose a reason for hiding this comment

pmeier Oct 20, 2022

Choose a reason for hiding this comment

vfdev-5 Oct 20, 2022 • edited Loading

Choose a reason for hiding this comment

pmeier Oct 20, 2022

Choose a reason for hiding this comment

datumbox left a comment

Choose a reason for hiding this comment

datumbox Oct 20, 2022

Choose a reason for hiding this comment

vfdev-5 commented Oct 20, 2022 • edited Loading

MiChatz Feb 17, 2023 • edited Loading

Choose a reason for hiding this comment

vfdev-5 Feb 17, 2023

Choose a reason for hiding this comment

vfdev-5 commented Oct 18, 2022 •

edited

Loading

vfdev-5 Oct 20, 2022 •

edited

Loading

vfdev-5 commented Oct 20, 2022 •

edited

Loading

MiChatz Feb 17, 2023 •

edited

Loading