Perform `degamma` and `gamma` conversions on user request #32

MarijnS95 · 2023-08-26T09:00:04Z

Fixes #25

This crate can't assume that the input and output are linear, nor did it correct for that in the `test` example, where the output from `stb_image` is clearly nonlinear (the "docs" don't state this, but it is visible from the fact that it doesn't linearize JPG and PNG inputs and applies a gamma of 1/2.2 when converting HDR to LDR). While we could ask users to pre-correct for this and return linear output to them, it is more efficient to do it within the downsampling algorithm that already runs over all the pixels, and (more importantly!) requiring these parameters in the input forces the caller to think about it.

Unfortunately this has a massive performance regression of roughly 150% (from about 40 ms to about 107 ms):

Downsample `square_test.png` using ispc_downsampler
                    time:   [106.94 ms 107.05 ms 107.18 ms]
                    change: [+146.31% +146.79% +147.27%] (p = 0.00 < 0.05)
                    Performance has regressed.

Will need to dissect if this is caused by the pointer refactor or purely the extra ALU overhead.
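For reference, the `degamma` and `gamma` steps described above can be sketched in plain Rust as follows. This is only an illustration and not the ISPC kernel code from this PR: the helper names and the `GAMMA` constant are assumptions, and it uses the 2.2 power-law approximation mentioned in the description rather than the exact piecewise sRGB curve.

```rust
/// Exponent assumed for this sketch; the description only mentions "2.2" and "1/2.2".
const GAMMA: f32 = 2.2;

/// Hypothetical helper: decode a gamma-encoded value in [0, 1] to linear light.
fn degamma(encoded: f32) -> f32 {
    encoded.powf(GAMMA)
}

/// Hypothetical helper: encode a linear value in [0, 1] for storage or display.
fn gamma(linear: f32) -> f32 {
    linear.powf(1.0 / GAMMA)
}

/// Average two gamma-encoded samples in linear light, then re-encode the result.
fn average_linear(a: f32, b: f32) -> f32 {
    gamma((degamma(a) + degamma(b)) * 0.5)
}

/// Average the encoded values directly, as happens when no conversion is applied.
fn average_naive(a: f32, b: f32) -> f32 {
    (a + b) * 0.5
}
```

Averaging a black and a white sample shows the difference: the naive average stays at 0.5 in encoded space, while the linear-light average re-encodes to a noticeably brighter value. Any filter that mixes pixels, downsampling included, differs in the same way.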

MarijnS95 · 2023-08-26T09:21:40Z

> Will need to dissect if this is caused by the pointer refactor or purely the extra ALU overhead.

The pointer refactor and additional `if` checks only seem to account for about 5 ms; the rest is purely `pow()` overhead. And that makes sense: I made the remark above that it "is more efficient to do it within the downsampling algorithm that already runs over all the pixels", but this algorithm visits every pixel, and applies gamma correction to it, so many times with the kernel that it becomes much less efficient.

The alternatives are either preprocessing the input (but this doesn't seem to be too fast either... 😕) or simply documenting that the input and output will be linearized and letting the user deal with this (if they need to).
(We should still have this as a separate step to make our bench and test do the correct thing.)
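One possible shape for the preprocessing alternative, assuming 8-bit input channels, is a lookup table: `powf` is then evaluated 256 times in total instead of once per kernel tap. This is a hypothetical sketch, not what the PR implements, and the function names are invented for illustration. It makes no claim about actual timings in this crate; the comment above already notes that preprocessing was not obviously fast.

```rust
/// Build a 256-entry table mapping 8-bit gamma-encoded values to linear floats.
/// The 2.2 exponent is the approximation mentioned in this PR, not the exact sRGB curve.
fn build_degamma_lut() -> [f32; 256] {
    let mut lut = [0.0f32; 256];
    for (i, entry) in lut.iter_mut().enumerate() {
        *entry = (i as f32 / 255.0).powf(2.2);
    }
    lut
}

/// Linearize a whole 8-bit buffer once, before any resampling kernel runs.
/// Every byte is treated as a colour channel here; an alpha channel would
/// normally be left untouched.
fn linearize(pixels: &[u8], lut: &[f32; 256]) -> Vec<f32> {
    pixels.iter().map(|&p| lut[p as usize]).collect()
}
```

The trade-off is memory traffic: the linearized buffer is four times the size of the 8-bit input, which may be part of why a separate pass is not automatically a win.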

Jasper-Bekkers · 2023-10-23T15:10:12Z

benches/basic.rs

+        let params = Parameters {
+            // Input stb Image is gamma-corrected (i.e. expects to be passed through a CRT with exponent 2.2)
+            degamma: true,
+            // Output image is PNG which must be stored with a gamma of 1/2.2
+            gamma: true,
+        };


I think both of these can be merged into one parameter.

src/ispc/kernels/lanczos3.ispc

Before:

Downsample `square_test.png` using ispc_downsampler
                    time:   [43.438 ms 43.468 ms 43.500 ms]

After:

Downsample `square_test.png` using ispc_downsampler
                    time:   [29.891 ms 29.922 ms 29.953 ms]
                    change: [-31.246% -31.162% -31.077%] (p = 0.00 < 0.05)


MarijnS95 · 2024-05-03T09:05:09Z

PR #48 seems to have added unorm and snorm texture formats in addition to the already existing Srgb(a) format. It is not used yet, but that will be the right enum to key the request for (de)linearization off going forward!

MarijnS95 · 2024-05-03T09:06:16Z

In that sense I do think this is now more of a bug, in that we expose the Srgb(a) vs Rgb(a)U/Snorm capability to the caller but don't act on it.
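Keying (de)linearization off the format, rather than off separate `degamma`/`gamma` booleans, could look roughly like the sketch below. The `Format` enum here is a hypothetical stand-in: the variant names and the `needs_linearization` method are invented for illustration and are not the crate's actual API.

```rust
/// Hypothetical stand-in for the crate's format enum; variant names are illustrative only.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Format {
    Rgba8Unorm,
    Rgba8Snorm,
    Srgba8,
}

impl Format {
    /// Whether samples must be linearized before filtering and re-encoded afterwards.
    fn needs_linearization(self) -> bool {
        matches!(self, Format::Srgba8)
    }
}
```

With something like this, the caller states the format once and the downsampler decides whether to convert, which also covers the earlier review suggestion of merging both flags into one parameter.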

Jasper-Bekkers reviewed Oct 23, 2023


MarijnS95 force-pushed the revert-7x7-kernel branch from 06f52a4 to 9b795a5 Compare October 23, 2023 15:25

MarijnS95 force-pushed the degamma branch from 86ade28 to dbf16ab Compare October 23, 2023 15:28

MarijnS95 force-pushed the revert-7x7-kernel branch from 9b795a5 to b9341a8 Compare October 23, 2023 19:18

Base automatically changed from revert-7x7-kernel to main October 23, 2023 20:06

MarijnS95 force-pushed the degamma branch 2 times, most recently from d79e664 to 35850c1 Compare October 27, 2023 14:03

MarijnS95 added 4 commits November 21, 2023 20:45

WIP: Preprocess degamma

8df604a

fixup! Perform degamma and gamma conversions on user request

0be3323

MarijnS95 force-pushed the degamma branch from 35850c1 to 0be3323 Compare November 21, 2023 20:41
