Implement blur filter with dual kawase technique #2821

gw3583 · 2018-06-17T23:33:46Z

See https://community.arm.com/cfs-file/__key/communityserver-blogs-components-weblogfiles/00-00-00-26-50/siggraph2015_2D00_mmg_2D00_marius_2D00_notes.pdf for details of quality / performance improvements.

pcwalton · 2018-08-15T22:43:38Z

I wrote a quick script to compare similarity of Gaussian blurs of various radii with various numbers of dual Kawase passes. Here are the results:

gw3583 · 2018-08-15T22:57:50Z

@pcwalton Related to your question on IRC: https://searchfox.org/mozilla-central/source/gfx/2d/FilterNodeSoftware.cpp#3099

pcwalton · 2018-08-17T04:47:24Z

I did some experimentation and tentatively came up with the following formulas to determine suitable parameters for the dual Kawase blur:

numPasses = max(1, round(4/3 * log(blurRadius)))
distance = pow(0.4538, numPasses) * blurRadius

Experimentally, these parameters create blurs that are within 0.02 (i.e. 2%) SSIM of the reference Gaussian blur.
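
As a rough illustration, the two formulas above can be written as a small Python helper. This is only a sketch under stated assumptions: the comment does not say which base `log` uses, so the natural logarithm is assumed here, and the function name `dual_kawase_params` and the round-half-up behaviour are hypothetical choices rather than anything taken from WebRender.

```python
import math


def dual_kawase_params(blur_radius):
    """Return (num_passes, distance) for a given blur radius.

    Assumptions (not confirmed by the thread): `log` is the natural
    logarithm, and `round` rounds halves upward.
    """
    if blur_radius <= 0:
        raise ValueError("blur_radius must be positive")
    # numPasses = max(1, round(4/3 * log(blurRadius)))
    raw = 4.0 / 3.0 * math.log(blur_radius)
    num_passes = max(1, int(math.floor(raw + 0.5)))
    # distance = pow(0.4538, numPasses) * blurRadius
    distance = math.pow(0.4538, num_passes) * blur_radius
    return num_passes, distance


if __name__ == "__main__":
    for radius in (1.0, 4.0, 10.0, 40.0):
        print(radius, dual_kawase_params(radius))
```

Because the number of passes is an integer, it changes in steps as the radius grows, while the sampling distance varies continuously between those steps.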

Note that with dual Kawase blurs, images tend to accumulate blocky artefacts if shrunk down too much. So after 4 downsample operations, I stop allocating smaller framebuffers. (That is, the size of each intermediate framebuffer is always at least 1/16 of the size of the original image.) Another way of thinking about this is that it looks best to switch from a dual Kawase blur to a regular Kawase blur after 4 downsample operations, effectively forming a "partial dual Kawase blur".
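
The framebuffer cap described above could be sketched as follows. This is a minimal illustration, assuming that each downsample halves both dimensions (so "1/16" is read per dimension) and that passes beyond the fourth reuse the smallest size. The function name `framebuffer_sizes` and the `max_downsamples` parameter are hypothetical.

```python
def framebuffer_sizes(width, height, num_passes, max_downsamples=4):
    """List the intermediate framebuffer size used by each pass.

    Assumption: every downsample halves width and height. After
    `max_downsamples` halvings the size stops shrinking, so later
    passes blur at a fixed resolution (the "regular Kawase" part).
    """
    sizes = []
    w, h = width, height
    for pass_index in range(num_passes):
        if pass_index < max_downsamples:
            # Still in the "dual" phase: shrink the target.
            w = max(1, w // 2)
            h = max(1, h // 2)
        sizes.append((w, h))
    return sizes


if __name__ == "__main__":
    print(framebuffer_sizes(1024, 512, 6))
```

With six passes on a 1024x512 image, the first four targets shrink and the last two stay at 64x32, which is 1/16 of the original in each dimension.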

LifeIsStrange · 2018-08-17T23:46:03Z

I don't know if it's helpful, but you could take a look at the OpenCV implementation:
https://docs.opencv.org/3.3.1/d4/d86/group__imgproc__filter.html#gaabe8c836e97159a9193fb0b11ac52cf1

Especially the CUDA-accelerated one (though it could be ported to OpenCL/SYCL):
https://docs.opencv.org/2.4/modules/gpu/doc/image_filtering.html

pcwalton · 2018-08-25T04:48:56Z

Status update: I've come up with a "dual Gaussian" blur that is a variation on the dual Kawase technique based on Gaussian blur. Essentially I downsample, then upsample repeatedly, applying a 6x6 Gaussian kernel every time I upsample. (Thanks to the bilinear filtering hardware, this only requires 10 taps, which is slightly better than the 13 taps per level of dual Kawase.)

This results in images that are very similar to the full Gaussian blur, but there is an issue: when animating blur, discontinuities are visible when crossing over the threshold at which we introduce a new downsample/upsample pass. I tried to minimize it, but I'm a bit out of ideas at the moment as to how to eliminate it entirely. I'm not sure how much we care—on balance, I'd take making all blurs faster over perfectly smooth animated blurs—but it's a bit of a bummer.

jrmuizel · 2018-08-25T17:22:53Z

Do you have a video of the transitioning radius that you can share?

pcwalton · 2018-08-25T17:28:36Z

I just came up with a way to work around the problem, I think.

… the texture filtering hardware more effectively. This new shader performs Gaussian *resampling* instead of regular convolution. It samples in between texels to reduce the number of taps. The speed is about the same as the existing technique. It is nevertheless the fastest blur method that I could come up with. In particular, it exceeds the performance of the Kawase and dual Kawase blur techniques. The speed comes from working at lower resolution and incurring fewer downsampling and upsampling passes. As is often the case, ALU performance does not really seem to be the limiting factor; it's mostly memory bandwidth, which is why downsampling is so important. Further improvements should come from not doing the downsampling at all and instead rendering the blurred content at low resolution to begin with. Closes servo#2821.
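
Sampling in between texels to reduce the number of taps can be illustrated with the general linear-sampling trick: two neighbouring texel weights are merged into one fetch at a fractional position, and the hardware's linear filter reproduces the same weighted sum. The sketch below shows that generic idea in 1D only. It is not the shader from the pull request, and the names `linear_sample` and `merge_taps` are hypothetical.

```python
import math


def linear_sample(texels, x):
    """Emulate a linearly filtered fetch at fractional position x."""
    i = int(math.floor(x))
    f = x - i
    if f == 0.0:
        return texels[i]
    return (1.0 - f) * texels[i] + f * texels[i + 1]


def merge_taps(weights):
    """Merge adjacent pairs of positive texel weights into single taps.

    weights[i] is the weight of the texel at integer offset i.
    Returns a list of (offset, weight) pairs for filtered fetches.
    """
    taps = []
    for i in range(0, len(weights) - 1, 2):
        w0, w1 = weights[i], weights[i + 1]
        w = w0 + w1
        # Fetching at i + w1/w yields (w0*t[i] + w1*t[i+1]) / w.
        taps.append((i + w1 / w, w))
    if len(weights) % 2 == 1:
        # An unpaired last weight stays as an ordinary tap.
        taps.append((float(len(weights) - 1), weights[-1]))
    return taps


if __name__ == "__main__":
    texels = [10.0, 20.0, 40.0, 80.0]
    weights = [0.125, 0.375, 0.375, 0.125]
    direct = sum(w * t for w, t in zip(weights, texels))
    merged = sum(w * linear_sample(texels, x) for x, w in merge_taps(weights))
    print(direct, merged)
```

Here four discrete taps collapse into two filtered fetches with the same result, which is the kind of saving the filtering hardware makes possible.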

gw3583 · 2019-02-04T22:20:12Z

We investigated this and determined it's not a viable technique for what we need, due to performance / quality concerns. @pcwalton can provide more detail if anyone is interested in the gory details.

This is an implementation of dual-filter Kawase blur [1], with many optimizations and quality tweaks. It was NOT shipped in any ProtonAOSP releases due to some problems with the smooth radius transitions typically used in mobile UIs. This is an inherent problem with the blur algorithm; see the Servo issue [2] for more details. The source code has been made available for reference purposes, but it is not meant for production use. Because this blur implementation was never formally released, the commit messages and history are messy, as they come from my quick iteration workflow during early development. All commits were preserved for reference, but they are not representative of my production-ready commits. [1] https://community.arm.com/cfs-file/__key/communityserver-blogs-components-weblogfiles/00-00-00-26-50/siggraph2015_2D00_mmg_2D00_marius_2D00_notes.pdf [2] servo/webrender#2821 Change-Id: I5a774f2b55140860f82ac5e6fb4400f20aed8d81

kvark added type: enhancement area: performance labels Jun 19, 2018

pcwalton self-assigned this Aug 2, 2018

pcwalton mentioned this issue Sep 5, 2018

Rewrite the blur shader to remove the variable-length loop and to use the texture filtering hardware more effectively. #3028

Closed

gw3583 closed this as completed Feb 4, 2019
