More numerically stable lerp #18871
Conversation
The method proposed in https://math.stackexchange.com/a/1798323 could be better.
@ssnl I rewrote it based on the proposal you shared. However, I'm worried that this may not perform well. If t is fixed, there's no warp divergence; but if t depends on the data, as it does in, say, bilinear upsampling, the branch may cause warp divergence and therefore hurt performance. The counter-argument is that this would stall for only a few instructions, in a problem that's bandwidth-bound anyway.
@mkolod Maybe do a quick benchmark, if you're concerned? :)
@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@ezyang It's fine. Looks like one CI run out of 76 is having issues, but it seems to have nothing to do with the commit (it's a multiprocessing/gloo issue). I hope that's not a blocker for the merge, since it's not the lerp test that's affected, and I didn't change anything else in the code.
Summary: The C++ and CUDA implementations of the lerp are not numerically stable. This is discussed on Wikipedia [here](https://en.wikipedia.org/wiki/Linear_interpolation#Programming_language_support). I checked the GPU SASS output and there's no overhead from using the more precise implementation, from Kepler all the way to Turing. I haven't looked at CPU ASM though. Pull Request resolved: pytorch/pytorch#18871 Differential Revision: D14793438 Pulled By: ezyang fbshipit-source-id: 2ddc2e026c5285466cae7d1b4101174253100445