Question about the radius setting. #5

YuDeng · 2021-07-21T06:26:53Z

Hi, a wonderful work!

I am wondering why the radius of the cone is set to r=2/sqrt(12)*pixel_size.

I know that this setting is to ensure that the variance of the cone matches that of the pixel in world coordinate space. I'm just curious about the derivation. Could you please give me some hint on how to get to this result?

Thanks,
Yu

jonbarron · 2021-07-22T17:57:13Z

The derivation follows from the fact that the variance of a uniform distribution over [0, 1] is 1/12: https://en.wikipedia.org/wiki/Continuous_uniform_distribution#Moments. So if you want a standard deviation, you take the square root of variance, and then scale it by the pixel size. The factor of 2 is to turn a diameter into a radius, I think, though I'm not sure.

YuDeng · 2021-07-23T05:29:25Z

Thanks for the reply.

I can get 1/sqrt(12) for the std of x or y. What I don't quite understand here is that why to multiply 2 with the std. If we want to turn diameter into radius, shouldn't we divide std by 2 so that the final result turns to be 1/(2*sqrt(12)) ?

jonbarron · 2021-07-23T17:13:40Z

yeah, that sounds right. I remember having the same thought and then verifying that this was correct, albeit counter-intuitive. I might have messed up the math though! That part of the code doesn't have unit test coverage.

kwea123 · 2021-12-06T08:25:56Z

Btw, why is here -2?

mipnerf/internal/datasets.py

Line 196 in ee5cbb8

dx = np.concatenate([dx, dx[:, -2:-1, :]], 1)

dx is the difference to its neighbor, so for the rightmost pixel it should be simply dx[:, -1:], isn't it?
For example, let's assume we have an image of width 3, with the above code it means the radius of the rightmost pixel is the direction difference between pixel 0 and pixel 1, which is confusing to me.

kwea123 · 2021-12-10T01:40:22Z

@jonbarron @YuDeng

The derivation follows from the fact that the variance of a uniform distribution over [0, 1] is 1/12

That's correct, but I think that it's only valid if you generate rays randomly and uniformly inside pixels. However, in your ray generation

mipnerf/internal/datasets.py

Lines 183 to 186 in ee5cbb8

    
           camera_dirs = np.stack( 
        
               [(x - self.w * 0.5 + 0.5) / self.focal, 
        
                -(y - self.h * 0.5 + 0.5) / self.focal, -np.ones_like(x)], 
        
               axis=-1)

The rays are always fixed to be passing through pixel centers, so I don't know why uniform distribution comes into play here.

From the description, r is the radius of a pixel in world coordinate (same as camera coordinate actually), so it should be as simple as dx/2 here, instead of 2*dx/sqrt(12):

mipnerf/internal/datasets.py

Line 200 in ee5cbb8

radii = dx[..., None] * 2 / np.sqrt(12)

I've tried changing it to dx/(2*sqrt(12)) as the discussion above, but found it to produce inferior quality. I wonder if you have done any experiments on this r, otherwise I think dx/2 should be the correct way... and numerically 1/2 is close to 2/sqrt(12), so maybe that's why your 2/sqrt(12) also works? But that value is not based on any valid explanation.

Hope to get some more clarification, thanks.

jonbarron · 2021-12-17T23:17:08Z

I think you might be right. I also re-derived this math a while back and got the answer you got, and I also tried it out and saw that it hurt performance, so I stuck with the math that we have here. It's entirely possible that there's a bug in the math/code here, which is definitely worth exploring. It's also possible that there is some advantage of using a radius that is slightly bigger or smaller than the true radius. For example, if you used a wider radius, you'd basically be having NeRF use a "camera" or "sensor" whose PSF has been blurred slightly, which maybe is a good thing in some circumstances? Maybe the best thing to do here is to introduce some radius_multiplier=~4 hyperparameter, and set the code to use radii = radius_multiplier * dx / (2*sqrt(12)). This would be fix this bug, but not degrade performance. And you could imagine tuning this hyperparameter depending on what assumptions you want to make about the optical prefilter that your data is using. I don't find this answer particularly satisfying but it's the best I can come up with. This definitely merits further investigation from my end (which may or may not happen) and I'll let you know if I figure something out.

bsuleymanov · 2022-02-12T08:14:58Z

@kwea123 @jonbarron I had another explanation why 2/sqrt(12) worked, but the derivation was different.

We have a square (in case where we use only dx and ignore dy) pixel with width dx. Area of that pixel is dx^2. When we trace a cone we want it basis to be the circle with area approximately equal to the pixel's area. Area of the cone's basis is pi * r^2. pi * r^2 = dx^2 => r = dx / sqrt(pi) ~ dx / 0.56. 2 * dx / sqrt(12) ~ 0.577.

jonbarron · 2022-02-14T17:13:36Z

Oh cool, I think this is what I did! Sorry this is so complicated, I didn't do a good job of recording my notes when working through the math for this paper.

jonbarron · 2022-02-14T17:15:40Z

Yeah I remember first using dx as the radius, which gives you a circle that is inscribed inside of the pixel, and then thinking "no, it should be bigger than that" and I must have tried to match the area or do something similar. I think what I ended up doing was taking the mean or geometric mean of the radius you get when you inscribe the circle and circumscribe the circle.

longbowzhang · 2022-06-28T04:20:05Z

Hi all,
I just would like to make sure that the setting of radius has nothing to do with neither the variance of the pixel’s footprint as mentioned in the paper nor a normal distribution, right?

jonbarron · 2022-06-30T20:16:48Z

I'm not sure I understand the question. The radius of a pixel is a direct function of the size of the pixel's footprint on the image plane. Then the radius is used to construct a normal distribution.

longbowzhang · 2022-07-01T02:48:20Z

Hi @jonbarron,

[1] > Hi all, I just would like to make sure that the setting of radius has nothing to do with neither the variance of the pixel’s footprint as mentioned in the paper nor a normal distribution, right?

normal distribution should be corrected to uniform distribution (sorry for my mistake).
Based on your answer #5 (comment), the radius is derived mainly from a geometric perspective. Then its value has nothing to do with a distribution/variance. Right?

[2] Besides, I still do not know where the sqrt(12) comes from. I checked the arithmetic mean and the geometric mean of the radius of the inscribed circle and circumscribed circle, and found out that neither gives a sqrt(12). (Hope I do not get the math wrong again)

[3] Furthermore, I am wondering this radius setting is constant for all the pixels across the image plane, even for pixels near the border where the intersection of the cone and the image plane is an ellipsoid instead of a circle?

jonbarron · 2022-07-05T16:19:32Z

sqrt(12) is just the standard deviation of a uniform distribution defined on [0, 1]. Here I'm deriving it using the definition of variance (E[x^2] - E[x]^2): https://www.wolframalpha.com/input?i=Sqrt%5BIntegrate%5Bx%5E2%2C+%7Bx%2C+0%2C+1%7D%5D+-+Integrate%5Bx%2C+%7Bx%2C+0%2C+1%7D%5D%5E2%5D

longbowzhang · 2022-07-06T03:14:50Z

I finally figure it out that the variance of uniform distribution within the circle of radius d is (d^2)/4 https://www.wolframalpha.com/input?i=Integrate%5B1%2F%28Pi*d%5E2%29*r*%28r*sin%28theta%29%29%5E2%2C+%7Br%2C+0%2C+d%7D%2C+%7Btheta%2C+0%2C+2*Pi%7D%5D. By setting it equal to 1/12, we can easily derive that d = 2/sqrt(12).

qhdqhd · 2022-12-30T09:28:02Z

Is dx all the same for different pixel rays?

`

Distance from each unit-norm direction vector to its x-axis neighbor.

dx = [
    np.sqrt(np.sum((v[:-1, :, :] - v[1:, :, :])**2, -1)) for v in directions
]

dx = [np.concatenate([v, v[-2:-1, :]], 0) for v in dx]

# Cut the distance in half, and then round it out so that it's
# halfway between inscribed by / circumscribed about the pixel.

radii = [v[..., None] * 2 / np.sqrt(12) for v in dx]`

qhdqhd · 2022-12-31T11:29:24Z

@longbowzhang
for [3]:
I think dx is all the same for different pixel rays. 这个半径设置对于图像平面上的所有像素都是恒定的.

longbowzhang · 2023-01-03T03:43:25Z

@qhdqhd it seems like the authors set that value constant for all the pixels even though mathematically I personally think dx should be adaptive for every pixel.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about the radius setting. #5

Question about the radius setting. #5

YuDeng commented Jul 21, 2021

jonbarron commented Jul 22, 2021

YuDeng commented Jul 23, 2021

jonbarron commented Jul 23, 2021

kwea123 commented Dec 6, 2021 •

edited

kwea123 commented Dec 10, 2021

jonbarron commented Dec 17, 2021

bsuleymanov commented Feb 12, 2022

jonbarron commented Feb 14, 2022

jonbarron commented Feb 14, 2022

longbowzhang commented Jun 28, 2022

jonbarron commented Jun 30, 2022

longbowzhang commented Jul 1, 2022 •

edited

jonbarron commented Jul 5, 2022

longbowzhang commented Jul 6, 2022

qhdqhd commented Dec 30, 2022 •

edited

qhdqhd commented Dec 31, 2022 •

edited

longbowzhang commented Jan 3, 2023

Question about the radius setting. #5

Question about the radius setting. #5

Comments

YuDeng commented Jul 21, 2021

jonbarron commented Jul 22, 2021

YuDeng commented Jul 23, 2021

jonbarron commented Jul 23, 2021

kwea123 commented Dec 6, 2021 • edited

kwea123 commented Dec 10, 2021

jonbarron commented Dec 17, 2021

bsuleymanov commented Feb 12, 2022

jonbarron commented Feb 14, 2022

jonbarron commented Feb 14, 2022

longbowzhang commented Jun 28, 2022

jonbarron commented Jun 30, 2022

longbowzhang commented Jul 1, 2022 • edited

jonbarron commented Jul 5, 2022

longbowzhang commented Jul 6, 2022

qhdqhd commented Dec 30, 2022 • edited

qhdqhd commented Dec 31, 2022 • edited

longbowzhang commented Jan 3, 2023

kwea123 commented Dec 6, 2021 •

edited

longbowzhang commented Jul 1, 2022 •

edited

qhdqhd commented Dec 30, 2022 •

edited

qhdqhd commented Dec 31, 2022 •

edited