Inquiry on Applying IDA to Generated Rays in NeRF-based Models #38

SPA-junghokim · 2024-03-30T18:10:43Z

I hope you're doing well. I'm writing to express my appreciation for your research on NeRF and its rendering techniques, particularly the application to OCC models. Your work has inspired my project, which explores integrating camera characteristics more effectively into model structures.

I have a specific question regarding the ray generation process when applying IDA. Should post-rotation and post-translation adjustments be applied to the generated rays after projection onto images? Understanding the implications of IDA on ray adjustments would greatly benefit my research.

I look forward to your insights and thank you in advance for your guidance.

Best
Jungho Kim

pmj110119 · 2024-03-31T12:46:51Z

Hi Kim, I believe it's unnecessary.

Rays are in the world coordinate system, not the camera coordinate system, so the IDA transformation applied at the image level doesn't affect rays. IDA does impact the process of converting 2D image features to 3D volume features, requiring careful handling, but it doesn't affect rays.
For a ray generated from the original image, you simply need to directly retrieve color, class, depth, and other pixel label values from the corresponding pixel. This process doesn't involve coordinate transformations.

SPA-junghokim · 2024-03-31T12:49:08Z

I really appreciate your reply. I understood your word.

Regards,
Jungho Kim

SPA-junghokim · 2024-03-31T12:50:46Z

I have another question: In the nerf_head, using the rendering loss versus not using it results in about a 7-fold difference in training time. Is it normal for it to take significantly longer?

pmj110119 · 2024-03-31T13:14:06Z

7x increase in time seems somewhat abnormal. Could you provide the time cost of one iteration, and is it similar to that observed in the example log?

SPA-junghokim · 2024-04-19T04:37:39Z

It has been confirmed to be a hardware issue with our own server. Thank you.

I have one more question:
Do we have to use separate heads for density and semantic, or can we treat the output from a single head, like in BEVDET-Occ, as density volume for areas not predicted as empty-class and render them?

pmj110119 · 2024-04-19T05:21:01Z

You can of course use output predicted by single head and then separate channels to get density and semantic logits.

However, in our early experiments, the performance of using two MLPs to predict geometry (density) and semantics respectively was slightly better, but this has not been rigorously verified.

SPA-junghokim closed this as completed Mar 31, 2024

SPA-junghokim reopened this Mar 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inquiry on Applying IDA to Generated Rays in NeRF-based Models #38

Inquiry on Applying IDA to Generated Rays in NeRF-based Models #38

SPA-junghokim commented Mar 30, 2024

pmj110119 commented Mar 31, 2024

SPA-junghokim commented Mar 31, 2024

SPA-junghokim commented Mar 31, 2024

pmj110119 commented Mar 31, 2024

SPA-junghokim commented Apr 19, 2024

pmj110119 commented Apr 19, 2024 •

edited

Loading

Inquiry on Applying IDA to Generated Rays in NeRF-based Models #38

Inquiry on Applying IDA to Generated Rays in NeRF-based Models #38

Comments

SPA-junghokim commented Mar 30, 2024

pmj110119 commented Mar 31, 2024

SPA-junghokim commented Mar 31, 2024

SPA-junghokim commented Mar 31, 2024

pmj110119 commented Mar 31, 2024

SPA-junghokim commented Apr 19, 2024

pmj110119 commented Apr 19, 2024 • edited Loading

pmj110119 commented Apr 19, 2024 •

edited

Loading