
Some questions about the paper #2

Closed
kwea123 opened this issue Nov 11, 2020 · 5 comments

Comments


kwea123 commented Nov 11, 2020

Hi, thanks for the great work!

I have some questions:

  1. How much computational overhead does the CNN feature extraction introduce? At inference it's probably not much, since we only need one forward pass per image and can store the features in a buffer; but at training we have to run it on the entire images at every iteration while training on only a very small portion of rays (800-1000). So isn't that somewhat inefficient and slow, or do you have some clever implementation to accelerate this part?
  2. As for generalization, is it correct to understand that the model only generalizes to objects within the same class (as in the ShapeNetV2 experiments) with very similar visual and pose settings? For example, if we train on 7 NeRF-synthetic scenes, does it generalize to the 8th?
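To make question 1 concrete, here is a minimal sketch of the caching idea: run the (expensive) encoder once per image, then at each iteration gather only the features at the sampled ray pixels. This is not code from the repo; `extract_features`, `FeatureCache`, and the nearest-neighbour lookup are all hypothetical stand-ins.

```python
import numpy as np

def extract_features(image):
    # Stand-in for an expensive CNN encoder; a real model would run a
    # conv net and return an (H, W, C) feature map.
    return np.stack([image.mean(axis=-1)] * 4, axis=-1)

class FeatureCache:
    """Run the extractor once per image and reuse the result."""
    def __init__(self):
        self._cache = {}

    def get(self, image_id, image):
        if image_id not in self._cache:  # cache miss: pay the cost once
            self._cache[image_id] = extract_features(image)
        return self._cache[image_id]

def sample_ray_features(feat_map, pixel_xy):
    # Gather features only at the ~1000 sampled ray pixels instead of
    # re-encoding the whole image (nearest neighbour for simplicity;
    # bilinear interpolation would be the usual choice).
    xs, ys = pixel_xy[:, 0], pixel_xy[:, 1]
    return feat_map[ys, xs]

cache = FeatureCache()
img = np.random.rand(8, 8, 3)
feats = cache.get("view0", img)        # encoder runs here...
feats_again = cache.get("view0", img)  # ...but not here
rays = np.array([[1, 2], [5, 7]])
ray_feats = sample_ray_features(feats, rays)  # shape (2, 4)
```

The saving at training time comes from the last function: the per-iteration cost scales with the number of sampled rays, not the image resolution.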
alextrevithick (Owner)

Thanks for your interest in our work!

  1. The computational overhead of the feature extraction is expensive in both time and space, and you're right, some implementations allow us to cache the features.
  2. That's a really important question, and we are working on it right now. Personally, I'm pretty sure it could generalize across object classes if it were trained on multiple classes. As for the synthetic question, we will know the answer soon.

alextrevithick (Owner)

Here is a result for your final question. This was a model trained on just 4 synthetic scenes, not including lego.
[attached image: 200rendert]


kwea123 commented Nov 17, 2020

It seems it doesn't generalize that well in this case. If you then fine-tune the model on the lego scene from this point, does it take less time to reach the same (or better) performance compared to training on the lego scene from scratch?


alextrevithick commented Nov 17, 2020

Thanks for your insightful question. Here is an example after 1,000 iterations (it usually takes 250k iterations from scratch). It seems the model can reach good performance very fast.
[attached image: ficus_100rendert]
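The speed-up from fine-tuning is essentially warm-starting the optimization near a good solution. A toy illustration of that effect (pure NumPy, entirely unrelated to the actual NeRF training code): fitting a 1-D linear model by gradient descent, starting either from scratch or from a "pretrained" weight close to the optimum.

```python
import numpy as np

def steps_to_converge(w, lr=0.05, tol=1e-3):
    # Gradient descent on least squares for y = 3x; counts iterations
    # until the weight is within tol of the optimum w* = 3.
    x = np.array([1.0, 2.0, 3.0])
    y = 3.0 * x
    steps = 0
    while abs(w - 3.0) > tol:
        grad = 2.0 * np.mean((w * x - y) * x)  # d/dw of mean squared error
        w -= lr * grad
        steps += 1
    return steps

from_scratch = steps_to_converge(0.0)  # cold start
fine_tuned = steps_to_converge(2.9)    # "pretrained" start near the optimum
```

The warm start reaches the tolerance in fewer steps; the 1k-vs-250k observation above is the same behaviour on a vastly harder problem.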


kwea123 commented Nov 17, 2020

Wow, just 1k iterations! That's really fast convergence! How long does it take to actually reach the same level of performance? From the image I'd say it's still at about 26 to 27 PSNR, while the best model trained from scratch can reach 32 to 33.
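For reference, the PSNR figures quoted here are computed from the mean squared error between rendered and ground-truth images. This helper is a generic definition of the metric, not code from the repo:

```python
import numpy as np

def psnr(pred, gt, max_val=1.0):
    # Peak signal-to-noise ratio in dB for images scaled to [0, max_val].
    mse = np.mean((pred - gt) ** 2)
    return 20.0 * np.log10(max_val) - 10.0 * np.log10(mse)

# A uniform error of 0.1 on a [0, 1] image gives MSE = 0.01, i.e. 20 dB.
gt = np.zeros((4, 4))
pred = np.full((4, 4), 0.1)
value = psnr(pred, gt)
```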

@kwea123 kwea123 closed this as completed Jul 19, 2021