
about preprocess #56

Open · UestcJay opened this issue Feb 2, 2023 · 10 comments
UestcJay commented Feb 2, 2023

Hi,

Thanks for the great work !
Because 384 is the input size for the Omnidata model and the DTU image size is 1200x1600, if I want to use monocular cues at the original size, can I first resize 1200x1600 -> 1152x1536, get the monocular cues, and then upsample them to 1200x1600?
looking forward to your reply!
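A quick way to sanity-check the intermediate resolution proposed above is to round each side of the 1200x1600 DTU frame down to the nearest multiple of 384. A minimal sketch (`omnidata_friendly_size` is a hypothetical helper, not part of the monosdf repo):

```python
def omnidata_friendly_size(h, w, patch=384):
    """Round each side down to the nearest multiple of `patch`."""
    return (h // patch) * patch, (w // patch) * patch

# DTU frames are 1200x1600; flooring each side to a multiple of 384
# gives 1152x1536, the intermediate size proposed in the question.
print(omnidata_friendly_size(1200, 1600))  # (1152, 1536)
```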

@niujinshuchong (Member)

Hi, we simply resize the monocular outputs to 1200x1200 with padding for the 1200x1600 DTU images. You can check it here: https://github.com/autonomousvision/monosdf/blob/main/preprocess/paded_dtu.py.
Another way to get high-resolution monocular priors can be found here: https://github.com/autonomousvision/monosdf#high-resolution-cues.


UestcJay commented Feb 3, 2023

I still have a question: is my method more convenient than the approach in paded_dtu.py, since there is no need to modify the camera parameters?

@niujinshuchong (Member)

You could just try it out.

@UestcJay (Author)

Over how many experiments is the Chamfer distance (CD) value on the DTU dataset reported in the paper averaged?

@niujinshuchong (Member)

It's averaged over 15 scenes.

@Wuuu3511

Hello! I'd like to ask a question.
The Omnidata model is trained with img_size 384. Can it support input at any image resolution, such as 1152x1536?
Thank you!

@UestcJay (Author)

Yes, as long as the height and width are multiples of 384.
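If the cues are predicted at a multiple-of-384 resolution such as 1152x1536, they still need to be brought back to the original 1200x1600 before use. A minimal nearest-neighbour upsampling sketch in NumPy (the helper name is hypothetical; bilinear interpolation would work similarly):

```python
import numpy as np

def upsample_nearest(cue, out_h, out_w):
    """Nearest-neighbour resize of an (H, W) cue map to (out_h, out_w)."""
    h, w = cue.shape
    rows = np.arange(out_h) * h // out_h   # source row for each output row
    cols = np.arange(out_w) * w // out_w   # source col for each output col
    return cue[np.ix_(rows, cols)]

depth = np.random.rand(1152, 1536).astype(np.float32)  # cue at Omnidata-friendly size
full = upsample_nearest(depth, 1200, 1600)
print(full.shape)  # (1200, 1600)
```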

@Wuuu3511

Thank you very much for your reply!
I tried using 512x640 images as input, and the Omnidata model also returns a 512x640 depth map. An image of this size is not a multiple of 384; does this result in a larger depth error?

@liuxiaozhu01

Hello! I've got a question here.
I am wondering whether the resolution of the RGB images and the depth and normal cues impacts the reconstruction result, and if so, why?
Thank you for your reply! My experimental results really confused me.

@niujinshuchong (Member)

Hi, Omnidata is not trained on high-resolution images, so it's not clear whether it generalises in this case, and the reconstruction results might vary scene by scene.
