
enable coarse-to-fine training #2984

Merged: 2 commits, Mar 11, 2024
Conversation

@jb-ye (Collaborator) commented Mar 5, 2024

In the original Inria GS code, coarse-to-fine training was never implemented, because the authors didn't observe benefits for training images no larger than 1600 pixels. However, many users want to train splatfacto at very high resolutions (e.g. 4k images). I found that when training at high resolutions, the optimization very likely gets stuck in local minima (for various reasons, such as SfM errors, thin structures, aliasing, etc.): many fine details are not reconstructed properly through densification of Gaussians. One counter-intuitive phenomenon is that when training at higher resolution, splatfacto actually creates fewer Gaussians.

One way to work around this issue is to re-enable coarse-to-fine training in splatfacto. I found that the default resolution_schedule=250 is too short to recover fine details, so I set it to 3000 to allow more densification while training on coarser images. The change also significantly shortens the time to finish the first 6k iterations.

I experimented with the mip360 dataset's bicycle and garden scenes, and some private datasets, and found the setting can meaningfully improve metrics, and also visual quality, when training with images at about 1k resolution or higher. I'd also like to hear from other users about this setting.
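For reference, the coarse-to-fine schedule amounts to a step-wise image downscale factor. The function below is a minimal sketch (not the actual splatfacto code), assuming the factor starts at `2**num_downscales` and halves every `resolution_schedule` steps until full resolution:

```python
def downscale_factor(step: int, num_downscales: int = 2, resolution_schedule: int = 3000) -> int:
    """Image downscale factor at a given training step (a sketch, not splatfacto's code).

    Starts at 2**num_downscales and halves every `resolution_schedule`
    steps until the images are used at full resolution (factor 1).
    """
    return 2 ** max(num_downscales - step // resolution_schedule, 0)

# With the defaults above, training runs at 1/4 resolution for the first
# 3000 steps, 1/2 resolution until step 6000, then full resolution.
print([downscale_factor(s) for s in (0, 2999, 3000, 5999, 6000, 30000)])
# → [4, 4, 2, 2, 1, 1]
```

This also explains the speedup on the early iterations: the first 6k steps rasterize against images at 1/4 and 1/2 resolution rather than full size.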

@Zunhammer (Contributor)

Hi, and thanks for your efforts in testing this improvement. I've tested with a single high-res dataset (slightly above 4k) but couldn't really confirm better visual results. PSNR oscillates a lot in this dataset, which is why I cannot compare those values. I think it might also depend on how the images were taken/aligned? Could you share some more details about the datasets where you see improvements?

@ichsan2895

Interesting. If I have free time, I will try this implementation and share the results.

@ichsan2895 commented Mar 11, 2024

My experiment:

Desolation dataset

- Image size: 1899x1064
- Already processed with `colmap image_undistorter`, then `sparse_pc.ply` and `transforms.json` were recreated.
- Using `splatfacto-big` and the antialiased rasterizer.

Blue: without coarse-to-fine training

```shell
ns-train splatfacto-big --logging.steps-per-log 200 --vis viewer+wandb --viewer.websocket-port 7007 \
    --pipeline.model.rasterize_mode antialiased \
    nerfstudio-data \
    --data path/to/scene --downscale-factor 1
```

Red: with coarse-to-fine training

```shell
ns-train splatfacto-big --logging.steps-per-log 200 --vis viewer+wandb --viewer.websocket-port 7007 \
    --pipeline.model.rasterize_mode antialiased \
    --pipeline.model.resolution-schedule 3000 \
    --pipeline.model.num-downscales 2 \
    nerfstudio-data \
    --data path/to/scene --downscale-factor 1
```

image

@ichsan2895 commented Mar 11, 2024

My second experiment:

Truck dataset

- Image size: 3904x2176 (enlarged 4x with Waifu4X super-resolution)
- Already processed with `colmap image_undistorter`, then `sparse_pc.ply` and `transforms.json` were recreated.
- Using `splatfacto-big` and the classic rasterizer.

Blue: without coarse-to-fine training

```shell
ns-train splatfacto-big --logging.steps-per-log 200 --vis viewer+wandb --viewer.websocket-port 7007 \
    nerfstudio-data \
    --data path/to/scene --downscale-factor 1
```

Red: with coarse-to-fine training

```shell
ns-train splatfacto-big --logging.steps-per-log 200 --vis viewer+wandb --viewer.websocket-port 7007 \
    --pipeline.model.resolution-schedule 3000 \
    --pipeline.model.num-downscales 2 \
    nerfstudio-data \
    --data path/to/scene --downscale-factor 1
```

image

As @jb-ye said:

> One counter-intuitive phenomenon is that when training at higher resolution, splatfacto actually creates fewer Gaussians.

Yes, that seems right: the lower-resolution run (976x544) ends up with more Gaussians than both the current repo and this PR. Please see the black line:
image

@kerrj (Collaborator) left a review:

lgtm!

@kerrj kerrj enabled auto-merge (squash) March 11, 2024 17:45
@kerrj kerrj merged commit 8e0c687 into nerfstudio-project:main Mar 11, 2024
2 checks passed
ichsan2895 added a commit to ichsan2895/nerfstudio that referenced this pull request Mar 12, 2024
@lxzbg commented Mar 13, 2024

Good work! I have similar findings: at high resolution, when shooting a high-frequency signal (the sofa part), the downsampled image produces more Gaussians (left: original, right: downsampled).

[images]

@jb-ye (Collaborator, Author) commented Mar 13, 2024

> Good work! I have similar findings: at high resolution, when shooting a high-frequency signal (the sofa part), the downsampled image produces more Gaussians (left: original, right: downsampled).

Did this PR improve your results at the original resolution?

@lxzbg commented Mar 14, 2024

> Did this PR improve your results at the original resolution?

I haven't tried it yet. I'll let you know when I get the results.

@ichsan2895 commented Mar 17, 2024

My third experiment:

Purancak dataset (private dataset)

- Image size: 2000x1500
- Already processed with `ns-process-data image --data /path/to/images --output-dir /path/to/output --skip-image-processing`, then the original images were copied to the output dir.
- Using `splatfacto` and the classic rasterizer.

Blue: without coarse-to-fine training

```shell
ns-train splatfacto --logging.steps-per-log 200 --vis viewer+wandb --viewer.websocket-port 7007 \
    nerfstudio-data \
    --data path/to/scene --downscale-factor 1
```

Red: with coarse-to-fine training

```shell
ns-train splatfacto --logging.steps-per-log 200 --vis viewer+wandb --viewer.websocket-port 7007 \
    --pipeline.model.resolution-schedule 3000 \
    --pipeline.model.num-downscales 2 \
    nerfstudio-data \
    --data path/to/scene --downscale-factor 1
```

image

It gives a bit of an improvement so far in my third experiment. @jb-ye

Michael-Spleenlab pushed a commit to Michael-Spleenlab/nerfstudio that referenced this pull request Apr 26, 2024
5 participants