Implement nerfacto alpha transparency training #2165
Conversation
@SamDM, @jkulhanek, @f-dy let me know if you have any comments :) I tried to address all the suggestions mentioned in #2025 while keeping the functionality the same.
Force-pushed from a4bf22a to a28c90a.
```python
def blend_background(
    cls,
    image: Tensor,
    outputs: Dict[str, Union[Tensor, list]],
```
Can you please pass the optional `background_color` directly instead of passing the `outputs`?
Also, can you please pass the RGB image and the "opacity" as two separate arguments?
Passed them as separate arguments. Also, it is probably better not to allow optional arguments and to only call this function when we actually want to blend the background. We can resolve this if you agree with removing the optionality of the arguments.
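For reference, a minimal sketch of what the separated-argument version could look like: a standard alpha-over composite, with argument names assumed for illustration rather than taken from the merged code:

```python
from torch import Tensor


def blend_background(image: Tensor, opacity: Tensor, background_color: Tensor) -> Tensor:
    """Composite an RGB image over a background color using its opacity.

    image: [..., 3] RGB values in [0, 1].
    opacity: [..., 1] alpha values in [0, 1] (1 = fully opaque).
    background_color: [3] (or broadcastable) color blended behind the image.
    """
    return image * opacity + background_color * (1.0 - opacity)
```

Passing the color and opacity explicitly keeps the function independent of the `outputs` dictionary layout.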
But shouldn't blending happen always? When `background_color=random` it would use the provided `background_color`, otherwise it would use the renderer's background color? Perhaps I am missing something...
It should only happen if the input data includes an alpha channel. If this is the case, we would also have a `background_color` to pass to this method. So you are right: the `if "background_color" in outputs` check is not needed.
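Concretely, the resolution above might look like this sketch, building on the hypothetical `blend_background` from earlier (the 4-channel check is an assumption):

```python
def maybe_blend_background(image: Tensor, background_color: Tensor) -> Tensor:
    # Blend only when the ground-truth image carries an alpha channel (RGBA).
    if image.shape[-1] == 4:
        rgb, opacity = image[..., :3], image[..., 3:]
        return blend_background(rgb, opacity, background_color)
    return image  # plain RGB: nothing to blend
```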
I remember looking at this a good while ago and thinking that a scoreboard of transient values, like this run's background color selection, could be more generally useful.
The other use case I had in my head was some way to prevent single bad iterations from having a catastrophic impact, maybe pause/stop training if metrics plummet instead of saving a checkpoint.
What are you thinking in terms of a scoreboard? Do you mean having a dedicated object/place to write values like `background_color` to, instead of storing them in `outputs`?
> The other use case I had in my head was some way to prevent single bad iterations from having a catastrophic impact, maybe pause/stop training if metrics plummet instead of saving a checkpoint.
I'm sorry, but I'm not quite following how this relates to always blending the background color if an alpha channel is present. Or do you mean as a use case for the scoreboard idea?
> Do you mean having a dedicated object/place to write values like `background_color` to instead of storing them in `outputs`?
That was the thought, although I'm not sure whether there is a need.
> Or do you mean as a use case for the scoreboard idea?
Yes, this (sorry)
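For concreteness, a purely hypothetical sketch of such a scoreboard (every name here is invented for illustration; this is not existing nerfstudio API):

```python
from typing import Any, Dict


class TransientScoreboard:
    """Holds values that are only valid for the current training iteration."""

    def __init__(self) -> None:
        self._values: Dict[str, Any] = {}

    def set(self, key: str, value: Any) -> None:
        self._values[key] = value

    def get(self, key: str, default: Any = None) -> Any:
        return self._values.get(key, default)

    def clear(self) -> None:
        # Call once per iteration so stale values (e.g. last iteration's
        # background color) cannot leak into the next one.
        self._values.clear()
```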
This seems like something that would warrant a separate pull request that then also implements one of your suggested functionalities.
However, I do see one reason for implementing something like this as part of this PR: all the outputs are currently passed through this resizing operation, which doesn't make much sense for `background_color`. It does not crash, as the background color is a per-image pixel color, but it still isn't great.
Any opinions on this, or separate PR?
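As an illustration of the concern, a hedged sketch of the kind of guard that would skip non-spatial outputs during resizing (loop structure, key name, and tensor layout are all assumptions):

```python
from typing import Dict

import torch
import torch.nn.functional as F

NON_SPATIAL_KEYS = {"background_color"}  # per-image values, not H x W maps


def resize_outputs(
    outputs: Dict[str, torch.Tensor], new_h: int, new_w: int
) -> Dict[str, torch.Tensor]:
    resized = {}
    for key, value in outputs.items():
        if key in NON_SPATIAL_KEYS:
            resized[key] = value  # resizing a per-image color is meaningless
            continue
        # F.interpolate expects [N, C, H, W]; assume float [H, W, C] maps.
        resized[key] = F.interpolate(
            value.permute(2, 0, 1)[None], size=(new_h, new_w), mode="bilinear"
        )[0].permute(1, 2, 0)
    return resized
```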
Separate PR probably - unless it would fit neatly into this one
Do we think this is in a condition to be merged?
If I set the background to random, rendering works fine, but there seems to be a problem with exporting the mesh.
What does it mean that the function is not working properly? Can you please post the error message or something?
I didn't get amazing results with Poisson on that dataset. This isn't dependent on alpha-transparent training though... TSDF fusion works great and is significantly improved by alpha-transparent training. This is actually my sole use case for this feature. Currently on the way to an airport, but I will post some mesh export results once I'm there :)
@nepfaff, can you please add me as a contributor to your fork so I can try implementing some changes? If you are not comfortable with this idea, I can alternatively create a new branch...
If you export a point cloud from a learned model, it will not proceed past 0%, but it will continue to consume hardware resources (GPU). I don't know the exact cause, but I think it's because of the random value in the background.
I did manage to run Poisson, but I can try with a point cloud. Will report the results soon.
Done :)
What I tested is my custom data, but I will train and test again with the mustard data and share the results.
Can you share the script you used to train the mustard data?
Does the command in the description not work?
Thanks! Will try to implement some changes; then let me know if you like them or not...
It would also be great if you could share your data. The more diverse the test data, the better :)
Looks great to me! Will share some of the testing results soon
It looks good to me now. The export results are as desired and as shown in the previous images.
Great! @nepfaff, thank you very much for your work. This has been a pleasure.
I will now merge this into main, ok?
Nice! Just to confirm, this mode is automatic when input images contain alpha, but separate mask images keep the previous non-carving behavior - is that right? Can both be used simultaneously?
Almost. It is only automatic if the images have an alpha channel and the background color is random (random is not the default). Hence, alpha images can be used with the previous behavior if the color is not random.
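For illustration, enabling this on the command line would presumably look like the following; the exact flag spelling is an assumption based on nerfstudio's config conventions, not quoted from this thread:

```bash
ns-train nerfacto --data <data_dir> --pipeline.model.background-color random
```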
Thanks for working on this! Just wondering if this should work with equirectangular images with an alpha channel? The old workflow with separate mask images did not work with equirectangular for some reason. If so, do I just need to add an alpha channel and specify a random background color?

Also, do masks in the alpha channel use less VRAM than having separate masks? I was never able to use those with large datasets, as I would get CUDA out-of-memory errors.
Hello, trying to follow up on this great work. Is there any way to transform the existing data format (nerfstudio's original .png images and camera positions in transforms.json) for alpha-transparent training?
It should work. Just add an alpha channel and specify the random background color as you suggested.
You need to add an alpha channel to your images. This is a 4th channel with values between 0 and 1. For alpha-transparent training, you would use binary values, where zero is transparent. Then additionally specify a random background color as described in the PR description.
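For illustration, a minimal sketch of adding a binary alpha channel from a separate mask image, assuming OpenCV and 8-bit PNGs (file names are placeholders):

```python
import cv2
import numpy as np

# Load the image and a binary mask (255 = keep, 0 = transparent).
bgr = cv2.imread("image.png", cv2.IMREAD_COLOR)      # [H, W, 3] uint8, BGR order
mask = cv2.imread("mask.png", cv2.IMREAD_GRAYSCALE)  # [H, W] uint8

# Binarize the mask and append it as the 4th (alpha) channel.
alpha = np.where(mask > 127, 255, 0).astype(np.uint8)
bgra = np.dstack([bgr, alpha])                       # [H, W, 4]

# PNG stores the alpha channel directly; cv2.imwrite expects BGRA here.
cv2.imwrite("image_rgba.png", bgra)
```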
Using rembg: `rembg p <input.img.folder> <output.img.folder>`
Hey, thanks for getting back to me. I tested a few scenes last night, but got strange results with equirectangular and fisheye. My use case is quite different to the mustard example - I'm masking myself out of 360 equirectangular images, and the black border on circular fisheye images. I exported png images with masks in the alpha from Metashape (black pixels masked), used "ns-process metashape" to create the downscales, and trained nerfacto-huge with a random background color. The masks were definitely doing something, but the colours are strange and the masked objects were still kind of visible in the NeRF. It definitely seems to use less VRAM than separate masks though, as I didn't get CUDA OOM errors for the first time! :)
@gradeeterna to mask yourself out, you shouldn't use these masks ("alpha transparency training" / "alpha carving") but the "ignore" masks:

```bash
cat > add_mask_to_transforms_json.py <<EOF
#!/usr/bin/env python
import sys
import json

if len(sys.argv) != 3:
    print(f"Usage: {sys.argv[0]} input_transforms.json output_transforms.json")
    sys.exit(1)

with open(sys.argv[1]) as input_file:
    file_contents = input_file.read()

parsed_json = json.loads(file_contents)
for frame in parsed_json["frames"]:
    frame["mask_path"] = "masks/mask.png"

with open(sys.argv[2], "w") as output_file:
    json.dump(parsed_json, output_file, indent=4)
EOF
```

```bash
cat > downsize_mask.py <<EOF
#!/usr/bin/env python
import sys
from pathlib import Path

import cv2

if len(sys.argv) != 2:
    print(f"Usage: {sys.argv[0]} path_to/mask.png")
    print("Output is path_to/masks_<downscale>/mask.png")
    sys.exit(1)

mask_path = Path(sys.argv[1])
mask = cv2.imread(str(mask_path), cv2.IMREAD_GRAYSCALE)
height, width = mask.shape[:2]
processed_data_dir = mask_path.parent

downscale_factors = [2, 4, 8]
for downscale in downscale_factors:
    mask_dir = processed_data_dir / f"masks_{downscale}"
    mask_dir.mkdir(exist_ok=True)
    mask_path_i = mask_dir / "mask.png"
    mask_i = cv2.resize(
        mask, (width // downscale, height // downscale), interpolation=cv2.INTER_NEAREST
    )
    cv2.imwrite(str(mask_path_i), mask_i)
    print(f"Wrote {mask_path_i}")
EOF
```
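Usage might then look like this, assuming the full-resolution mask already sits where transforms.json expects it (the exact layout depends on your dataset):

```bash
python add_mask_to_transforms_json.py transforms.json transforms.json
python downsize_mask.py masks/mask.png
```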
@f-dy Hey, using a single ignore mask for all my fisheye images does work, but it slows down training by about 5x. When I use one mask per image with a big dataset, I get out-of-memory errors on my 3090 24GB. I was trying this method in case it was faster and used less memory. The way masks work with NGP is great and doesn't seem to slow down training or increase memory usage. One mask per image goes in the main images folder, and they don't need to be added to the transforms.json. Thanks for sharing those scripts, very useful!
(IIRC there is an option that can be set which puts the masks on the GPU, which significantly speeds up training.)
Oh yeah, just found it - training speed is almost the same as without masks, thanks a lot!
This PR cleans up and replaces #2025.
Test data
Mustard data set with alpha masks.
https://drive.google.com/file/d/1XX4ioj9NgaRoMIA00Negp5x9XM8gWxjD/view?usp=sharing
Results
Accumulation without alpha-transparent training:
Accumulation with alpha-transparent training:
Closes #1498
Closes #2025