Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

viewer.exe not working #22

Closed
lh-dm opened this issue Feb 13, 2024 · 6 comments
Closed

viewer.exe not working #22

lh-dm opened this issue Feb 13, 2024 · 6 comments

Comments

@lh-dm
Copy link

lh-dm commented Feb 13, 2024

I can't get viewer to work.
I have downloaded "tt_scenes.zip" and unzipped it in the scene folder.
I downloaded "tt_experiments.zip" and unzipped it in the experiments folder.

This is what it currently "experiments" looks like for me:
grafik

But it should look so (???):

    experiments/
        checkpoint_train
        checkpoint_playground
        ...

I am already confused - here are no checkpoints.

I tried it anyway:

D:\TRIPS>D:\TRIPS\build\bin\RelWithDebInfo\viewer.exe --scene_dir D:\Trips\scenes\tt_train
register neural render info
register TnnInfo
Git ref: 2c2a5e1822791a0f53296c8e5c9fa2058a03c4a9

And then it stops... no error... but i found in %AppData% huge crash dump files - each over 1GB:

grafik


Then I started to train...
That didn't work first and stopped at Epoch 1.

complete log:
log.txt

Then I changed the crop-size and this solved:
--TrainParams.train_crop_size 256

I managed to train up to epoch 10.
Epoch 10 was my goal - it's OK for the 1st run.

This is my generated data:

grafik

grafik

The question is now... how can I view it? Nothing seems to work...


My GFX-Card has 8 GB and is during training fully loaded (>7 GB).

@iFimo
Copy link

iFimo commented Feb 14, 2024

Answer to: #23 (comment)

Hi, @dm-de, I will try to answer your questions here to keep these two different topics separate.

Can you also give me some hints?

  1. Can you confirm, that "tt_experiments.zip" works? Here are no checkpoint's in zip.
    See here my post: viewer.exe not working #22

I think the folders themselves that containing the training epochs are the checkpoints. Yes, they are not called checkpoints. That surprised me too, but I think the description in the readme file is just a structure tree.
Maybe the same as mine “dense_point_cloud.ply”. ?

  1. What is your GPU memory usage with viewer.exe & playground scene
    I have a issue with viewer and I'm not sure, if this happen, because I have only 8GB VRAM or because my exe ist compiled wrong. I think this has something to do with saiga & cuda compute capability.
    My special problem is, that I compile on one machine and copy release to another machine (where I have no admin rights).
    For this I need to set cuda version manually... It is hard to find out what is wrong.

From what I currently know, you don't need the experiments folder at all to use the viewer. I'll test this again tomorrow (up until now I had only moved the checkpoints/experiments to a subfolder for testing). I'll throw out the experiments folder completely tomorrow, then I'll try to start the viewer with just the scenes folder present and I'll pay attention to the VRAM. I'll report back tomorrow.

I have a 3090 with 24GB VRAM.

I also suspect that something has been compiled badly for you or is a question of admin rights.

Have you tried to recompile it?

  • Delete the build folder
  • Make sure all dependencies are there. A missing “Nsight NVTX” was my answer for a non-working viewer.
  • With the updated readme file it should actually work. (It did for me)

@lh-dm
Copy link
Author

lh-dm commented Feb 15, 2024

I am disappointed.
I have compiled again.
For some reason cmake has set different cuda versions. I have set everything to cuda 7.5 only...
But it makes no difference.

My 8GB VRAM is probably not enough. :-(
viewer crashes without error message.
It would be great if there was a very slimmed-down viewer.

train on the other hand works (but only with train_crop_size param)
VRAM is hard at the limit 7.8 of 8GB used!

@iFimo
Copy link

iFimo commented Feb 15, 2024

Hey, here are my test results.

VRAM usage:
If you have limited VRAM, don't try the viewer with the "train" or "playground", instead use the "horse" for testing.

Here is my VRAM usage in various scenes (1.1 GB, is already occupied on my system without the viewer, so that's where it comes from):

Train			=1,1 GB + 5,2 GB = 6,3 GB
Playground		=1,1 GB + 5,1 GB = 6,2 GB
Horse			=1,1 GB + 3,5 GB = 4,6 GB

Each scene should work with 8GB of VRAM, but if you have problems try the smaller scenes rather than the largest ones.


Result: If the experiments folder has been removed:

To open a scene, the scenes folder AND the experiments folder must exist. With removed experiments folder it dosnt work.
I also tried putting the experiments themselves in a subfolder and also renaming them to checkpoint_playground. Result: Both use cases seem to work fine, as long as the experiments are somewhere in the experiments folder it should work.

But I have it like in your first screenshot and that works fine.


I have compiled again.

You need:

CUDA 11.8 (make sure to at least include Nsight NVTX, Development/* , Runtime/Libraries/* and the Visual Studio Integration)

  • Reinstall CUDA 11.8 and make sure you install all parts of it. Also click on “Set Windows Paths”.
  • Install cuDNN.
  • In VS2022 I selected the “Desktop Development with C++” libraries, I don’t know if you need them, but it won’t hurt.

These are the things I paid attention to when compiling for the twentieth time, maybe it will help.

Otherwise I don't know what to do next.

@lfranke
Copy link
Owner

lfranke commented Feb 15, 2024

Hi,

yes, the folders in the tt_experiments.zip and tt_scenes.zip are named the same. Sorry for the oversight, I will update this in the supplemental soon. In the meantime, you can rename the folders in the experiments however you want.

Thanks @iFimo, yes the viewer fails if the experiments folder is completely missing, otherwise if the folder exists but no experiments are present just the Model View of the point cloud will start and you will be unable to switch to Neural View.

I just retried on Windows with a RTX2070 and my VRAM measurements were similar to @iFimo.

Maybe try to start the viewer with --render_scale 0.5, this forces the resolution to be lower and should require less VRAM.

@dm-de
Copy link

dm-de commented Feb 17, 2024

I (dm-de = lh-dm) got viewer working without crash.
I copied TRIPS root directory with all files to other computer.
No other tool installation was requred (conda, cuda etc.).
I don't know what was missing or wrong before.

@lfranke
Copy link
Owner

lfranke commented Feb 21, 2024

Hi, that is indeed odd! But I'm happy that it is working for you :)

@lfranke lfranke closed this as completed Feb 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants