Crash at shape pre-training #28

Closed
Woolseyyy opened this issue Nov 8, 2022 · 7 comments

@Woolseyyy

I am trying to reproduce the results but ran into a problem at 'I. Shape Pre-Training'. The script crashes during the validation step of shape pre-training. It looks like an OOM issue because the log says "killed", and the crash goes away if I set shuffle_buffer_size=False in shape.ini. Any suggestions would help!

I am using a machine with four 3090 GPUs, 12 CPU cores, and 60 GB of memory. My dataset has 100 training views and 7 validation views. The surf_root directory contains 120 test, 99 train, and 99 val entries.

@Woolseyyy changed the title from "Very Slow and Easy to crash at shape pre-training" to "Crash at shape pre-training" on Nov 8, 2022
@xiumingzhang
Collaborator

Try reducing the MLP chunk size? That should solve your OOM problem.

Also, consider trying out https://github.com/nerfstudio-project/nerfstudio as a drop-in replacement for NeRFactor's "Shape Pre-Training." nerfstudio will be much faster, and NeRFactor doesn't care who generated the surface points.
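
For readers unfamiliar with the chunk-size suggestion above, here is a minimal, generic sketch of chunked evaluation. It is not NeRFactor's code: `iter_chunks`, `eval_in_chunks`, and `toy_mlp` are hypothetical names, and `toy_mlp` is only a stand-in for a real network. The idea is that a smaller chunk size lowers the number of points processed, and held in intermediate buffers, at any one time.

```python
from typing import Callable, Iterator, List, Sequence


def iter_chunks(points: Sequence[float], chunk_size: int) -> Iterator[Sequence[float]]:
    """Yield consecutive slices holding at most `chunk_size` points."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(points), chunk_size):
        yield points[start:start + chunk_size]


def eval_in_chunks(
    fn: Callable[[Sequence[float]], List[float]],
    points: Sequence[float],
    chunk_size: int,
) -> List[float]:
    """Apply `fn` chunk by chunk and concatenate the results in order."""
    out: List[float] = []
    for chunk in iter_chunks(points, chunk_size):
        # Only one chunk is passed to `fn` at a time.
        out.extend(fn(chunk))
    return out


def toy_mlp(chunk: Sequence[float]) -> List[float]:
    """Hypothetical stand-in for a network forward pass."""
    return [2.0 * x + 1.0 for x in chunk]


if __name__ == "__main__":
    result = eval_in_chunks(toy_mlp, list(range(10)), chunk_size=3)
    print(len(result))
```

The result is the same for any positive chunk size; only the peak amount of work in flight changes.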

@Woolseyyy
Author

Why does the MLP chunk size affect CPU memory? It seems that the MLP chunk size only matters for GPU memory.

@wangmingyang4

Hi! I have encountered the same problem. How did you solve it? @Woolseyyy

@Woolseyyy
Author

I set cache=False for validation and set shuffle_buffer_size=False.
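
To illustrate why a shuffle buffer can cost host memory, here is a small pure-Python sketch of buffer-based shuffling, the general technique that `tf.data.Dataset.shuffle(buffer_size)` is documented to use. It assumes the data pipeline here shuffles this way, which this thread does not confirm; `buffered_shuffle` is a hypothetical name, not a function from the repository.

```python
import random
from typing import Iterable, Iterator, List, Optional, TypeVar

T = TypeVar("T")


def buffered_shuffle(
    items: Iterable[T], buffer_size: int, seed: Optional[int] = None
) -> Iterator[T]:
    """Stream `items` in shuffled order, holding at most `buffer_size` of them."""
    if buffer_size <= 0:
        raise ValueError("buffer_size must be greater than 0")
    rng = random.Random(seed)
    buffer: List[T] = []
    for item in items:
        if len(buffer) < buffer_size:
            # Fill phase: elements are held in memory until the buffer is full.
            buffer.append(item)
            continue
        # Emit a random buffered element and put the new one in its slot.
        idx = rng.randrange(buffer_size)
        yield buffer[idx]
        buffer[idx] = item
    # Drain whatever is left once the input is exhausted.
    rng.shuffle(buffer)
    yield from buffer


if __name__ == "__main__":
    print(sorted(buffered_shuffle(range(20), buffer_size=5, seed=0)) == list(range(20)))
```

Up to `buffer_size` elements stay resident at once, so when each element is large (for example, a full image with its buffers), a large shuffle buffer can dominate CPU memory. With a buffer of size 1, this sketch returns the input order unchanged.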

@wangmingyang4

Does this setting affect the overall experiment?

@Woolseyyy
Author

> Does this setting affect the overall experiment?

Modify the code so that the change only affects validation.

@wangmingyang4

wangmingyang4 commented Nov 30, 2022

I found no shuffle_buffer_size in shape.ini. Do I need to add shuffle_buffer_size = 0, or set no_shuffle = True by modifying base.py?
@Woolseyyy
Setting shuffle_buffer_size = 0 raised an error: "buffer_size must be greater than 0".
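
One possible way to avoid passing a zero buffer size to the shuffle call is to treat 0 or False as "shuffling disabled" and skip the call entirely. The sketch below is an assumption-laden illustration, not the repository's code: `parse_shuffle_buffer_size` and `maybe_shuffle` are hypothetical helpers, the `[DEFAULT]` section name is made up, and how NeRFactor actually parses shape.ini is not shown in this thread. The only dataset method it relies on is `shuffle(buffer_size)`, which `tf.data.Dataset` provides.

```python
import configparser
from typing import Any, Optional


def parse_shuffle_buffer_size(raw: str) -> Optional[int]:
    """Return a positive buffer size, or None when shuffling is disabled."""
    text = raw.strip().lower()
    if text in {"", "false", "none", "no", "off"}:
        return None
    size = int(text)  # Raises ValueError on non-numeric input.
    return size if size > 0 else None


def maybe_shuffle(dataset: Any, buffer_size: Optional[int]) -> Any:
    """Call dataset.shuffle only when a positive buffer size is given."""
    if buffer_size is None:
        return dataset  # Leave the pipeline unshuffled.
    return dataset.shuffle(buffer_size)


if __name__ == "__main__":
    config = configparser.ConfigParser()
    # Hypothetical INI content; the section name is an assumption.
    config.read_string("[DEFAULT]\nshuffle_buffer_size = 0\n")
    raw = config["DEFAULT"].get("shuffle_buffer_size", "False")
    print(parse_shuffle_buffer_size(raw))
```

To restrict the change to validation, as suggested earlier in the thread, the same guard could be applied only when the dataset is built in validation mode.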
