Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Poor Genre results on demo images when compiled with Cuda10 #71

Open
shubham-goel opened this issue Feb 24, 2021 · 6 comments
Open

Poor Genre results on demo images when compiled with Cuda10 #71

shubham-goel opened this issue Feb 24, 2021 · 6 comments

Comments

@shubham-goel
Copy link

Hi,

I'm trying to run Genre on a machine with CUDA10 + 2080Tis (gencode arch 75). As far as I know, CUDA 9 cannot compile for gencode arch 75 hardware targets.

Therefore, I setup Genre in a singularity container containing pytorch0.5.0a + Cuda10.0 like this. This commit shows the minimal changes I had to make to the toolbox/*/setup.sh files.

Now, Genre (scripts/test_genre.sh) runs without any errors but the reconstructed 3D shapes look very poor. Do you have any idea why this would happen?

Here is the generated output/test_genre_full_model directory. Visualizations of input, predicted meshes:

0003_00_rgb
Screen Shot 2021-02-23 at 8 17 02 PM

0002_00_rgb
Screen Shot 2021-02-23 at 8 17 51 PM

@ztzhang
Copy link
Collaborator

ztzhang commented Feb 24, 2021

Thanks for bringing up the issue.
I'm under the impression that pytorch 0.4.1 does not support cuda10 and I can't find a pytorch 0.5.0a version.
Is 0.5.0a build from source, according to this issue?

@xiumingzhang
Copy link
Owner

Hi, could it be because you had too high of a threshold in visualizing the voxels?

@shubham-goel
Copy link
Author

shubham-goel commented Feb 24, 2021

Shouldn't be a voxel threshold issue because I'm using the default threshold for standard demo images by running scripts/test_grenre.sh.

Re: pytorch version, you're right that pytorch 0.4.1 does not officially provide any prebuilt binaries for cuda10. But it should still be possible to compile pytorch 0.4.1 from source with Cuda10. This nvidia container that I'm using does exactly that! I was confused why the torch.__version__ was 0.5.0a in the container but that issue explains it.

@ztzhang
Copy link
Collaborator

ztzhang commented Feb 24, 2021

In this case, it seems the only way for us to trace this issue is by reproducing the results you have here with that specific environment. I'll try to find sometime this weekend to look into this issue.

@ztzhang
Copy link
Collaborator

ztzhang commented Mar 1, 2021

@shubham-goel I spent some time setting up the environment on our cluster, but haven't gone through the testing. I'll ping in this thread when I have some updates.

@shubham-goel
Copy link
Author

Thanks @ztzhang! In interest of time, I setup Genre on a labmate's old machine (with CUDA9/1080Ti) yesterday -- so this issue isn't time-critical for me anymore. You may delay working on it, or even close it if you want.

But since the solution to this issue will probably provide a way of running Genre with CUDA10 on the newer-architecture GPUs, it'd still be useful to many others out there! I leave the decision to you. Thanks again!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants