Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training problem help #14

Closed
wasahaiah opened this issue Jun 18, 2024 · 10 comments
Closed

Training problem help #14

wasahaiah opened this issue Jun 18, 2024 · 10 comments

Comments

@wasahaiah
Copy link

Hi, I met a problem about training.

I training the processed dataset with 11 images as test set as illustrated in paper. The training results: training PSNR about 14.5 and the evaluated PSNR is about 25. At the beginning, I doubt the dataset has some problem. But when I evaluate the dataset by your released model(id=1, exp=2), the results are reasonable, i.e. PSNR=31.49 (in the train dataset) and PSRN=30.71(in the test dataset). Can you give me a help?

@aejion
Copy link
Owner

aejion commented Jun 19, 2024

Could you provide more details about your training configuration and results, such as the config file, the number of GPUs, and the rendered images?

@wasahaiah
Copy link
Author

Could you provide more details about your training configuration and results, such as the config file, the number of GPUs, and the rendered images?

Thank you for your reply. The config file is almost the same as the file "train_neuface.yaml", except that param 'num_gpus' is set to be 1, because I have only one nvidia 3090D chip. Two images in the tensorboard, are shown below:

Screenshot 2024-06-20 at 08-06-13 TensorBoard
imageData

The evaluation results (id=1, exp=2, image:045)are shown below:
1_045_albedo
1_045_diffuse
1_045_fit
1_045_normal
1_045_spec

@aejion
Copy link
Owner

aejion commented Jun 20, 2024

If you are using a single GPU to train NeuFace, it is recommended to increase the num_pixels in the config file. Note that the PSNR shown in TensorBoard is not accurate due to the calibration code. For an accurate PSNR evaluation, please run eval.py. Additionally, as observed in TensorBoard, the render result appears much better than the evaluation. This might be because the wrong ckpt was used when running eval.py.

@wasahaiah
Copy link
Author

wasahaiah commented Jun 24, 2024

If you are using a single GPU to train NeuFace, it is recommended to increase the num_pixels in the config file. Note that the PSNR shown in TensorBoard is not accurate due to the calibration code. For an accurate PSNR evaluation, please run eval.py. Additionally, as observed in TensorBoard, the render result appears much better than the evaluation. This might be because the wrong ckpt was used when running eval.py.

Hi, when I used 4 GPU(nvidia 4090) to train NueFace, the PSNR is 25.18, still too lower. I increase the 'num_pixels' to 6000 in the config file, and get PSRN about 26. the result is below:
PSNR:26
1_044
1_044_albedo
1_044_diffuse
1_0 1_044_normal 44_fit
1_044_spec

@aejion
Copy link
Owner

aejion commented Jun 24, 2024

Could you please provide the PSNR in detail for each test image?

@wasahaiah
Copy link
Author

wasahaiah commented Jun 25, 2024

Could you please provide the PSNR in detail for each test image?

the PSNR of images 47.png, 49.png, 50.png(large rotation), are small, below 17, others are above 24. And only image of 52.png , is above 33.

in facescape_multi_dataset.py, ply_path is the TU model and not the cut mesh?
I place TU model in the ply_path as suggested.

@aejion
Copy link
Owner

aejion commented Jun 25, 2024

'ply_path' refers to the original TU model, which provides the position of nose.

@wasahaiah
Copy link
Author

wasahaiah commented Jun 27, 2024

'ply_path' refers to the original TU model, which provides the position of nose.

Hi, I have trained a model in 4 V800 GPUs, and get PSNR 33.28 in train set, 26.87 in eval set, with num_pixels: 20000 and sdf_threshold:5e-5.

@aejion
Copy link
Owner

aejion commented Jun 27, 2024

Can you send me your results via email? (zhyzhy@buaa.edu.cn)

@wasahaiah
Copy link
Author

Can you send me your results via email? (zhyzhy@buaa.edu.cn)

Have send it to you. If not received, pls let me know.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants