Training issue due to environment.yml file is not updated #35

davidvct · 2023-08-10T04:36:13Z

The environment.yml file not updated, and has the exploding gradient issue.

I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

yhlleo · 2023-08-15T02:11:37Z

Yes, the environment is somehow a bit old. I am thinking about to update the project in some days.

citystrawman · 2024-03-15T08:42:40Z

The environment.yml file not updated, and has the exploding gradient issue.

I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

Hi, did you just change python and pytorch version? I changed these two but still failed to install these dependencies. I am not sure what is wrong here.

davidvct · 2024-03-16T12:27:53Z

The environment.yml file not updated, and has the exploding gradient issue.
I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

Hi, did you just change python and pytorch version? I changed these two but still failed to install these dependencies. I am not sure what is wrong here.

Yes, I only changed those for installation. Could you paste your error message so that others can better understand the error? Are you install on Mac/Win/Linux?

citystrawman · 2024-03-17T06:05:19Z

The environment.yml file not updated, and has the exploding gradient issue.
I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

Hi, did you just change python and pytorch version? I changed these two but still failed to install these dependencies. I am not sure what is wrong here.

Yes, I only changed those for installation. Could you paste your error message so that others can better understand the error? Are you install on Mac/Win/Linux?

I just managed to make it. But I got another problem saying that result type Float can't be cast to the desired output type Long at File "D:\Program Files\anaconda3\envs\deepsegmentlh\lib\site-packages\torch\nn\functional.py", line 3165, in binary_cross_entropy_with_logits . Is that what you mentioned in your post?

davidvct · 2024-03-18T06:32:12Z

The environment.yml file not updated, and has the exploding gradient issue.
I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

Hi, did you just change python and pytorch version? I changed these two but still failed to install these dependencies. I am not sure what is wrong here.

Yes, I only changed those for installation. Could you paste your error message so that others can better understand the error? Are you install on Mac/Win/Linux?

I just managed to make it. But I got another problem saying that result type Float can't be cast to the desired output type Long at File "D:\Program Files\anaconda3\envs\deepsegmentlh\lib\site-packages\torch\nn\functional.py", line 3165, in binary_cross_entropy_with_logits . Is that what you mentioned in your post?

I think it might be, but not quite sure as it was quite some time ago. Maybe try changing the file deepcrack_model.py as mentioned in my first post, and see if it would fix your issue.

citystrawman · 2024-03-18T08:11:42Z

The environment.yml file not updated, and has the exploding gradient issue.
I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

Hi, did you just change python and pytorch version? I changed these two but still failed to install these dependencies. I am not sure what is wrong here.

Yes, I only changed those for installation. Could you paste your error message so that others can better understand the error? Are you install on Mac/Win/Linux?

I just managed to make it. But I got another problem saying that result type Float can't be cast to the desired output type Long at File "D:\Program Files\anaconda3\envs\deepsegmentlh\lib\site-packages\torch\nn\functional.py", line 3165, in binary_cross_entropy_with_logits . Is that what you mentioned in your post?

I think it might be, but not quite sure as it was quite some time ago. Maybe try changing the file deepcrack_model.py as mentioned in my first post, and see if it would fix your issue.

Thank you. I've been able to run the program. However, I am not quite clear about the logic of testing for this program: At first I thought test_img folder should be the images for testing, while test_lab folder should be the testing results, however, when I run the program, the testing process requires not only test_img, but also the counterpart img in test_lab which is the identified cracks of test_img.

davidvct · 2024-03-20T02:09:45Z

The environment.yml file not updated, and has the exploding gradient issue.
I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

Hi, did you just change python and pytorch version? I changed these two but still failed to install these dependencies. I am not sure what is wrong here.

Yes, I only changed those for installation. Could you paste your error message so that others can better understand the error? Are you install on Mac/Win/Linux?

I just managed to make it. But I got another problem saying that result type Float can't be cast to the desired output type Long at File "D:\Program Files\anaconda3\envs\deepsegmentlh\lib\site-packages\torch\nn\functional.py", line 3165, in binary_cross_entropy_with_logits . Is that what you mentioned in your post?

I think it might be, but not quite sure as it was quite some time ago. Maybe try changing the file deepcrack_model.py as mentioned in my first post, and see if it would fix your issue.

Thank you. I've been able to run the program. However, I am not quite clear about the logic of testing for this program: At first I thought test_img folder should be the images for testing, while test_lab folder should be the testing results, however, when I run the program, the testing process requires not only test_img, but also the counterpart img in test_lab which is the identified cracks of test_img.

test_lab should be the mask for test images. Refer to this link.

citystrawman · 2024-03-20T06:19:19Z

The environment.yml file not updated, and has the exploding gradient issue.
I changed to python 3.8 and Pytorch 2.0.1, and the training ran smoothly. One other thing need to update is the tensor self.label needs to be converted to float before feed into self.criterionSeg, in file deepcrack_model.py.

Hi, did you just change python and pytorch version? I changed these two but still failed to install these dependencies. I am not sure what is wrong here.

Yes, I only changed those for installation. Could you paste your error message so that others can better understand the error? Are you install on Mac/Win/Linux?

I just managed to make it. But I got another problem saying that result type Float can't be cast to the desired output type Long at File "D:\Program Files\anaconda3\envs\deepsegmentlh\lib\site-packages\torch\nn\functional.py", line 3165, in binary_cross_entropy_with_logits . Is that what you mentioned in your post?

I think it might be, but not quite sure as it was quite some time ago. Maybe try changing the file deepcrack_model.py as mentioned in my first post, and see if it would fix your issue.

Thank you. I've been able to run the program. However, I am not quite clear about the logic of testing for this program: At first I thought test_img folder should be the images for testing, while test_lab folder should be the testing results, however, when I run the program, the testing process requires not only test_img, but also the counterpart img in test_lab which is the identified cracks of test_img.

test_lab should be the mask for test images. Refer to this link.

As I understand, the masks(labels) for test images are output of a test procedure, so test_lab should be empty before testing. but when I run the program, it will pop out error if test_lab is empty. Please point out if I have some misunderstanding. Thanks.

davidvct · 2024-03-20T09:08:38Z

yes, you stored in the image you want to predict in test_img folder, and test_lab is for groundtruth.

The predicted result should be the 'fused' file in .\results folder, after executing sh ./scripts/test_deepcrack.sh <gpu_id>

GoroYeh-HRI · 2024-04-09T22:00:17Z

yes, you stored in the image you want to predict in test_img folder, and test_lab is for groundtruth.

The predicted result should be the 'fused' file in .\results folder, after executing sh ./scripts/test_deepcrack.sh <gpu_id>

@davidvct Thank you for the explanation!
I wonder if we want to predict for images that "do not have ground-truth", is this doable?
For example, if I have a set of satellite images but do not have the segmentation of those images, can I run test script to predict the result?
If yes, how should I prepare the testing dataset?
Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training issue due to environment.yml file is not updated #35

Training issue due to environment.yml file is not updated #35

davidvct commented Aug 10, 2023

yhlleo commented Aug 15, 2023

citystrawman commented Mar 15, 2024

davidvct commented Mar 16, 2024

citystrawman commented Mar 17, 2024 •

edited

Loading

davidvct commented Mar 18, 2024

citystrawman commented Mar 18, 2024

davidvct commented Mar 20, 2024

citystrawman commented Mar 20, 2024

davidvct commented Mar 20, 2024

GoroYeh-HRI commented Apr 9, 2024

Training issue due to environment.yml file is not updated #35

Training issue due to environment.yml file is not updated #35

Comments

davidvct commented Aug 10, 2023

yhlleo commented Aug 15, 2023

citystrawman commented Mar 15, 2024

davidvct commented Mar 16, 2024

citystrawman commented Mar 17, 2024 • edited Loading

davidvct commented Mar 18, 2024

citystrawman commented Mar 18, 2024

davidvct commented Mar 20, 2024

citystrawman commented Mar 20, 2024

davidvct commented Mar 20, 2024

GoroYeh-HRI commented Apr 9, 2024

citystrawman commented Mar 17, 2024 •

edited

Loading