Issues faced during running the training and test scripts #5

kkaytekin · 2023-07-06T17:15:10Z

Hello again,
I would like to share some issues that I faced while running the training script. Note that I have prepared the datasets as explained in the R2D2 repository.

Mismatch of dimensions during det_loss calculation:
While executing this line I get

RuntimeError: The size of tensor a (64) must match the size of tensor b (65) at non-singleton dimension 1

I solved this by replacing line 357 as follows:

        elif self.detloss in ['ce']:
            # det_loss = self.det_loss(pred_score=output["semi"], gt_score=output["gt_semi"], weight=output["weight"],
            #                          stability_map=None)
            det_loss = self.det_loss(pred_score=output["semi"], gt_score=output["gt_semi_norm"], weight=output["weight"],
                                     stability_map=None)

I think this error is caused by passing the wrong values. In the inputs, we have

output["gt_semi"].shape = (4,64,64,64) (==gt_score)
output["semi"].shape = (4,65,64,64) (==pred_score)

In the output dict we also have

output["gt_semi_norm"] with shape (4,65,64,64)

So I replaced gt_semi with gt_semi_norm, which has matching dimensions. I am not sure whether this is a valid solution.
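The mismatch described above can be caught before the loss is computed. The following is a minimal sketch, not the repository's code: `pick_matching_target` is a hypothetical helper, and `SimpleNamespace` stands in for anything with a `.shape` attribute (such as a torch tensor). It only selects the ground-truth entry whose shape matches the prediction; whether `gt_semi_norm` is the semantically correct target remains an open question.

```python
from types import SimpleNamespace


def pick_matching_target(output, pred_key, candidate_keys):
    """Return the first candidate key whose shape equals the prediction's shape.

    `output` maps names to objects exposing a `.shape` attribute.
    Raises ValueError listing all candidate shapes if none matches.
    """
    pred_shape = tuple(output[pred_key].shape)
    for key in candidate_keys:
        if tuple(output[key].shape) == pred_shape:
            return key
    shapes = {key: tuple(output[key].shape) for key in candidate_keys}
    raise ValueError(
        f"no target matches {pred_key} with shape {pred_shape}; candidates: {shapes}"
    )


# Shapes copied from the report above; SimpleNamespace stands in for tensors.
output = {
    "semi": SimpleNamespace(shape=(4, 65, 64, 64)),
    "gt_semi": SimpleNamespace(shape=(4, 64, 64, 64)),
    "gt_semi_norm": SimpleNamespace(shape=(4, 65, 64, 64)),
}
chosen = pick_matching_target(output, "semi", ["gt_semi", "gt_semi_norm"])
print(chosen)
```

Failing with an explicit message that names the keys and shapes is easier to debug than the broadcasting error raised inside the loss.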

Learning rate decay parameters are not specified:
In trainer.py, line 166, the interpreter complains that self.args.decay_rate and self.args.decay_iter cannot be found. Indeed, they are specified neither in the argparser nor in the config file. The workaround for now is to disable learning rate decay by replacing line 166 with

#lr = min(self.args.lr * self.args.decay_rate ** (self.iteration - self.args.decay_iter), self.args.lr)
lr = self.args.lr

I think this change will prevent us from replicating the results in the paper.
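A less invasive variant of this workaround keeps the decay formula and only falls back to a constant rate when the parameters are absent. This is a sketch under assumptions: `current_lr` is a hypothetical helper, `args` is any object with an `lr` attribute, and the numeric values below are illustrative, not the values used in the paper.

```python
from types import SimpleNamespace


def current_lr(args, iteration):
    """Apply the decay formula from the commented-out line above.

    Falls back to the constant base rate when decay_rate or decay_iter
    is not configured on `args`.
    """
    decay_rate = getattr(args, "decay_rate", None)
    decay_iter = getattr(args, "decay_iter", None)
    if decay_rate is None or decay_iter is None:
        return args.lr
    # Before decay_iter the exponent is negative, so min() clamps to args.lr.
    return min(args.lr * decay_rate ** (iteration - decay_iter), args.lr)


# Illustrative values only (hypothetical, not taken from the repository).
args_without_decay = SimpleNamespace(lr=1e-4)
args_with_decay = SimpleNamespace(lr=1e-4, decay_rate=0.5, decay_iter=10)

print(current_lr(args_without_decay, 100))
print(current_lr(args_with_decay, 5))
print(current_lr(args_with_decay, 11))
```

With this guard the script runs either way, and decay starts to apply as soon as both parameters are supplied.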

Also, while running the test script test_aachenv_1_1 there are some matters I would like to mention:

I am not sure whether to use the Aachen dataset that we prepared for training, or the Aachen v1.1 dataset that can be found online (for example, I downloaded it from here, as mentioned in the readme file). Since the datasets might differ, are there any specific preprocessing steps I should follow to reproduce your results?
In line 31 of the test script, the file pairs-db-covis20.txt is missing. I found it here, but since this file and the Aachen v1.1 database come from different sources, I wanted to ask whether there is another source I should download the Aachen v1.1 dataset from, perhaps one that already includes this file.
We need to specify an outputs folder as shown here. Does that mean I should first run some other script to do inference and collect the results under the outputs folder I specified?
The file aachen_db_imglist.txt is missing here. A Google search for this file was not successful.
The file day_night_time_queries_with_intrinsics.txt is missing here. A Google search for this file was not successful. The Aachen v1.1 dataset I mentioned above only has night_time_queries_with_intrinsics.txt.
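To see all missing inputs at once rather than one crash at a time, a small pre-flight check can be run before the test script. This is only a sketch: `find_missing` is a hypothetical helper, `path/to/aachen_v1.1` is a placeholder root, and placing all three files directly under that root is an assumption, since the test script may expect them elsewhere.

```python
from pathlib import Path


def find_missing(root, relative_paths):
    """Return the entries of relative_paths that are not files under root."""
    root = Path(root)
    return [p for p in relative_paths if not (root / p).is_file()]


# File names taken from the list above; the directory layout is assumed.
REQUIRED_FILES = [
    "pairs-db-covis20.txt",
    "aachen_db_imglist.txt",
    "day_night_time_queries_with_intrinsics.txt",
]

for name in find_missing("path/to/aachen_v1.1", REQUIRED_FILES):
    print(f"missing: {name}")
```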
Thank you very much and best regards,

meng152634 · 2023-08-17T01:02:19Z

same problem...

1561213 · 2023-09-12T13:38:59Z

same problem.

1561213 · 2023-09-13T02:28:46Z

And in trainer.py, line 385,
eval_out = self.eval_on_data() calls a method that is apparently not defined, so I get

AttributeError: 'Trainer' object has no attribute 'eval_on_data'

Thanks.
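Until the method is provided, training can be kept running by skipping evaluation when `eval_on_data` does not exist. The sketch below is a workaround under assumptions, not the author's fix: the `Trainer` class here is a reduced stand-in, and `run_eval_if_available` and the placeholder metric are hypothetical names.

```python
class Trainer:
    """Reduced stand-in for the repository's Trainer; only the guard matters."""

    def run_eval_if_available(self):
        # Look the method up dynamically so a missing implementation
        # does not raise AttributeError.
        eval_fn = getattr(self, "eval_on_data", None)
        if callable(eval_fn):
            return eval_fn()
        print("eval_on_data is not implemented; skipping evaluation")
        return None


class TrainerWithEval(Trainer):
    """Hypothetical subclass showing the path taken once the method exists."""

    def eval_on_data(self):
        return {"placeholder_metric": 0.0}


print(Trainer().run_eval_if_available())
print(TrainerWithEval().run_eval_if_available())
```

This only avoids the crash; no validation metrics are produced until a real `eval_on_data` is implemented.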

XZYuann · 2023-10-14T10:00:09Z

same problem.

pQWQq · 2023-10-23T07:40:53Z

I found that in the March 9th version of the code, config_train_r2d2.json sets decay_rate=0.99996 and decay_iter=80000.
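If these values are correct, they could be exposed as command-line defaults so that the decay line in trainer.py finds them. This is a sketch, assuming the training script uses `argparse`; the `--lr` default is a hypothetical placeholder, and only the two decay values come from the config mentioned above.

```python
import argparse


def build_parser():
    """Parser with the two decay parameters reported for the March 9th config."""
    parser = argparse.ArgumentParser()
    # Hypothetical base learning rate, used only to make the example runnable.
    parser.add_argument("--lr", type=float, default=1e-4)
    parser.add_argument("--decay_rate", type=float, default=0.99996)
    parser.add_argument("--decay_iter", type=int, default=80000)
    return parser


args = build_parser().parse_args([])

# Same formula as the commented-out line in the original post.
for iteration in (0, 80000, 90000):
    lr = min(args.lr * args.decay_rate ** (iteration - args.decay_iter), args.lr)
    print(iteration, lr)
```

With this formula the rate stays at the base value until `decay_iter` and decays exponentially afterwards.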

zhengshunkai · 2023-11-24T07:38:56Z

+1

zhengshunkai · 2023-11-24T09:18:59Z

This txt file may be the right one:
https://github.com/cvg/Hierarchical-Localization/blob/master/pairs/aachen_v1.1/pairs-db-covis20.txt

zhengshunkai · 2023-11-25T07:05:45Z

aachen_db_imglist.txt is not used;
day_night_time_queries_with_intrinsics.txt may be the day and night query lists combined
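If that guess is right, the combined file could be produced by concatenating the two per-condition lists. This is a sketch under assumptions: `merge_query_lists` is a hypothetical helper, the existence of a `day_time_queries_with_intrinsics.txt` file is assumed (the thread only confirms the night-time one), and it is not confirmed that the test script expects exactly this concatenation.

```python
from pathlib import Path


def merge_query_lists(day_file, night_file, out_file):
    """Concatenate two query lists, day first, dropping blank lines.

    Returns the number of query lines written to out_file.
    """
    lines = []
    for src in (day_file, night_file):
        for line in Path(src).read_text().splitlines():
            if line.strip():
                lines.append(line.strip())
    Path(out_file).write_text("\n".join(lines) + "\n")
    return len(lines)


# Example call (paths are placeholders):
# merge_query_lists(
#     "queries/day_time_queries_with_intrinsics.txt",
#     "queries/night_time_queries_with_intrinsics.txt",
#     "queries/day_night_time_queries_with_intrinsics.txt",
# )
```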

zhengshunkai · 2023-11-25T07:07:31Z

day_night_time_queries_with_intrinsics.txt

Inverse-function · 2023-12-21T13:18:30Z

that's why?

eronez · 2024-04-23T12:34:15Z

And in trainer.py, line 385, eval_out = self.eval_on_data() calls a method that is apparently not defined, so I get

AttributeError: 'Trainer' object has no attribute 'eval_on_data'

Thanks.

@1561213
Same issue as you.
Have you solved this problem?

Inverse-function · 2024-04-24T06:27:11Z

I am pleasantly surprised to receive your reply. There are still many issues that have not been resolved, such as the path settings for colmap and many files used for evaluation on the Aachen dataset. Could you please package your project and send it to me? Thanks.

feixue94 · 2024-04-25T09:06:02Z

Hi,

Thank you for your interest in our work. I will fix these bugs and update the code.

liutao23 · 2024-05-17T04:59:52Z

that's why?

Your mmseg version is too high. You can choose a compatible one by following: https://mmsegmentation.readthedocs.io/zh-cn/0.x/faq.html
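To check quickly which release line is installed, the version can be read from package metadata without importing mmseg. This is a sketch under assumptions: the helper names are hypothetical, the PyPI distribution name `mmsegmentation` is assumed, and treating "0.x" as the required line is inferred from the linked FAQ rather than stated by the repository.

```python
from importlib import metadata


def major_version(version_string):
    """Return the leading integer of a dotted version string, e.g. '0.30.0' -> 0."""
    return int(version_string.split(".")[0])


def installed_is_0x(package="mmsegmentation"):
    """True if the installed package is a 0.x release, False if newer,
    None if the package is not installed."""
    try:
        return major_version(metadata.version(package)) == 0
    except metadata.PackageNotFoundError:
        return None


print(installed_is_0x())
```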

Inverse-function · 2024-05-20T03:03:23Z

that's why?

Your mmseg version is too high. You can choose a compatible one by following: https://mmsegmentation.readthedocs.io/zh-cn/0.x/faq.html

thank you

Adolfhill · 2024-06-19T06:43:20Z

Hi guys, have you solved the issue of the missing implementation of eval_on_data?
