
Testing results on SemanticKITTI #13

Open
bobpop1 opened this issue Oct 6, 2022 · 36 comments

Comments

@bobpop1

bobpop1 commented Oct 6, 2022

Thank you for your great work! We achieved the claimed results on the validation set. However, we only achieve 68.2 mIoU on the test set, which is much lower than the claimed 72.9.

@ldkong1205

Hi @bobpop1, could you share with me what scores you got on the validation set? Thanks a lot!

@yanx27
Owner

yanx27 commented Oct 6, 2022

Hi @bobpop1, this checkpoint was trained only on the training set; the results on the test set require a model trained on both the training and validation sets.

@bobpop1
Author

bobpop1 commented Oct 6, 2022

Hi @bobpop1, this checkpoint is gained by training only on the training set, and the results on the test set need to the model training on both training and validation set.

Thanks for your reply. How do you choose the best checkpoint when train on both training and validation set?

@bobpop1
Author

bobpop1 commented Oct 6, 2022

Hi @bobpop1, could you share with me what scores you got on the validation set? Thanks a lot!

The best mIoU is 69.023

@ldkong1205

Hi @bobpop1, could you share with me what scores you got on the validation set? Thanks a lot!

The best miou is 69.023

Thank you for your information!

@yanx27
Owner

yanx27 commented Oct 7, 2022

@bobpop1 Since we use a cosine learning rate scheduler, you can just choose the best or the final checkpoint from training. By the way, I noticed that you only reached 69.023 mIoU on the validation set, which is about 1% lower than ours. Did you correctly use TTA during inference?

@isunLt

isunLt commented Oct 7, 2022

@bobpop1 Since we use cosine learning rate scheduler, you can just choose the best or final checkpoint during the training. By the way, I found out that you only gain 69.023 mIoU on validation set, which is about 1% lower than ours. Did you correctly use TTA during the inference?

I get only 66.03 when testing with num_vote=1 and 68.88 with num_vote=12. I also evaluated the weights you released through Google Drive and got 68.5 with num_vote=1 and 70.05 with num_vote=12. Is the model sensitive to the spconv version?

The environment is listed as follows:
python 3.9.13
pytorch 1.12.0 (py3.9_cuda11.3_cudnn8.3.2_0)
pytorch-lightning 1.3.8
torchmetrics 0.5
torch-scatter 2.0.9
spconv-cu114 2.2.3 (spconv-cu114 is much faster than spconv-cu111 during training)

Training was done on a single RTX A6000 with cuda=11.4. I followed the default settings but cannot reach the same performance (68.5) as yours. Do you have any advice? Thanks in advance!

@yanx27
Owner

yanx27 commented Oct 7, 2022

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The result of 70.05 is obtained by fine-tuning the model with more epochs (see README). The difference (68.88 vs. 63.9) may be caused by the running environment.

@bobpop1
Author

bobpop1 commented Oct 7, 2022

@bobpop1 Since we use cosine learning rate scheduler, you can just choose the best or final checkpoint during the training. By the way, I found out that you only gain 69.023 mIoU on validation set, which is about 1% lower than ours. Did you correctly use TTA during the inference?

Thank you for your reply. I use the default TTA settings. What is the correct setting? Could you share more details?

@yanx27
Owner

yanx27 commented Oct 7, 2022

@bobpop1

@bobpop1 Since we use cosine learning rate scheduler, you can just choose the best or final checkpoint during the training. By the way, I found out that you only gain 69.023 mIoU on validation set, which is about 1% lower than ours. Did you correctly use TTA during the inference?

Thank you for your reply. I use the default setting of TTA. And what is the correct setting? Can you show me more details?

Did you set `--num_vote=12` during testing?
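For reference, this kind of vote-based TTA can be sketched as follows. This is a minimal sketch, not this repo's implementation: the `model_fn` interface and the choice of z-axis rotations as the augmentation are assumptions.

```python
import numpy as np

def tta_vote(model_fn, points, num_vote=12):
    """Hypothetical sketch of voting-based TTA: run the model on
    num_vote rotated copies of the scan and average the class scores.
    `model_fn(points) -> (N, C) per-point scores` is an assumed interface."""
    votes = None
    for k in range(num_vote):
        theta = 2.0 * np.pi * k / num_vote  # rotate about the z-axis
        c, s = np.cos(theta), np.sin(theta)
        rot = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
        scores = model_fn(points @ rot.T)   # forward pass on rotated scan
        votes = scores if votes is None else votes + scores
    return (votes / num_vote).argmax(axis=1)  # per-point class ids
```

Averaging scores over augmented passes usually gives a small but consistent mIoU gain over a single pass, which matches the num_vote=1 vs. num_vote=12 gap discussed above.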

@isunLt

isunLt commented Oct 7, 2022

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The results of 70.05 is gained by fine-tuning the model with more epochs (see README). The difference (68.88 v.s. 63.9) may cause by the running environment.

I see! Thanks for your quick response!

@bobpop1
Author

bobpop1 commented Oct 7, 2022

@bobpop1

@bobpop1 Since we use cosine learning rate scheduler, you can just choose the best or final checkpoint during the training. By the way, I found out that you only gain 69.023 mIoU on validation set, which is about 1% lower than ours. Did you correctly use TTA during the inference?

Thank you for your reply. I use the default setting of TTA. And what is the correct setting? Can you show me more details?

Did you set `--num_vote=12' in the testing?

Yes, we set `--num_vote=12`. By the way, another question: I found that the code runs slower than the claimed speed. What running environment do you use to reach the faster speed?

@yanx27
Owner

yanx27 commented Oct 7, 2022

@bobpop1

@bobpop1 Since we use cosine learning rate scheduler, you can just choose the best or final checkpoint during the training. By the way, I found out that you only gain 69.023 mIoU on validation set, which is about 1% lower than ours. Did you correctly use TTA during the inference?

Thank you for your reply. I use the default setting of TTA. And what is the correct setting? Can you show me more details?

Did you set `--num_vote=12' in the testing?

Yes, we have set '--num_vote=12'. By the way, there is another question. I found that the code is slower than the claimed results. What running environment do you use to achieve faster speed?

I see. If 69.023 was obtained by training for 64 epochs, it is very close to ours. Regarding the network speed: the default setting includes data-loading time, which is time-consuming. The inference time in our paper is the pure network inference time. You can use the snippet below to measure the speed.

import time
import torch

torch.cuda.synchronize()   # make sure prior GPU work has finished
start = time.time()
model(data_dict)           # a single forward pass
torch.cuda.synchronize()   # wait for the forward pass to complete
end = time.time()
print("inference time:", end - start)
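A slightly more robust variant adds warm-up iterations and averages over several runs. This is a sketch under the assumption that `model` and `data_dict` are a PyTorch module and its input; it falls back gracefully on CPU:

```python
import time
import torch

def timed_forward(model, data, warmup=5, iters=20):
    """Sketch of measuring pure network time: run warm-up passes first
    (to exclude one-time startup costs such as kernel compilation),
    then average over several iterations. Synchronize only when a GPU
    is present so the sketch also runs on CPU."""
    sync = torch.cuda.synchronize if torch.cuda.is_available() else (lambda: None)
    with torch.no_grad():
        for _ in range(warmup):
            model(data)
        sync()
        start = time.perf_counter()
        for _ in range(iters):
            model(data)
        sync()
    return (time.perf_counter() - start) / iters  # seconds per forward pass
```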

@isunLt

isunLt commented Oct 7, 2022

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The results of 70.05 is gained by fine-tuning the model with more epochs (see README). The difference (68.88 v.s. 63.9) may cause by the running environment.

Have you tried training on SemanticKITTI with multiple GPUs? Are there any differences in the config compared with single-GPU training?

@yanx27
Owner

yanx27 commented Oct 7, 2022

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The results of 70.05 is gained by fine-tuning the model with more epochs (see README). The difference (68.88 v.s. 63.9) may cause by the running environment.

Have you tried train semantickiiti with multiple gpus? Are there any difference in the config compared with training with a single gpu?

@isunLt Hi, we haven't tried multiple GPUs on SemanticKITTI.

@yanx27 yanx27 changed the title Testing results Testing results on SemanticKITTI Oct 7, 2022
@isunLt

isunLt commented Oct 7, 2022

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The results of 70.05 is gained by fine-tuning the model with more epochs (see README). The difference (68.88 v.s. 63.9) may cause by the running environment.

Have you tried train semantickiiti with multiple gpus? Are there any difference in the config compared with training with a single gpu?

@isunLt Hi, we haven't tried multiple gpus on semantickitti.

I see.

@KIM-5-WEE-8

Hi @bobpop1, this checkpoint is gained by training only on the training set, and the results on the test set need to the model training on both training and validation set.

Thanks for your reply. How do you choose the best checkpoint when train on both training and validation set?

Hi @yanx27, I have a similar question to @bobpop1's. Training on both the train and val sets while validating on the val set makes the model fit the val set, so the validation results should become much higher. What scores did you obtain under this setting? Did you train with the val set from the beginning, or did you fine-tune on val after training on train? Thanks

@yanx27
Owner

yanx27 commented Oct 9, 2022

Hi @bobpop1, this checkpoint is gained by training only on the training set, and the results on the test set need to the model training on both training and validation set.

Thanks for your reply. How do you choose the best checkpoint when train on both training and validation set?

Hi @yanx27, I have a similar question as @bobpop1. Training on both val and train sets and validating on val set will make the model fit the val set. The validation results on val should become much higher. What scores did you obtain under this setting? Were you training with the val set from the beginning or you just fine-tuned the model on val after training on train? Thanks

@KIM-5-WEE-8 We train from scratch directly on both the training and validation sets. As you mentioned, the results on the validation set are meaningless, since the model has already fitted it. However, since we use a cosine learning rate scheduler, the best model is easily obtained from the last checkpoints (see the mIoU curve on the validation set in #11). You can also gain higher test mIoU by further fine-tuning the model.
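Why the last checkpoint is close to the best one can be illustrated with the generic cosine annealing formula (the `base_lr` here is a hypothetical placeholder, not this repo's config):

```python
import math

def cosine_lr(step, total_steps, base_lr=0.01, min_lr=0.0):
    # Generic cosine annealing: the learning rate falls from base_lr
    # to min_lr over total_steps, flattening out near the end.
    cos = 0.5 * (1.0 + math.cos(math.pi * step / total_steps))
    return min_lr + (base_lr - min_lr) * cos

# Near the end of training the learning rate is close to zero, so the
# weights barely change and the last checkpoint sits close to the best one.
```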

@bobpop1
Author

bobpop1 commented Oct 9, 2022

@bobpop1

@bobpop1 Since we use cosine learning rate scheduler, you can just choose the best or final checkpoint during the training. By the way, I found out that you only gain 69.023 mIoU on validation set, which is about 1% lower than ours. Did you correctly use TTA during the inference?

Thank you for your reply. I use the default setting of TTA. And what is the correct setting? Can you show me more details?

Did you set `--num_vote=12' in the testing?

Yes, we have set '--num_vote=12'. By the way, there is another question. I found that the code is slower than the claimed results. What running environment do you use to achieve faster speed?

I see. If 69.023 is gained by training with 64 epochs, it is very closed to ours. For the network speed, the default setting will consider the time of loading data, which is much time-consuming. The inference time in our paper is gained by the pure time for the network inference. You can follow the below setting to test the speed.

torch.cuda.synchronize()
start = time.time()
model(data_dict)
torch.cuda.synchronize()
end = time.time()
print("inference time:", end - start )

Thanks for your reply. I also found that there is an 'instance_label' in the dataloader. How do you obtain the 'instance_label'?

@yanx27
Owner

yanx27 commented Oct 11, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

The instance label is obtained here. For instance-level augmentation, you can refer to RPVNet.
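For context, SemanticKITTI stores both labels in a single uint32 per point: the lower 16 bits hold the semantic class and the upper 16 bits the instance id (see the semantic-kitti-api). A minimal loader sketch, not this repo's exact code:

```python
import numpy as np

def read_kitti_labels(path):
    """Unpack a SemanticKITTI .label file: one uint32 per point,
    lower 16 bits = semantic class, upper 16 bits = instance id."""
    raw = np.fromfile(path, dtype=np.uint32)
    sem_label = raw & 0xFFFF   # semantic class per point
    inst_label = raw >> 16     # instance id per point
    return sem_label, inst_label
```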

@bobpop1
Author

bobpop1 commented Oct 11, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We trained the model on the training and validation sets; the best val mIoU is 81.043. We uploaded the last checkpoint and the best checkpoint to CodaLab, and the test mIoU is 65 and 69, respectively.

@yanx27
Owner

yanx27 commented Oct 13, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, judging from your results, you may have forgotten to modify the training size before training on the additional validation set. In practice, the last checkpoint should be very close to the best one. If you keep the original training size, the model reaches the minimum learning rate in earlier epochs and cannot converge by the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.
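The interaction between training_size and the scheduler can be sketched with a hypothetical helper (the repo's actual step computation may differ):

```python
import math

def scheduler_total_steps(training_size, batch_size, epochs):
    # Per-step schedulers need the true number of optimizer steps.
    # If the dataset grows (e.g. train + val scans) but training_size
    # is not updated, the schedule reaches its minimum learning rate
    # before training actually ends.
    steps_per_epoch = math.ceil(training_size / batch_size)
    return steps_per_epoch * epochs
```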

@bobpop1
Author

bobpop1 commented Oct 13, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

Thank you very much! As far as I know, there are 19130 scans in the training set. Why do you set training_size to 19132?

@yanx27
Owner

yanx27 commented Oct 13, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

Thank you very much! As I know, there are 19130 scans in the training set. Why do you set the training_size 19132?

We use the same config as SPVNAS.

@bobpop1
Author

bobpop1 commented Oct 15, 2022

From Fig. 6 in your paper, I observe that AF2S3Net achieves 84.4 mIoU in the 0-10m range. To our knowledge, the source code of AF2S3Net has not been released; how did you obtain this result?

@bobpop1
Author

bobpop1 commented Oct 16, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

We followed the above settings and changed the training size to 23203, so the last checkpoint is close to the best one. However, the best val mIoU is still about 81.0, and the best test mIoU is 68.

@yanx27
Owner

yanx27 commented Oct 17, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

We follow the above settings to change the training size as 23203 and thus the last checkpoint is close to the best one. However, the best val miou is about 81.0 yet. The best test miou is 68.

According to your results, using the additional validation set seems to even reduce performance (68.2 vs. 68), which differs from our observation. We are currently very busy with an upcoming deadline; the code and the pre-trained model for the benchmark will be released once everything is ready.

@bobpop1
Author

bobpop1 commented Oct 17, 2022

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

We follow the above settings to change the training size as 23203 and thus the last checkpoint is close to the best one. However, the best val miou is about 81.0 yet. The best test miou is 68.

According to your results, using additional validation set seems to even reduce the performance (68.2 v.s. 68), which is different from our observation. Currently, we are very busy in the incoming deadline, and the codes and pre-trained model on benchmark need to appear in the future after everything prepared well.

Could you share the mIoU score you get on the test set when training on both the training and validation sets?

@ZHUANGMINGXI

@bobpop1 Hi, may I ask whether you trained for 64 epochs? My model uses the default 64 epochs and only reaches about 65.4 mIoU on the val set.

@chenst27

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The results of 70.05 is gained by fine-tuning the model with more epochs (see README). The difference (68.88 v.s. 63.9) may cause by the running environment.

Thanks for your great work!
As mentioned in the README, you trained the SemanticKITTI model for more epochs and thus obtained a higher mIoU. Could you please tell me how many epochs you used?

@isunLt

isunLt commented Oct 28, 2022

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The results of 70.05 is gained by fine-tuning the model with more epochs (see README). The difference (68.88 v.s. 63.9) may cause by the running environment.

Thanks for your great work! As you mentioned in README, you trained the model for SemanticKITTI with more epochs and thus gained the higher mIoU. Could you please tell me how many epochs you used?

I trained the network on SemanticKITTI for 100 epochs, but the result dropped to 63.3 on a single GPU (which is strange; I wonder if it is because I interrupted the training and then resumed from a checkpoint), and I get 65.8 on 3 GPUs. I tried to resume from the officially released checkpoint (the 70.05 one) and it starts from the 63rd epoch. Usually a network reaches its best performance during the last several epochs.

@chenst27

@isunLt Hi, I tested this code with pytorch 1.8 and spconv-cu111. It achieves 66.5 with num_vote=1 and 69.3 with num_vote=12, as reported in the paper. The results of 70.05 is gained by fine-tuning the model with more epochs (see README). The difference (68.88 v.s. 63.9) may cause by the running environment.

Thanks for your great work! As you mentioned in README, you trained the model for SemanticKITTI with more epochs and thus gained the higher mIoU. Could you please tell me how many epochs you used?

I have trained the network on SemanticKITTI with 100 epoch but the result drop to 63.3 (it is strange. I wonder if it is because I interrupt the training and the resume from the checkpoint) when trained on a single gpu, and get 65.8 on 3 gpu. I tried to resume from the officially released checkpoint (the 70.05 one) and it start from the 63rd epoch. Usually, a network will reach its best performance during the last several epoch.

Thanks for your reply.

@aminebdj

aminebdj commented Jan 4, 2023

@bobpop1 were you able to find the problem? I am also getting 68 test mIoU on codalab.

@callzhang

callzhang commented Jan 15, 2023

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

Thank you very much! As I know, there are 19130 scans in the training set. Why do you set the training_size 19132?

Where is training_size used in the code? Is it used at all? I can only find it in the config files. And why can we ignore training_size when the lr_scheduler is set up?

@Jaywxy

Jaywxy commented Aug 10, 2023

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

If I change the batch_size to 2 and the epochs to 60, do I need to change the training size?

@Jaywxy

Jaywxy commented Aug 10, 2023

Thanks for your reply. And I found that there is the 'instance_label' in dataloader. How do you obtain the 'instance_label'?

Instance label is gained here. For the instance level augmentation, you can refer to RPVNet.

Thank you very much. We train the model on training and validation sets. The best val miou is 81.043. And we upload the last checkpoint and the best checkpoint to codalab. However, the mIoU is 65 and 69, respectively.

Hi, according to your results, you may ignore modifying training size before training on the additional valid set. In practice, the last checkpoint will very close to the best one. If you use the original training size, the model will achieve the minimum learning rate in earlier epochs, and cannot converge in the last one. You can also set lr_scheduler to CosineAnnealingLR if you don't want to change the training size.

Thank you very much! As I know, there are 19130 scans in the training set. Why do you set the training_size 19132?

Where is training_size being used in code? Is it even being used? I only find it exists in config files. And why can we ignore training_size when lr_scheduler is being set up?

training_size is used in configure_optimizers, but I don't know whether I need to change it if I set batch_size to 2 and epochs to 60. I changed the batch_size to 2 and the training_size to 19132, but the progress bar shows 13636 train steps per epoch, and I don't know where this 13636 comes from.


10 participants