Do you have any idea why oneformer3d is very sensitive to different backbones? #49

Open
yxchng opened this issue Apr 1, 2024 · 11 comments

@yxchng commented Apr 1, 2024

I tried changing the backbone from SpConv to supposedly more powerful new backbones like PTv3 and Swin3D without changing any other parameters, but both give much poorer results on S3DIS. You also seem to be using different backbones for different tasks, which suggests that this framework might be sensitive to the backbone used. Do you have any idea why this is the case?

@oneformer3d-contributor (Collaborator)

One thing: you need to carefully match the point cloud pre-processing between our repo and your new backbone (e.g., PTv3), such as color normalization, voxel size, and elastic transform. These things should be changed along with the backbone.
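
For concreteness, here is a minimal sketch of the kind of conventions that have to agree between the two codebases; the normalization range and voxel size below are hypothetical example values, not this repo's actual settings:

```python
import numpy as np

def normalize_colors(colors):
    # colors: (N, 3) uint8 RGB. Repos differ: some scale to [0, 1],
    # others to [-1, 1] or subtract a dataset mean. The backbone must
    # see the same convention it was (pre-)trained with.
    return colors.astype(np.float32) / 127.5 - 1.0  # example: map to [-1, 1]

def voxelize(coords, voxel_size=0.02):
    # coords: (N, 3) float xyz in meters. Quantize to integer voxel indices.
    # A different voxel_size changes the effective receptive field of a
    # sparse-conv backbone, so it should match the backbone's training setup.
    return np.floor(coords / voxel_size).astype(np.int64)
```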

Please share your results if you get something interesting with these backbones :)

@yxchng (Author) commented Apr 1, 2024

Hmmm, I am not very familiar with the elastic transform. Is this transform different for different backbones? Also, isn't this transform applied only during training? So even if it is slightly different, shouldn't it have little impact on evaluation?

@oneformer3d-contributor (Collaborator)

Not sure, but it is a rather strong augmentation. If the backbone was not trained with it, it can possibly break the pre-trained weights. I recommend completely following the preprocessing and augmentations of the new backbone.
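
For reference, a minimal sketch of the elastic distortion commonly used in 3D segmentation pipelines (the PointGroup/SparseConvNet-style version): a coarse random noise grid is blurred into a smooth displacement field and interpolated at every point. The `granularity` and `magnitude` defaults below are illustrative, not this repo's exact settings:

```python
import numpy as np
import scipy.ndimage
import scipy.interpolate

def elastic_distortion(coords, granularity=0.2, magnitude=0.4):
    # coords: (N, 3) float xyz. Returns coords displaced by a smooth random field.
    blur_kernels = [
        np.ones((3, 1, 1, 1), dtype=np.float32) / 3,
        np.ones((1, 3, 1, 1), dtype=np.float32) / 3,
        np.ones((1, 1, 3, 1), dtype=np.float32) / 3,
    ]
    coords_min = coords.min(0)

    # Coarse noise grid covering the scene, one cell per `granularity` meters
    noise_dim = ((coords - coords_min).max(0) // granularity).astype(int) + 3
    noise = np.random.randn(*noise_dim, 3).astype(np.float32)

    # Smooth the noise with repeated separable box blurs
    for _ in range(2):
        for kernel in blur_kernels:
            noise = scipy.ndimage.convolve(noise, kernel, mode='constant', cval=0)

    # Interpolate the smooth field at each point and displace
    axes = [np.linspace(lo, lo + granularity * (d - 1), d)
            for lo, d in zip(coords_min - granularity, noise_dim)]
    interp = scipy.interpolate.RegularGridInterpolator(
        axes, noise, bounds_error=False, fill_value=0)
    return coords + interp(coords) * magnitude
```

It is typically applied only at training time (often twice, at two different granularities), which is exactly why a backbone pre-trained without it sees a noticeably different input distribution.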

@RayYoh commented Apr 17, 2024

> Not sure, but it is a rather strong augmentation. If the backbone was not trained with it, it can possibly break the pre-trained weights. I recommend completely following the preprocessing and augmentations of the new backbone.

Hi, authors. Actually, I have tried this based on Pointcept, training from scratch.
My result is worse than yours even though I add normals as features (about 1 point lower in AP50 on ScanNet v2, using top-k = 100). But SPFormer gets higher results than claimed in the paper (maybe because I add normals). I am confused: does the elastic transform really influence the results that much?
I also ran your repo for the instance-only results (removing semantic and panoptic segmentation, top 100) and got just about 77.0. I'd like to ask whether the other two tasks help instance segmentation much?
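
For clarity on the "top 100" above: in SPFormer/OneFormer3D-style models, the final instance predictions are the top-k (query, class) pairs ranked by classification score. A minimal sketch of that kind of selection, with hypothetical tensor names and a Mask2Former-style no-object column, not the repo's exact code:

```python
import torch

def select_topk_instances(cls_logits, mask_logits, k=100):
    # cls_logits: (Q, C+1) per-query class logits, last column = no-object
    # mask_logits: (Q, N) per-query point/superpoint mask logits
    scores = cls_logits.softmax(-1)[:, :-1]      # drop the no-object column
    num_classes = scores.shape[1]
    topk = scores.flatten().topk(min(k, scores.numel()))
    query_idx = topk.indices // num_classes      # which query each pick came from
    labels = topk.indices % num_classes          # which class each pick is
    masks = mask_logits[query_idx] > 0           # binarize the selected masks
    return labels, topk.values, masks
```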

@oneformer3d-contributor (Collaborator)

Hi @RayYoh,
Yes, I think the elastic transform gives something like +4 for both OneFormer3D and SPFormer. No, semantic segmentation does not have a positive impact on the instance segmentation metrics. Also, are you starting from a pre-trained model? It is important for achieving good results.

@RayYoh commented Apr 18, 2024

> Hi @RayYoh, Yes, I think the elastic transform gives something like +4 for both OneFormer3D and SPFormer. No, semantic segmentation does not have a positive impact on the instance segmentation metrics. Also, are you starting from a pre-trained model? It is important for achieving good results.

Hi, what does +4 mean? Is it for the AP50 result?
Actually, I didn't use a pre-trained backbone, since it uses a totally different data augmentation pipeline (e.g., coordinate shift, etc.). I just train from scratch like Mask3D, using 600 epochs and OneCycleLR; it works well for SPFormer but not for OneFormer3D on ScanNet v2.
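
For reference, a minimal sketch of that kind of from-scratch schedule with PyTorch's built-in OneCycleLR; the optimizer choice and learning rate are hypothetical examples, not Mask3D's exact recipe (`model` and `train_loader` are assumed to exist):

```python
import torch

# Hypothetical values for illustration only
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=0.05)
epochs = 600
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=1e-4, epochs=epochs, steps_per_epoch=len(train_loader))

for epoch in range(epochs):
    for batch in train_loader:
        loss = model(batch)   # assumed to return the training loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        scheduler.step()      # OneCycleLR is stepped once per iteration
```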

@oneformer3d-contributor (Collaborator)

I think something like +4 mAP50 (maybe less). If you use a backbone from Pointcept, e.g. PTv3, you can also start from their pre-trained weights. I think it should help a lot.
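
A minimal sketch of what starting from pre-trained backbone weights can look like in PyTorch; the checkpoint path, key layout, and `model.backbone` attribute are assumptions about a typical setup, not Pointcept's exact format:

```python
import torch

ckpt = torch.load('ptv3_pretrained.pth', map_location='cpu')  # hypothetical path
state = ckpt.get('state_dict', ckpt)  # some checkpoints nest weights here

# Keep only tensors whose names and shapes match the new backbone,
# then load non-strictly so heads/decoder stay randomly initialized.
backbone_state = model.backbone.state_dict()
filtered = {k: v for k, v in state.items()
            if k in backbone_state and v.shape == backbone_state[k].shape}
model.backbone.load_state_dict(filtered, strict=False)
print(f'loaded {len(filtered)}/{len(backbone_state)} backbone tensors')
```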

@RayYoh commented Apr 19, 2024

Yeah, thanks for your suggestion. Additionally, I am confused about the results of the checkpoint in the repo. It seems that the instance segmentation result is better than in the original paper, but the semantic and panoptic results are a little bit worse. Why does this happen? In my opinion, better instance segmentation should lead to better semantic and panoptic results.

In addition, I'd like to ask if there are any tricks to get this better instance segmentation result, because in my reproduction I get a result similar to the paper (maybe because of randomness it is a little bit lower than the paper, and 1 point lower than your checkpoint).
[screenshot: reproduced evaluation results]

@oneformer3d-contributor (Collaborator)

Unfortunately, not many tricks: just the one about the loss weight from our README, and multiple training runs...

@RongkunYang

Dear authors, I'm wondering whether pre-training the backbone has an important effect on instance segmentation performance?

@oneformer3d-contributor (Collaborator)

Yes, when running our code without the pre-trained checkpoint, the mAP for instance segmentation is about 4% worse.
