
Modality fusion implementation question #53

Open
kjwkch opened this issue Apr 7, 2023 · 8 comments

Comments

@kjwkch

kjwkch commented Apr 7, 2023

Hello. I am studying 2DPASS through the code.
It seems that the modality fusion implementation is in network/arch_2dpass.py, lines 100 to 105.
The paper specifies an element-wise add, but I do not see it in the code, so I am asking.
Below is the code.

# modality fusion

feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight)

I think that [fuse_feat = F.relu(feat_cat * feat_weight) + img_feat] is what implements the formula in the paper.
Am I right?
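To make the question concrete, here is a minimal, self-contained sketch of the fusion step with the residual add appended. The module name `FusionSketch` and the channel sizes are hypothetical (they are not from the repository); the point is only that `F.relu(feat_cat * feat_weight)` is the attention-gated feature and the trailing `+ img_feat` is the element-wise add in question.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FusionSketch(nn.Module):
    """Hypothetical stand-in for one fusion stage in arch_2dpass.py."""

    def __init__(self, pts_channels=64, img_channels=64):
        super().__init__()
        self.leaner = nn.Linear(pts_channels, img_channels)    # 2D learner
        self.fcs1 = nn.Linear(img_channels * 2, img_channels)  # after concat
        self.fcs2 = nn.Linear(img_channels, img_channels)      # attention weights

    def forward(self, pts_feat, img_feat):
        feat_learner = F.relu(self.leaner(pts_feat))
        feat_cat = torch.cat([img_feat, feat_learner], 1)
        feat_cat = self.fcs1(feat_cat)
        feat_weight = torch.sigmoid(self.fcs2(feat_cat))
        # gated feature plus the residual (element-wise) add of img_feat
        return F.relu(feat_cat * feat_weight) + img_feat

fusion = FusionSketch()
pts = torch.randn(8, 64)   # 8 points, 64-dim point features
img = torch.randn(8, 64)   # matching 64-dim image features
out = fusion(pts, img)
print(out.shape)  # torch.Size([8, 64])
```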

@brahami14

I have the same question.

Thanks in advance.

@LiXiang0021

Have you guys successfully reproduced the model on nuScenes? I ran several experiments, but the performance is far from the reported results. I also tested the provided weights on nuScenes and got results similar to the reported ones, so I'd like to know whether I forgot to set some arguments.

@kjwkch
Author

kjwkch commented Apr 21, 2023

This issue is about the code implementation, not performance. I think the released code is not implemented as described in the paper.

@kjwkch
Author

kjwkch commented Apr 21, 2023

This is what is missing in the code.

# modality fusion

feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight)

I think that [fuse_feat = F.relu(feat_cat * feat_weight) + img_feat] implements the formula in the paper:

feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight) + img_feat

@LiXiang0021

Thanks for your reply; I will look into this further.

@LiXiang0021

I just trained the modified version as you suggested, and the performance did improve a little, by around 2 mIoU. I believe there may be other incorrect or missing pieces in the released code. Thank you again.

@jaywu109

@kjwkch @LiXiang0021
After reviewing the current implementation, I noticed that, besides the fusion modification, the point feature passed through the 2D learner needs to be added back to the original point feature before going through multihead_3d_classifier, in order to match the model architecture outlined in the paper below:
[screenshot: model architecture figure from the paper]

        feat_learner = F.relu(self.leaners[idx](pts_feat)) 
        # feat_learner -> voxel-wise feature after 2D learner

        pts_pred_full = self.multihead_3d_classifier[idx]((pts_feat+feat_learner)) 
        # pts_feat+feat_learner -> voxel-wise Enhanced 3D Features

        # correspondence
        pts_label_full = self.voxelize_labels(data_dict['labels'], data_dict['layer_{}'.format(idx)]['full_coors'])
        pts_pred = self.p2img_mapping(pts_pred_full[coors_inv], point2img_index, batch_idx)

        # modality fusion

        feat_learner = self.p2img_mapping(feat_learner[coors_inv], point2img_index, batch_idx)
        # feat_learner -> point-wise feature after 2D learner and img_mapping

        feat_cat = torch.cat([img_feat, feat_learner], 1)
        feat_cat = self.fcs1[idx](feat_cat)
        feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
        fuse_feat = F.relu(feat_cat * feat_weight) + img_feat

Currently, the implementation feeds the point feature directly into multihead_3d_classifier instead of first adding the feature produced by the 2D learner.

pts_feat = data_dict['layer_{}'.format(idx)]['pts_feat']
coors_inv = data_dict['scale_{}'.format(last_scale)]['coors_inv']
# 3D prediction
pts_pred_full = self.multihead_3d_classifier[idx](pts_feat)
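The difference between the two paths can be sketched in isolation. This is not the repository's code: the channel count, class count, and the plain `nn.Linear` stand-ins for the 2D learner and `multihead_3d_classifier[idx]` are all hypothetical; only the `pts_feat + feat_learner` residual is the change under discussion.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

channels, num_classes, num_voxels = 64, 20, 100
leaner = nn.Linear(channels, channels)          # stand-in for self.leaners[idx]
classifier = nn.Linear(channels, num_classes)   # stand-in for multihead_3d_classifier[idx]

pts_feat = torch.randn(num_voxels, channels)
feat_learner = F.relu(leaner(pts_feat))         # voxel-wise feature after 2D learner

# released code: classifier(pts_feat)
# proposed fix:  classifier(pts_feat + feat_learner)  -> "Enhanced 3D Features"
pts_pred_full = classifier(pts_feat + feat_learner)
print(pts_pred_full.shape)  # torch.Size([100, 20])
```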

@yanx27, I would appreciate any suggestions you may have regarding this matter.

@brahami14

@jaywu109 do the new script changes work for you?
