Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training process #14

Closed
ksh11023 opened this issue Jan 13, 2023 · 1 comment
Closed

Training process #14

ksh11023 opened this issue Jan 13, 2023 · 1 comment

Comments

@ksh11023
Copy link

Hello,

I want to ask some details about the training process.

CMKD uses pretrained SECOND-net for the Teacher Network.

When training the CMKD's Student network (CMKD Mono),
First, you use ~bev.yaml file to train the model with feature distillation loss

  • In this process, you don't calculate detection loss? If so, the purpose of this process is only to train BEV image feature to have similar patterns to the BEV lidar feature?

Second, you use ~V2.yaml file to train the model with feature distillation loss + detection loss

  • This process is used for final 3d object detection? Was 20 epochs enough for training?

+) Also, do you freeze the teacher network(SECOND) and do updates only for the Student network(CMKD Mono) ?

Thank you!

@Cc-Hy
Copy link
Owner

Cc-Hy commented Jan 13, 2023

Hi, for the fisrt question, here is a subsection in our paper explaing for this and you may take a look:

image

Note that in this setting we load the weights from the teacher model and thus it only needs to be finetuned, and 20 epochs are enouhgh.
For the second question, the anwser is yes. We fix the teacher model as the feature extractor during training.

@Cc-Hy Cc-Hy mentioned this issue Jan 13, 2023
@Cc-Hy Cc-Hy closed this as completed Jan 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants