Question about the training #36

DanHalp · 2021-12-01T21:35:21Z

Hey there!

First, I appreciate your fast responses; I don't take them for granted :)

I have a couple of questions about the training process:

1. What's the difference between the arguments --ckpt and --pretrained_model in train.py? Isn't a pre-trained model just a model that has been trained for some epochs, like a checkpoint?
2. Are there blocks of code that are not trained and are used as pretrained models? For example, the VoxelNet part: do we actually train the subnetwork that processes the voxels into the overhead-view pseudo-image?
3. Is the VoxelNet backbone responsible for both voxelizing the point cloud and creating the overhead-view pseudo-image?
4. We trained the model on 10% of the training data for 80 epochs with a batch_size of 4. To our big surprise, it performed almost as well as the model trained on the full data that you referred to here: pre-trained model #9. Does that make sense?
5. Do we use RCNNs at the first stage for the centerpoint.yaml config?
6. We're struggling to understand the heatmap concept: how it is created, and how the GaussianFocalLoss loss function is applied to it. Do you have any hints on where beginners might find answers? Google assumes we were born with that knowledge.

Thanks :)

tianweiy · 2021-12-02T16:24:29Z

1. Not 100 percent sure, but I think ckpt also includes the optimizer state, while pretrained_model only loads the model weights.
2. Yes, we train it.
3. No, the voxelization is done separately in the data loader:

CenterPoint-KITTI/pcdet/datasets/processor/data_processor.py

Line 50 in 222232e

voxel_generator = VoxelGenerator(

4. I never tried, but that seems possible.
5. No, it is only used in the second stage (centerpoint_rcnn.yaml).
6. Yeah, it is a line of work from CornerNet to CenterNet to ours. You can read the previous papers: CenterNet (https://arxiv.org/pdf/1904.07850.pdf) and CornerNet (https://arxiv.org/abs/1808.01244).
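
For the heatmap question, here is a minimal sketch of the idea as described in the CornerNet/CenterNet papers, not the code from this repo. Each object center is drawn onto a target map as an unnormalized Gaussian peak (value 1 at the center, decaying around it), and the predicted map is compared against it with the penalty-reduced focal loss from those papers. The function names, the fixed `sigma`, and the assumption that centers fall on integer pixel coordinates are all mine for illustration; the repo's GaussianFocalLoss may differ in details such as normalization.

```python
import math

def draw_gaussian(heatmap, center, sigma):
    """Draw an unnormalized Gaussian peak (1.0 at the center) onto heatmap in place."""
    cx, cy = center  # assumed to be integer pixel coordinates
    for y, row in enumerate(heatmap):
        for x in range(len(row)):
            g = math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
            row[x] = max(row[x], g)  # where peaks overlap, keep the larger value

def gaussian_focal_loss(pred, target, alpha=2.0, beta=4.0, eps=1e-12):
    """Penalty-reduced focal loss, averaged over the number of object centers."""
    loss = 0.0
    num_pos = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            if t == 1.0:
                # Object center: push the prediction towards 1.
                loss -= (1 - p) ** alpha * math.log(p + eps)
                num_pos += 1
            else:
                # Background: push towards 0, with a smaller penalty near a center.
                loss -= (1 - t) ** beta * p ** alpha * math.log(1 - p + eps)
    return loss / max(num_pos, 1)

# Hypothetical 5x5 map with a single object centered at (x=2, y=2).
target = [[0.0] * 5 for _ in range(5)]
draw_gaussian(target, center=(2, 2), sigma=1.0)

perfect_pred = [[1.0 if t == 1.0 else 0.0 for t in row] for row in target]
uniform_pred = [[0.5] * 5 for _ in range(5)]

perfect_loss = gaussian_focal_loss(perfect_pred, target)
uniform_loss = gaussian_focal_loss(uniform_pred, target)
```

The `(1 - t) ** beta` factor is why the target is a Gaussian and not a single hot pixel: predictions that land one or two cells away from the true center are penalized less than predictions far from any object.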

Let me know if you have any specific problems.
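
On the first point, the distinction I have in mind can be sketched without any framework, using plain dictionaries in place of real model and optimizer objects. This assumes the behavior I described above (which I have not double-checked in train.py), and the dictionary keys and function names are hypothetical, chosen only for illustration.

```python
import copy

def make_checkpoint(model_weights, optimizer_state, epoch):
    """Full training snapshot: everything needed to continue an interrupted run."""
    return {
        "model_state": copy.deepcopy(model_weights),
        "optimizer_state": copy.deepcopy(optimizer_state),
        "epoch": epoch,
    }

def resume_from_checkpoint(ckpt):
    """Resume training: restore the weights, the optimizer state, and the epoch counter."""
    return (
        copy.deepcopy(ckpt["model_state"]),
        copy.deepcopy(ckpt["optimizer_state"]),
        ckpt["epoch"],
    )

def init_from_pretrained(ckpt, fresh_optimizer_state):
    """Start a new run: reuse only the weights; optimizer and epoch start from scratch."""
    return copy.deepcopy(ckpt["model_state"]), fresh_optimizer_state, 0

# Hypothetical snapshot taken after 40 epochs of training.
saved = make_checkpoint({"w": [0.3, -1.2]}, {"lr": 0.001, "momentum": [0.05, 0.02]}, 40)

resumed_weights, resumed_opt, resumed_epoch = resume_from_checkpoint(saved)
new_weights, new_opt, new_epoch = init_from_pretrained(
    saved, {"lr": 0.003, "momentum": [0.0, 0.0]}
)
```

In both cases the weights are identical; what differs is whether the run continues where it stopped or starts a fresh schedule from those weights.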
