Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about the training hyperparameters #10

Closed
1349949 opened this issue Feb 9, 2022 · 7 comments
Closed

Questions about the training hyperparameters #10

1349949 opened this issue Feb 9, 2022 · 7 comments

Comments

@1349949
Copy link

1349949 commented Feb 9, 2022

Hi,
I have some questions about the training hyperparameters.

  1. What’s the best model’s training batchsize(samples_per_gpu) exactly is in the Table.2 & 3 of your paper? All the training configs you provided in the repo set samples_per_gpu=1, is the same as the best model?

  2. The ablation study in the paper using 20% data for training. What about other training hyperparameters? Such as training epochs, batchsize, training 3 classes together or separately.

I’m doing some reproducing experiments, so I need the training hyperparameters mentioned above.

@Abyssaledge
Copy link
Collaborator

@1349949 Thanks for using!

  1. We use samples_per_gpu=1 for all experiments.
  2. For ablation in Table 1, we train 3 classes together for 12 epoch. For other ablations, we train Vehicle and Pedestrian separately. Training 3 classes together will obtain better performances on Pedestrian and Cyclist.

@minwang-ai
Copy link

Hi,
I have two questions :

  1. Can we only use 20% data for ablation study?
  2. May I ask how long it takes to use the whole dataset for training with one frame and three frame?

@Abyssaledge
Copy link
Collaborator

@MinWang1997

  1. Although I don't know what you're worried about, you can certainly do it using 20% data.
  2. ~ 60h for single frame, maybe 80h for 3 frames with 8 2080Ti.

@ziqipang
Copy link
Collaborator

@MinWang1997 Just to clarify, 20% data for ablation is only valid for Waymo Open Dataset, since 20% of Waymo Open Dataset is still several times larger than KITTI/nuScenes. That said, using 20% data is only applicable and adopted for Waymo Open Dataset and not others.

@minwang-ai
Copy link

minwang-ai commented Feb 15, 2022 via email

@ziqipang
Copy link
Collaborator

@MinWang1997 Yes, it is valid to tune hyper-parameters with 20% waymo dataset.

@minwang-ai
Copy link

minwang-ai commented Feb 15, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants