Hi,

Thanks again for your contribution!

After reading the paper, I took a look at the code, especially the `GPT` class, and found something that confuses me:

```python
def forward(self, image_tensor, lidar_tensor, velocity):
    """
    Args:
        image_tensor (tensor): B*4*seq_len, C, H, W
        lidar_tensor (tensor): B*seq_len, C, H, W
        velocity (tensor): ego-velocity
    """
```

The paper says the input image is down-sampled to 5x22xC and the LiDAR to 8x8xC. If I understand correctly, the batch size B in your comments should be one at inference time, right? And why is the image input size B*4*seq_len, C, H, W — where does the 4 come from? Maybe I misunderstood something.

Best wishes!

Thanks again!
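For concreteness, here is how I currently read the image shape — a minimal sketch assuming the factor of 4 is a per-frame view count (`num_views` below is my own guess at an explanation, not a name taken from your code):

```python
import torch

# Hypothetical sizes for illustration only; num_views = 4 is my assumption
# about where the factor of 4 in B*4*seq_len might come from.
B, num_views, seq_len = 1, 4, 1
C, H, W = 3, 8, 8

# Stack per-view, per-timestep images, then flatten the leading axes
# so the encoder sees a plain (N, C, H, W) batch.
images = torch.randn(B, num_views, seq_len, C, H, W)
image_tensor = images.view(B * num_views * seq_len, C, H, W)
print(image_tensor.shape)  # torch.Size([4, 3, 8, 8])
```

Is something like this flattening what the comment describes, or does the 4 mean something else entirely?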