
In PETRv2, which parts of the code embody the temporal information? #60

Closed
guohao02 opened this issue Sep 29, 2022 · 2 comments

@guohao02

No description provided.

@yingfei1016 (Collaborator)

Hi,
(1) Temporal alignment is handled during data processing: https://github.com/megvii-research/PETR/blob/main/tools/generate_sweep_pkl.py.
(2) We load the sweep data in the pipeline: https://github.com/megvii-research/PETR/blob/main/projects/configs/petrv2/petrv2_vovnet_gridmask_p4_800x320.py#L161. The sweep data is concatenated with the key-frame data along the view axis ((B, 6, 3, H, W) -> (B, 12, 3, H, W)). Data augmentation and training then proceed in the same way as for a single frame (see the sketch below).
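For illustration, here is a minimal sketch of the two steps, not the repository's actual code: the transform names, the 4x4 homogeneous / column-vector convention, and the random image tensors are all assumptions.

```python
import torch

# --- (1) Temporal alignment (done offline, cf. generate_sweep_pkl.py) ---
# Express the sweep camera's projection in the KEY frame's lidar coordinates
# by routing through the global frame:
# key_lidar -> global -> sweep_lidar -> sweep image plane.
def sweep_lidar2img_in_key_frame(sweep_lidar2img: torch.Tensor,
                                 sweep_global2lidar: torch.Tensor,
                                 key_lidar2global: torch.Tensor) -> torch.Tensor:
    # All inputs are illustrative 4x4 homogeneous transforms.
    return sweep_lidar2img @ sweep_global2lidar @ key_lidar2global

# --- (2) View-axis concatenation in the data pipeline ---
B, C, H, W = 2, 3, 320, 800
key_imgs = torch.randn(B, 6, C, H, W)    # 6 camera views at the key frame
sweep_imgs = torch.randn(B, 6, C, H, W)  # the same 6 views from a past sweep

# (B, 6, 3, H, W) -> (B, 12, 3, H, W); downstream augmentation and the
# backbone then treat this like a single-frame, 12-camera input.
imgs = torch.cat([key_imgs, sweep_imgs], dim=1)
print(imgs.shape)  # torch.Size([2, 12, 3, 320, 800])
```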

@exiawsh commented Sep 30, 2022

Hello,
I have also studied PETRv2. Here is my understanding:
(1) There is no explicit temporal positional embedding; instead, the network regresses the position offset.
(2) Time is distinguished through the multi-view embedding (MV in the table). That may cause generalization problems in real applications. However, I found that you can encode the time delay with a sin-cos embedding, which gives similar results (slightly worse mAVE).
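As an illustration of that idea, here is a minimal sketch of a sin-cos time-delay embedding; it is my own example, not code from the PETR repository, and the embedding dimension and frequency scale are assumptions.

```python
import torch

def sincos_time_embedding(delta_t: torch.Tensor, dim: int = 128,
                          max_period: float = 10.0) -> torch.Tensor:
    """Map time delays (N,) in seconds to a (N, dim) sinusoidal embedding."""
    half = dim // 2
    # Geometrically spaced frequencies, as in standard sinusoidal embeddings.
    freqs = torch.exp(-torch.log(torch.tensor(max_period))
                      * torch.arange(half) / half)
    args = delta_t[:, None] * freqs[None, :]
    return torch.cat([torch.sin(args), torch.cos(args)], dim=-1)

# Example: key frame (0 s delay) and a sweep captured ~0.5 s earlier.
emb = sincos_time_embedding(torch.tensor([0.0, 0.5]))
print(emb.shape)  # torch.Size([2, 128])
```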
