About paper #6

BMEI1314 · 2022-01-04T06:22:09Z

Hi,
We think that MomentDETR has great potential, but looking at Table 6 in the paper, we find that the moment retrieval metrics on the Charades-STA dataset are not much higher than those of IVG-DCL. Notably, IVG-DCL uses C3D features for video and GloVe for text embeddings, whereas your work uses CLIP + SlowFast features. Have you tested on other video grounding datasets, such as ActivityNet?

jayleicn · 2022-01-04T17:22:26Z

Hi @BMEI1314, in our work, we primarily focus on collecting the QVHighlights dataset and developing the MomentDETR model on top of it. On Charades-STA, we did not tune the model much, but we still see a significant performance improvement on R1@0.5 (e.g., +3, or +5 with pretraining). We did not test on other datasets.

BMEI1314 · 2022-01-05T01:12:58Z

Thanks for your quick reply. We look forward to your follow-up work.

BMEI1314 closed this as completed Jan 5, 2022
