Skip to content

Latest commit

 

History

History
139 lines (127 loc) · 37.9 KB

video_grounding.md

File metadata and controls

139 lines (127 loc) · 37.9 KB

Visual Grounding/Localization

Survey

  • [2020 ICCST] A Survey of Temporal Activity Localization via Language in Untrimmed Videos, [paper], [bibtex].
  • [2021 ArXiv] A Survey on Natural Language Video Localization, [paper], [bibtex].
  • [2021 ArXiv] A Survey on Temporal Sentence Grounding in Videos, [paper], [bibtex].
  • [2022 ArXiv] The Elements of Temporal Sentence Grounding in Videos: A Survey and Future Directions, [paper], [bibtex].

Temporal Video Grounding

Weakly/Self/Semi/Un- Supervised Temporal Video Grounding

  • [2015 ICCV] Weakly-Supervised Alignment of Video With Text, [paper], [bibtex].
  • [2018 NeurIPS] Weakly Supervised Dense Event Captioning in Videos, [paper], [bibtex], sources: [XgDuan/WSDEC].
  • [2019 CVPR] Weakly Supervised Video Moment Retrieval From Text Queries, [paper], [bibtex], sources: [niluthpol/weak_supervised_video_moment].
  • [2019 EMNLP] WSLLN: Weakly Supervised Natural Language Localization Networks, [paper], [bibtex].
  • [2020 AAAI] Weakly-Supervised Video Moment Retrieval via Semantic Completion Network, [paper], [bibtex].
  • [2020 ArXiv] Look Closer to Ground Better: Weakly-Supervised Temporal Grounding of Sentence in Video, [paper], [bibtex].
  • [2020 ArXiv] Weakly-Supervised Multi-Level Attentional Reconstruction Network for Grounding Textual Queries in Videos, [paper], [bibtex].
  • [2020 ACMMM] Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos, [paper], [bibtex], sources: [ikuinen/regularized_two-branch_proposal_network].
  • [2020 ACMMM] Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos, [paper], [bibtex].
  • [2021 WACV] LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval, [paper], [bibtex].
  • [2021 ACMMM] AsyNCE: Disentangling False-Positives for Weakly-Supervised Video Grounding, [paper], [bibtex].
  • [2021 ACMMM] Towards Bridging Video and Language by Caption Generation and Sentence Localization, [paper], [bibtex].
  • [2021 ACMMM] Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval, [paper], [bibtex].
  • [2021 CVPR] Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning, [paper], [bibtex], [supplementary].
  • [2021 ICCV] Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation, [paper], [bibtex].
  • [2021 ArXiv] Self-supervised Learning for Semi-supervised Temporal Language Grounding, [paper], [bibtex].
  • [2021 EMNLP] Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding, [paper], [bibtex].
  • [2022 AAAI] Unsupervised Temporal Video Grounding with Deep Semantic Clustering, [paper], [bibtex].

Bias in Temporal Video Grounding

Spatio-Temporal Video Grounding

Video Corpus Moment Retrieval (Video Retrieval + Moment Localization)

Other Video Groundings

Video Re-localization

Audio based Temporal Video Grounding

Image based Temporal Video Grounding

  • [2019 IJCAI] Localizing Unseen Activities in Video via Image Query, [paper], [bibtex].

Sign Language Localization