awesome grounding: A curated list of research papers in visual grounding
Updated Apr 9, 2023
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Official TensorFlow implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
[CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
Official PyTorch repository for CG-DETR: "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
Official implementation of the paper "Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos"
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Losses"
TensorFlow reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"
Awesome papers & datasets specifically focused on long-term videos.
"Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022.
Official PyTorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos"
Code for the paper "Multimodal Dialogue State Tracking" (NAACL 2022)
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
Paper list on Video Moment Retrieval (VMR), also known as Natural Language Video Localization (NLVL) or Temporal Sentence Grounding in Videos (TSGV)
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
Official PyTorch implementation of the Position-aware Location Regression Network (PLRN), presented in the paper "Position-aware Location Regression Network for Temporal Video Grounding"
[arXiv 23] PyTorch code for "Overcoming Weak Visual-Textual Alignment for Video Moment Retrieval"