Skip to content

waybarrios/guidance-based-video-grounding

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 

Repository files navigation

arXiv PWC

Share to Community

Guidance Based Video Grounding.

The official implementation of the paper: "Localizing Moments in Long Video Via Multimodal Guidance". In this repository, we provide the predicted scores from the Guidance Model using MAD Dataset.

News

07/14/2023: "Localizing Moments in Long Video Via Multimodal Guidance" was accepeted at ICCV 2023.

Citation

If you find this implementation useful in your research, please use the following BibTeX entry for citation:

@article{Barrios2023LocalizingMI,
  title={Localizing Moments in Long Video Via Multimodal Guidance},
  author={Wayner Barrios and Mattia Soldan and Fabian Caba Heilbron and Alberto M. Ceballos-Arroyo and Bernard Ghanem},
  journal={ArXiv},
  year={2023},
  volume={abs/2302.13372}
}

Code

We are going go to release soon the basecode, pretrained models and audio features in MAD Dataset.

Prediction Zoo.

The provided predictions correspond to the scores generated by the Guidance model using sliding windows of 64 frames and 128 frames in length. The predictions are stored in a pickle object with the following structure:

In [1]: import pickle
In [2]: with open("guidance_scores_MAD_test_128.pkl",'rb') as f:
   ...:     scores = pickle.load(f)
In [3]: len(scores)
Out[3]: 72044
In [4]: scores[0].keys()
Out[4]: dict_keys(['qid', 'vid', 'windows', 'score'])
{   'qid': '0',
    'score': array([1.48404761e-05, 1.40372722e-05, 1.46572347e-05, 1.28814381e-05,
       1.34291167e-05, 1.32850864e-05, 1.61252574e-05, 6.24697859e-05,
       4.70118430e-05, 1.63803907e-05, 2.77301951e-05, 2.59740209e-05,
       9.86061990e-01, 4.11081433e-01, 1.71889886e-02, 1.37453452e-01,
       1.75393507e-05, 1.92647931e-05, 5.38236709e-05, 6.90551009e-04,
       7.63237834e-01, 9.73204970e-02, 1.73201097e-05, 2.48163269e-05,
       5.99260893e-05, 1.84824003e-05, 2.14560350e-05, 1.04043145e-04,
       5.24206553e-05, 1.88337926e-05, 1.62523775e-05, 1.23760619e-05,
       1.15747998e-05, 1.85713252e-05, 3.93810224e-05, 4.38277610e-04,
       4.63226315e-05, 2.76185543e-04, 6.71112502e-05, 2.05889755e-05,
       5.27229131e-05, 4.56629896e-05, 2.62997986e-04, 1.23860036e-05,
       1.19574897e-05, 1.27713274e-05, 1.34036281e-05, 1.49246125e-05,
       1.66437039e-05, 1.32685755e-05, 1.36442995e-05, 1.39407657e-05,
       9.44265649e-02, 5.19266985e-02, 3.09179362e-04, 1.66565824e-05,
       1.52278981e-05, 1.34415832e-05, 1.16731699e-05, 1.19617898e-05,
       1.34421471e-05, 1.35606424e-05, 1.40685788e-05, 1.44712585e-05,
       1.49164434e-05, 1.32006107e-05, 1.23232739e-05, 1.22480678e-05,
       1.36934423e-05, 8.42598165e-05, 1.90059054e-05, 1.52820303e-05,
       1.25335091e-05, 1.30556955e-05, 1.18760063e-05, 1.14885261e-05,
       1.17362497e-05, 1.12321404e-05, 1.24243248e-04, 1.45946506e-05,
       4.47804232e-05, 1.39249141e-05, 1.34848015e-05, 3.25621368e-05,
       1.44184843e-01, 2.68866897e-05, 1.92906227e-05, 1.76019021e-05,
       1.58657276e-05, 1.28230713e-05, 1.28012252e-05, 1.29981381e-05,
       1.67807830e-05, 1.70492331e-05, 1.40562279e-05, 1.61650114e-05,
       1.47591518e-05, 1.63778402e-02, 1.42061428e-04, 6.93475548e-03,
       6.02264590e-05, 8.72147648e-05, 9.83794928e-01, 9.91553962e-01,
       9.63991106e-01, 8.97689939e-01, 1.28758256e-04, 2.88744595e-05,
       1.70378244e-05, 2.29878224e-05, 2.43768354e-05, 1.59022475e-05,
       1.30911794e-05, 1.81753130e-05, 2.05728411e-05, 1.25869919e-05,
       1.25580364e-05, 1.16062802e-05, 1.37536981e-05, 1.34730390e-05,
       1.40373795e-05, 1.33059066e-05, 1.30285189e-05, 1.37811385e-05,
       2.23064744e-05, 1.44057722e-05, 1.42116378e-05, 1.93661017e-05,
       1.58555758e-05, 1.43071402e-05, 1.38224150e-05, 1.28803194e-05,
       1.20950817e-05, 1.41009232e-05, 1.45958602e-05, 1.23285527e-05,
       1.38767664e-05, 1.59005958e-05, 1.49218240e-05, 1.21883040e-05,
       1.24096860e-05, 1.63976423e-04, 3.71323113e-05, 1.49581110e-05,
       1.28865731e-05, 8.20189889e-05, 1.94104978e-05, 1.45575204e-05,
       1.19119395e-05, 1.17359577e-05, 1.33997301e-05, 1.31552797e-05,
       1.29547625e-05, 1.46081702e-05, 1.37864763e-05, 2.89076870e-05,
       2.40834688e-05, 2.44160365e-05, 3.74382762e-05, 4.72434871e-02,
       1.53820711e-05, 1.25494762e-05, 1.16858791e-05, 1.33582507e-05,
       6.86281201e-05, 1.72452001e-05, 1.32617952e-05, 1.24350836e-05,
       1.32563446e-05, 1.50281312e-05, 2.07685662e-05, 3.12883203e-05,
       5.31642836e-05, 7.05183193e-05, 1.51949525e-05, 1.41901855e-05,
       1.51822069e-05, 3.32951342e-04, 8.94680124e-05, 1.65749607e-05,
       2.18829446e-05, 2.16037024e-05, 1.89978218e-05, 4.97834710e-03,
       2.03153506e-01, 1.54585496e-03, 1.23195614e-05, 1.28703259e-05,
       1.51874347e-05, 1.30843009e-05, 1.32952518e-05, 1.83968314e-05,
       3.42841486e-05, 9.24622072e-05, 1.33280428e-05, 1.38418063e-05,
       1.52235261e-05, 1.41796754e-05, 1.46450093e-05, 2.20195379e-05,
       1.83107302e-04, 1.82420099e-05, 1.50840988e-05, 1.33859876e-05,
       1.51073200e-05, 1.47391929e-05, 1.49910848e-05, 1.53916826e-05,
       1.31657725e-05, 1.38312898e-05, 1.90024621e-05, 1.58155744e-05,
       1.31786610e-05, 1.57141967e-05, 1.65828824e-05, 1.46924167e-05,
       1.38433634e-05, 5.21887268e-05, 2.85502132e-02, 2.30753481e-01,
       7.06195598e-04, 1.50714346e-04, 1.27303065e-03, 1.33986650e-02,
       7.64285505e-04, 2.07327234e-04, 6.83149046e-05, 3.26294066e-05,
       3.00217052e-05, 3.59058060e-04, 1.75943842e-05, 4.50351909e-05,
       6.54372343e-05, 7.06970895e-05, 3.67312983e-04, 1.05719395e-01,
       4.43235294e-05, 2.82063011e-05, 7.51458792e-05, 1.61291231e-04,
       4.26617444e-05, 8.98458238e-05, 5.37320266e-05, 7.81280905e-05,
       4.74652685e-02, 6.73964678e-04, 7.80265400e-05, 2.98924297e-05,
       4.71418061e-05, 9.99735785e-05, 5.41929447e-04, 8.76590490e-01,
       7.32870936e-01, 9.47873652e-01, 9.83479261e-01, 9.41197515e-01,
       3.02340268e-05, 5.52863061e-01, 4.90591303e-02, 5.52392844e-03,
       1.66527767e-04, 6.01128559e-05, 2.75078182e-05, 5.36037696e-05,
       2.72706511e-05, 5.20218709e-05, 1.74067172e-04, 9.59624112e-01,
       9.92105484e-01, 6.41801059e-01, 7.50956178e-01, 1.66324535e-05,
       1.36247700e-05, 1.38954510e-05, 1.32978639e-05, 2.76602568e-05,
       8.64359558e-01, 2.82314628e-01, 6.86250278e-04, 1.61339794e-05,
       1.76240802e-01, 6.14342950e-02, 1.79430062e-05, 1.85770459e-05,
       2.49132900e-05, 4.90641105e-05, 1.38329369e-05, 1.35371911e-05,
       1.19879533e-05, 1.28572465e-05, 1.49452917e-05, 1.34064794e-05,
       1.20641280e-05, 1.38642654e-05, 1.28597740e-05, 1.21135636e-05,
       1.19547185e-05, 1.27106450e-05, 1.24800990e-05, 1.45651029e-05,
       1.51306494e-05, 1.31757206e-05, 1.44625528e-05, 2.93072371e-05,
       1.55961770e-05, 1.38226005e-05, 2.85501122e-01, 9.54893649e-01,
       4.26807284e-01, 7.88133383e-01, 1.15605462e-05, 1.27675758e-05,
       1.74503912e-05, 1.22338257e-04, 4.07951375e-05, 6.67655331e-05,
       2.63322181e-05, 6.43799603e-01, 9.40359533e-01, 8.85976017e-01,
       4.58170444e-01, 1.68637175e-03, 5.94505800e-05, 9.05500948e-01,
       3.18567127e-01, 4.67336411e-03, 2.84927974e-05, 3.81192891e-03,
       4.18508105e-04, 6.88799983e-03, 9.18629944e-01, 8.45510900e-01,
       1.88187569e-01, 1.15205767e-02, 6.14926934e-01, 9.16110933e-01,
       3.21912378e-01, 9.68408361e-02, 2.36877706e-03, 3.30457231e-04,
       9.32341874e-01, 6.69624686e-01, 3.61131132e-02, 4.71764088e-01,
       3.23702669e-04, 5.40765934e-04, 2.96235172e-04, 1.00755557e-01,
       2.59187482e-02, 9.91479377e-04, 5.00017107e-02, 9.33302939e-03,
       8.73835742e-01, 9.06303883e-01, 1.98892485e-02, 2.06603622e-03,
       2.67300452e-03, 1.63171062e-05, 4.14947972e-05, 2.11949199e-02,
       5.66720143e-02, 6.37245998e-02, 3.02139521e-01, 4.86139301e-03,
       6.51149167e-05, 8.24632589e-05, 2.42551632e-05, 2.16892213e-01,
       9.93161321e-01, 9.07774687e-01, 9.85157251e-01, 7.91489899e-01,
       6.24064269e-05, 2.82448274e-03, 6.10993884e-05, 4.63459146e-05,
       6.72110255e-05, 2.53440558e-05, 2.50527592e-05, 4.85404918e-04,
       7.80891351e-05, 4.56315975e-05, 1.90765320e-04, 8.94685328e-01,
       9.85134244e-01, 9.36044097e-01, 1.42211165e-05, 1.49489415e-05,
       1.69001578e-05, 1.66201044e-05, 2.41175085e-01, 5.41068694e-05,
       1.77346919e-05, 3.90491296e-05, 2.48894852e-04, 1.45345357e-05,
       1.64555768e-05, 1.53538731e-05, 1.38164451e-05, 1.68559291e-05,
       3.19991705e-05, 2.60154466e-05, 1.41664159e-05, 1.22337908e-04,
       4.30386774e-02, 3.52067378e-04, 2.77736799e-05, 1.43605203e-05,
       1.33721569e-05, 1.43800498e-05, 1.23751524e-05, 2.31819286e-05,
       9.83208010e-05, 2.08199883e-04, 3.14763274e-05, 3.47468827e-04,
       1.10434856e-04, 3.18150487e-05, 1.72609471e-05, 2.70375167e-05,
       1.67231119e-05, 1.80254483e-05, 2.09855771e-05, 1.66565824e-05,
       1.64901703e-05, 3.01825115e-04, 7.29017615e-01, 1.12410297e-03,
       6.18876831e-04, 2.08720026e-04, 2.29539564e-05, 1.47635437e-05,
       4.10786743e-05, 2.57481169e-02, 8.77772836e-05, 4.92439649e-05,
       9.44633852e-04, 2.61720526e-03, 8.41950595e-01, 8.63339067e-01,
       5.76047751e-05, 8.71496499e-01, 9.07008648e-01, 8.54207218e-01,
       3.62060557e-04, 7.98364286e-04, 7.50755966e-02, 3.81207588e-04,
       6.62766863e-03, 1.50808028e-03, 5.67528963e-01, 7.69607246e-01,
       4.62092081e-04, 1.82087897e-04, 9.24605787e-01, 9.67480242e-01,
       3.22210602e-03, 3.38318609e-02, 7.42516349e-05, 2.80661490e-02,
       7.69108906e-03, 8.99414954e-05, 5.23393810e-01, 7.17914104e-01,
       5.11704478e-04, 2.06177612e-03, 7.79069304e-01, 1.28432157e-05,
       1.51723981e-01, 7.02154310e-03, 9.71324384e-01, 8.30839634e-01,
       6.24295863e-05, 1.97836489e-05, 1.80826428e-05, 1.67380622e-05,
       1.57646009e-05, 5.96713580e-05, 7.05929342e-05, 2.16401986e-05,
       1.69063496e-05, 1.36657072e-05, 1.44965925e-05, 2.01106413e-05,
       1.66287136e-05, 1.51022632e-05, 1.20727018e-05, 1.36815515e-05,
       1.57434170e-05, 3.38077080e-03, 1.93943546e-04, 1.50704973e-05,
       1.36058252e-05, 1.23554828e-05, 1.20090635e-05, 1.20484674e-05,
       1.15330831e-05, 1.24158278e-05, 1.21374187e-05, 1.20495934e-05,
       1.25650204e-05, 1.16137307e-05, 1.18198168e-05, 1.15763123e-05,
       1.24373146e-05, 1.25643137e-05, 1.48531772e-05, 1.28844113e-05,
       1.19790957e-05, 1.42352001e-05, 2.61451223e-05, 1.16819347e-05],
      dtype=float32),
    'vid': '3001_21_JUMP_STREET',
    'windows': array([[    0,   128],
       [   64,   192],
       [  128,   256],
       ...,
       [32576, 32704],
       [32640, 32768],
       [32704, 32832]])}

Download

The predictions are available on HuggingFace repo.

Contact

email: way.gr@dartmouth.edu or waybarrios@gmail.com

About

[ICCV 2023] The official PyTorch implementation of the paper: "Localizing Moments in Long Video Via Multimodal Guidance"

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published