Skip to content

Neleac/SpaceTimeGPT

Repository files navigation

SpaceTimeGPT: A Spatiotemporal Video Captioning Model

Checkpoint

Hugging Face Model Card

Dataset

VaTeX

Evaluation

67.3 CIDEr on VATEX test set

About

video captioning model using TimeSformer, GPT 2, and cross attention

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published