ClipShots is the first large-scale dataset for shot boundary detection, collected from YouTube and Weibo and covering more than 20 categories, including sports, TV shows, animals, etc.

Tangshitao/ClipShots

ClipShots dataset

This repository contains the ClipShots dataset introduced in our paper. The videos are available here or on Google Cloud. Please rename the directory ClipShots to data.tar.gz to use the data.

Introduction

ClipShots is the first large-scale dataset for shot boundary detection, collected from YouTube and Weibo and covering more than 20 categories, including sports, TV shows, animals, etc. In contrast to previous shot boundary detection datasets, e.g. TRECVID and RAI, which consist only of documentaries or talk shows where the frames are relatively static, we construct a database of short videos from YouTube and Weibo. Many of these short videos are home-made and pose additional challenges, e.g. hand-held vibrations and large occlusions. The videos are of various types, including movie spotlights, competition highlights, family videos recorded on mobile phones, etc. Each video is 1-20 minutes long. The gradual transitions in our database include dissolve, fade-in/fade-out, and sliding in/sliding out.

Description

The database contains 3 sets of data: the training set, the testing set and the 'only_gradual' set. The training set and the 'only_gradual' set are for training, and the testing set is for evaluation. In the 'only_gradual' set, we annotate only the gradual transitions, because the training set contains too few of them. In video_lists, there are 3 files that list the video names of the three sets respectively. The evaluation script is in tools.
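
As a rough illustration of how the files in video_lists might be consumed, the sketch below reads one list file and checks two splits for shared video names. It assumes each list file is plain text with one video name per line, which is not stated above, and the helper names `load_video_list` and `shared_videos` are hypothetical and not part of this repository.

```python
def load_video_list(list_path):
    """Return the video names from a list file.

    Assumption: one video name per line; blank lines are skipped.
    """
    with open(list_path, encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


def shared_videos(names_a, names_b):
    """Return the sorted video names that appear in both lists."""
    return sorted(set(names_a) & set(names_b))
```

Calling `shared_videos` on the training and testing lists is a quick way to confirm that no evaluation video is also used for training. Pass the actual paths of the files found in video_lists.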

Main results

We list some strong baselines here.

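| Methods | Cut Precision | Cut Recall | Cut F1-score | Gradual Precision | Gradual Recall | Gradual F1-score |
| --- | --- | --- | --- | --- | --- | --- |
| deepSBD (Alexnet-like, origin) | 0.731 | 0.921 | 0.815 | 0.837 | 0.386 | 0.528 |
| deepSBD (ResNet-18) | 0.765 | 0.910 | 0.831 | 0.770 | 0.622 | 0.688 |
| DSM | 0.776 | 0.934 | 0.848 | 0.840 | 0.904 | 0.870 |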
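
The F1-scores in the table are consistent with the usual definition of F1 as the harmonic mean of precision and recall. The sketch below recomputes them from the precision and recall values listed above. It is only a sanity check and not the evaluation script in tools. Because the inputs are already rounded to three decimals, a recomputed value can differ from the reported one in the last digit.

```python
# (method, transition type) -> (precision, recall, reported F1), copied from the table above.
REPORTED = {
    ("deepSBD (Alexnet-like, origin)", "cut"): (0.731, 0.921, 0.815),
    ("deepSBD (Alexnet-like, origin)", "gradual"): (0.837, 0.386, 0.528),
    ("deepSBD (ResNet-18)", "cut"): (0.765, 0.910, 0.831),
    ("deepSBD (ResNet-18)", "gradual"): (0.770, 0.622, 0.688),
    ("DSM", "cut"): (0.776, 0.934, 0.848),
    ("DSM", "gradual"): (0.840, 0.904, 0.870),
}


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    for (method, kind), (p, r, reported) in REPORTED.items():
        print(f"{method:32s} {kind:8s} F1={f1_score(p, r):.3f} (reported {reported:.3f})")
```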

Please refer to this paper for deepSBD and to our paper for DSM. We also release the baseline code here. Please email shitaot@gmail.com if you have any questions.
