YouTube-NT Dataset

This repo contains python scripts can be used to prepare high quality video clips.

Prepare your videos

In our paper: Hierarchical Autoregressive Modeling for Neural Video Compression, YouTube-NT contains videos downloaded from the following YouTube Channels:

As training a learned compression model usually requires high quality training data, we only download high quality videos by executing following command (Please install youtube-dl first):

youtube-dl -f "bestvideo[ext=webm][height>=1080]" -ciw -o "%(id)s.%(ext)s" -v <url-of-channel>

REMARK1: Here, I recommend to download "webm" format videos as they are usually encoded with Google's VP9 codec, which usually has good quality and "ffmpeg" supports VP9 decode. Note that if you use "ext=mp4" you may download video with AV1 codec (really new), which may not be supported by some "ffmpeg" binary/package distribution (Of course, you can compile your own "ffmpeg" if you want). In addition, most mp4 files downloaded from youtube are encoded with H.264 codec, which usually has worse quality than VP9. For more details, please refer to youtube-dl.

REMARK2: Of course, you can prepare your own videos.

REMARK3: we also provide a list of youtube video ids, so that you can directly download via youtube id (ID List)

Use the scripts

Install opencv-python ffmpeg first.
cd PySceneDetect; python setup.py install # as I use a modified version of PySceneDetect
Then we need to customize some values in config.py to detect scene cut and then extract each scene clip:

start = '15s' # detection start time
duration = '120s' # duration of detection
input_dir = 'videos' # input path, your video folder
output_dir = 'splitted' # output path, each subfolder will contain a scene clip (a sequence of .png files)
grayscale_filter = False # whether use grayscale detection (will slow down the detection speed, use it only when you have lots of grayscale video)
gray_threshold = 35 # for scene cut detection, simply use this default value
threshold = 26 # for scene cut detection, simply use this default value
nframes = 10 # number of frames in each clip. Each scene clip may have "nframes//2" ~ "nframes" frames.
ffmpeg_override = f'-vf scale=iw*2/3:ih*2/3 -vsync 0 -vframes {nframes}' # ffmpeg command. Here we need to downsample the original video to remove compression artifact
n_process = 64 # number of process

Execute python main.py and wait.. (it depends on how many videos you have)
The scripts will generate a file "badlist.csv", which contains the names of subfolder containing bad scene clips. Feel free to remove all the subfolders showed in this file.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
PySceneDetect		PySceneDetect
splitted		splitted
videos		videos
.gitignore		.gitignore
LICENSE		LICENSE
badlist.csv		badlist.csv
config.py		config.py
main.py		main.py
readme.md		readme.md
youtube_idlist.csv		youtube_idlist.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YouTube-NT Dataset

Prepare your videos

Use the scripts

About

Languages

License

buggyyang/Youtube-NT

Folders and files

Latest commit

History

Repository files navigation

YouTube-NT Dataset

Prepare your videos

Use the scripts

About

Resources

License

Stars

Watchers

Forks

Languages