Keyword Spotting Data Generator

In order to improve the flexibility of Honk and Honkling, we provide a program that constructs a dataset from youtube videos. Key idea is to decrease the search space by utilizing subtitles and extract target audio using PocketSphinx.

< Preparation >

Necessary python packages can be downloaded with pip -r install requirements.txt
ffmpeg and SoX must be available as well.
YouTube Data API - follow this instruction to obtain a new API key

< Usage >

python keyword_data_generator.py
	-a < youtube data v3 API key >
	-k < list of keywords to search >
	-s < number of samples to collect per keyword (default: 10) >
	-o < output path (default: "./generated_keyword_audios") >

example:

python keyword_data_generator.py -a $YOUTUBE_API_KEY -k google slack -s 20 -o ./generated

< Improvements >

filtering non-english videos
adjust ffmpeg command to handle different types of video : mov,mp4,m4a,3gp,3g2,mj2
dynamic handling of long videos (currently simple filter)
improve throughput by parallelizing the process

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Keyword Spotting Data Generator

< Preparation >

< Usage >

< Improvements >

Files

README.md

Latest commit

History

README.md

File metadata and controls

Keyword Spotting Data Generator

< Preparation >

< Usage >

< Improvements >