Gated recurrent units: Trigger word detection

We will build a speech dataset and implement a recurrent neural network (RNN) for trigger word detection. Trigger word detection is the technology that allows devices like Amazon Alexa, Google Home, Apple Siri, and Baidu DuerOS to wake up upon hearing a certain word.

Our trigger word will be "activate". The algorithm will trigger a chime when it detects someone saying "activate".

We will perform following tasks:

Synthesize and process audio recordings to create train and development datasets
Train a trigger word detection model and make predictions

I did this project in the Sequence Models course as part of the Deep Learning Specialization.

Dataset

We synthesize a training dataset by mixing three types of audio clips:

Positive examples of people saying the word "activate".
Negative examples of people saying random words other than "activate".
Clips of background noise in different environments

For testing predictions, we use actual recordings.

Recurrent neural network with GRU layers

We build a unidirectional RNN that will ingest the spectrogram of an audio clip and output a signal when it detects the trigger word.

Example of detecting trigger words

Given the spectogram (the top figure), which tells us how much different frequencies are present in an audio clip at any moment in time, the model outputs the probability of the trigger word having been said at each time step (the bottom figure).

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
XY_dev		XY_dev
audio_examples		audio_examples
images		images
models		models
raw_data		raw_data
README.md		README.md
_config.yml		_config.yml
chime_output.wav		chime_output.wav
generateTestCases.py		generateTestCases.py
gru_trigger_word_detection.ipynb		gru_trigger_word_detection.ipynb
insert_test.wav		insert_test.wav
td_utils.py		td_utils.py
test_utils.py		test_utils.py
tmp.wav		tmp.wav
train.py		train.py
train.wav		train.wav

jungsoh/gru-trigger-word-detection

Folders and files

Latest commit

History

Repository files navigation

Gated recurrent units: Trigger word detection

Dataset

Recurrent neural network with GRU layers

Example of detecting trigger words

About

Topics

Resources

Stars

Watchers

Forks

Languages