Installation

For install all packages run:

git clone https://github.com/Ryzagi/zero_shot_kws
cd zero_shot_kws
pip install .

Zero-shot KWS

Keyword Spotting (KWS) is an essential component of voice-assist technologies, where the user speaks a predefined keyword to wake up a system before speaking a complete command or query to the device. The project aims to create a zero-shot keyword spotter and compare it with a non-zeroshot (static) keyword spotter trained to recognize a specific set of keywords.
Zero-shot learning. Analogies can be drawn between KWS with unseen words and zero-shot learning for detecting new classes, such as words or phrases. KWS with unseen words is essentially a zero-shot learning problem, where attributes (letters) are shared between classes (words) so that the knowledge learned from seen classes is transfered to unseen ones

1. Train the baseline model on labeled Google Commands data.
1. Design and development of a zero-shot KWS model that will detect any word (/phrase):
(a) Create and label a training and testing dataset from ASR data. Come up with ways to create negative samples.
(b) Create a basic pipeline on graphemes, which will match the encoded sequence of graphemes and the encoded piece of audio (waves).
(c) Create a pipeline on phonemes, i.e. add a phonemization stage before text encoder.
(d) Create a pipeline on ASR features, instead of text encoder.
(e) Compare the results 2.1 - 2.3 on the dataset from point 2.0.
(f) Present the results of comparing models in the form of precision-recall curves and metrics (accuracy, f1, etc.)
1. Compare the best zero-shot model from point 2 with the model from point 1
(a) Prepare Google Commands in the same format as the datasets for point 2
(b) Compare the model from point 1 and point 2
(c) Present the results of the model comparison in the form of precision-recall curves and metrics (accuracy, f1, etc.)

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
zerospot		zerospot
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py