parrot

Keyword spotting using audio from speech synthesis services and YouTube

Example Use

Generate synthesized examples for words

python word_to_audio.py [microsoft_subscription_key] [watson_subscription_username] [watson_subscription_password] --words [word1] [word2] ...

This creates 320 different synthesized examples of each word using Microsoft Cognitive Services, Google Cloud Text-to-Speech and IBM Watson Text to Speech.

Split a WAV file into multiple WAV files of equal length

python split_wav_file.py [sample.wav] [length_in_second]
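
For readers who want to see what fixed-length splitting involves, here is a minimal sketch using only Python's standard `wave` module. It is not the code in split_wav_file.py; the function name `split_wav`, the `prefix` parameter, and the output naming scheme are assumptions made for illustration. It also assumes that a final piece shorter than the requested length is kept rather than dropped.

```python
import wave


def split_wav(path, seconds, prefix="chunk"):
    """Split a WAV file into consecutive pieces of `seconds` length.

    Hypothetical helper for illustration; returns the list of files written.
    The last piece may be shorter than the others (assumption: it is kept).
    """
    outputs = []
    with wave.open(path, "rb") as src:
        params = src.getparams()
        frames_per_chunk = int(seconds * src.getframerate())
        if frames_per_chunk <= 0:
            raise ValueError("seconds must cover at least one audio frame")
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break  # reached the end of the source audio
            out_path = f"{prefix}_{index:03d}.wav"
            with wave.open(out_path, "wb") as dst:
                # Copy channel count, sample width and rate from the source;
                # the frame count in the header is corrected on close.
                dst.setparams(params)
                dst.writeframes(frames)
            outputs.append(out_path)
            index += 1
    return outputs
```

Calling `split_wav("sample.wav", 1.0)` would write `chunk_000.wav`, `chunk_001.wav`, and so on into the current directory.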

Add silence to the beginning and end of every WAV file in the given directories

python add_silence.py [length_of_silence_in_second] --directories [dir1] [dir2] ...
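
The padding step can be sketched for a single file with the standard `wave` module. This is an illustrative assumption about how such padding might be done, not the code in add_silence.py; the function name `add_silence` and its `out_path` parameter are hypothetical, and the sketch assumes uncompressed PCM audio.

```python
import wave


def add_silence(path, out_path, seconds):
    """Write a copy of `path` with `seconds` of silence before and after.

    Hypothetical helper for illustration; assumes uncompressed PCM audio.
    """
    with wave.open(path, "rb") as src:
        params = src.getparams()
        audio = src.readframes(src.getnframes())

    pad_frames = int(seconds * params.framerate)
    # 8-bit PCM WAV samples are unsigned, so silence is 0x80;
    # wider samples are signed, so silence is all zero bytes.
    silent_byte = b"\x80" if params.sampwidth == 1 else b"\x00"
    pad = silent_byte * (pad_frames * params.nchannels * params.sampwidth)

    with wave.open(out_path, "wb") as dst:
        # Keep the source format; the header frame count is fixed on close.
        dst.setparams(params)
        dst.writeframes(pad + audio + pad)
```

To cover whole directories, as the script does, one could loop over the `.wav` files in each directory and call this function on every file.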