Course assignments for LS621 (IIT Guwahati, Jan-May 2018)
Use DTW for word recognition (limited dictionary of 10 words) and demonstrate construction of a composite reference template
Demonstrate digit sequence recognition in Marathi using the HTK toolkit
Implement the Wavenet architecture for Text-To-Speech
- Target was to generate speech in French using the SIWIS French Speech Synthesis Database as a training source
- Able to show a working model which trains on limited memory and compute using multiprocessing to generate training batches-on the-fly
- Could not produce coherent test samples (generated audio) as the model was not run for sufficient time