Skip to content

ameyagodbole/LS621-Spoken-Language-Systems

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LS621-Spoken-Language-Systems

Course assignments for LS621 (IIT Guwahati, Jan-May 2018)

Assignment 1: Dynamic Time Warping (DTW)

Use DTW for word recognition (limited dictionary of 10 words) and demonstrate construction of a composite reference template

Midsem Project: HMM-based digit sequence recognition

Demonstrate digit sequence recognition in Marathi using the HTK toolkit

Endsem Project: Wavenet TTS

Implement the Wavenet architecture for Text-To-Speech

  • Target was to generate speech in French using the SIWIS French Speech Synthesis Database as a training source
  • Able to show a working model which trains on limited memory and compute using multiprocessing to generate training batches-on the-fly
  • Could not produce coherent test samples (generated audio) as the model was not run for sufficient time

Releases

No releases published

Packages

No packages published

Languages