Final project for the course Speech and Speaker Recognition at KTH 2018.
In this project we evaluated the effect of combining Dense, Convolutional and Recurrent layers in a single network for Phoneme classification.
We developed a network very similar to the one described in this paper.
Our paper can be found here.