Yashish92/Speech-Inversion-TCN

Speech_Inversion_TCN

This repository holds the baseline models used in the paper "THE SECRET SOURCE : INCORPORATING SOURCE FEATURES TO IMPROVE ACOUSTIC-TO-ARTICULATORY SPEECH INVERSION". The current baseline models were trained on the XRMB articulatory dataset. The same model architectures were used with the HPRC dataset for the experiments in the paper.

This repository is still under construction.

Baseline models

BiGRNN-MFCC : Model trained with MFCCs as inputs and 6 tract variables (TVs) as target outputs

BiGRNN-SF-MFCC : Model trained with MFCCs as inputs and 6 TVs + source features as target outputs

CNN-BiGRNN-Mspec : Model trained with melspectrograms as inputs and 6 TVs as target outputs

CNN-BiGRNN-SF-Mspec : Model trained with melspectrograms as inputs and 6 TVs + source features as target outputs

CNN-BLSTM-Mspec : Model trained with melspectrograms as inputs and 6 TVs as target outputs

CNN-BLSTM-SF-Mspec : Model trained with melspectrograms as inputs and 6 TVs + source features as target outputs
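
The model names above follow a regular pattern: architecture, an optional SF token for source-feature targets, and the input feature type. The sketch below shows one way to turn a name into a small configuration record. It is illustrative only and is not code from this repository: `BaselineSpec`, `parse_baseline_name`, and `output_dim` are hypothetical helpers, and `n_source_features` is left as a caller-supplied assumption because the number of source features is not stated here.

```python
from dataclasses import dataclass

N_TVS = 6  # number of TV targets stated in the model list


@dataclass(frozen=True)
class BaselineSpec:
    """Hypothetical description of one baseline, derived from its name."""
    name: str
    architecture: str               # e.g. "BiGRNN", "CNN-BiGRNN", "CNN-BLSTM"
    input_features: str             # "MFCC" or "Mspec" (melspectrogram)
    predicts_source_features: bool  # True for the "SF" variants


def parse_baseline_name(name: str) -> BaselineSpec:
    """Split a baseline name into architecture, SF flag, and input feature."""
    parts = name.split("-")
    input_features = parts[-1]      # the last token names the input type
    body = parts[:-1]
    has_sf = "SF" in body           # "SF" marks source features in the targets
    architecture = "-".join(p for p in body if p != "SF")
    return BaselineSpec(name, architecture, input_features, has_sf)


def output_dim(spec: BaselineSpec, n_source_features: int) -> int:
    """Target dimension: 6 TVs, plus source features for the SF variants.

    n_source_features is an assumption supplied by the caller.
    """
    extra = n_source_features if spec.predicts_source_features else 0
    return N_TVS + extra


BASELINES = [
    "BiGRNN-MFCC",
    "BiGRNN-SF-MFCC",
    "CNN-BiGRNN-Mspec",
    "CNN-BiGRNN-SF-Mspec",
    "CNN-BLSTM-Mspec",
    "CNN-BLSTM-SF-Mspec",
]

specs = [parse_baseline_name(n) for n in BASELINES]

if __name__ == "__main__":
    for spec in specs:
        print(spec.name, spec.architecture, spec.input_features,
              spec.predicts_source_features)
```

Keeping the source-feature count as a parameter avoids hard-coding a value that this README does not give; pass whatever number matches the training targets you actually use.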
