Simple implementation of DNN for phoneme recognition
- TIMIT
- TIMIT(Spikegram)
- python 3.6.8
- tensorflow==1.13.1
- numpy
- matplotlib
- librosa
python3 run_spikegram.py
But you don't have TIMIT(spikegram).
python3 run_mfcc.py
python3 run_spectrogram.py
python3 run_melspectrogram.py
Spikegram | MFCC | Spectrogram | Melspectrogram | ||
---|---|---|---|---|---|
Obstruent | Stops | 57.38 | 50.97 | 50.45 | 49.18 |
Obstruent | Affricate | 42.19 | 30.61 | 36.06 | 35.74 |
Obstruent | Fricative | 70.66 | 66.93 | 66.98 | 67.23 |
Sonorant | Glides | 55.14 | 56.59 | 55.98 | 55.43 |
Sonorant | Nasals | 59.15 | 62.36 | 60.39 | 60.09 |
Sonorant | Vowels | 53.05 | 53.38 | 52.44 | 53.70 |
Others | 92.24 | 91.96 | 91.79 | 91.94 |
Spikegram | MFCC | Spectrogram | Melspectrogram | |
---|---|---|---|---|
Obstruent | 65.76 | 61.06 | 61.23 | 61.06 |
Sonorant | 54.12 | 54.99 | 53.99 | 54.75 |
Others | 92.24 | 91.96 | 91.79 | 91.94 |
Spikegram | MFCC | Spectrogram | Melspectrogram | |
---|---|---|---|---|
Non mute | 57.49 | 56.77 | 56.11 | 56.61 |
mute | 92.24 | 91.96 | 91.79 | 91.94 |
Spikegram | MFCC | Spectrogram | Melspectrogram | |
---|---|---|---|---|
Total | 65.26 | 65.50 | 64.96 | 65.37 |
Han Seokhyeon