Skip to content
Permalink
Branch: master
Find file Copy path
Find file Copy path
2 contributors

Users who have contributed to this file

@jameslyons @groupw66
54 lines (36 sloc) 1.78 KB

Welcome to python_speech_features's documentation!

This library provides common speech features for ASR including MFCCs and filterbank energies. If you are not sure what MFCCs are, and would like to know more have a look at this MFCC tutorial: http://www.practicalcryptography.com/miscellaneous/machine-learning/guide-mel-frequency-cepstral-coefficients-mfccs/.

You will need numpy and scipy to run these files. The code for this project is available at https://github.com/jameslyons/python_speech_features .

Supported features:

To use MFCC features:

from python_speech_features import mfcc
from python_speech_features import logfbank
import scipy.io.wavfile as wav

(rate,sig) = wav.read("file.wav")
mfcc_feat = mfcc(sig,rate)
fbank_feat = logfbank(sig,rate)

print(fbank_feat[1:3,:])

From here you can write the features to a file etc.

Functions provided in python_speech_features module

.. automodule:: python_speech_features.base
    :members:


Functions provided in sigproc module

.. automodule:: python_speech_features.sigproc
    :members:


Indices and tables

You can’t perform that action at this time.