Can't Find the Training File #1

manashmandal · 2018-06-28T09:14:50Z

How did you train your model? Could you please provide the training file as well?

On Data_Process.ipynb file when calculating m, v and s you used mfcc parameters like this,

audio = mfcc(read_audio_from_filename(file, 16000),samplerate=16000,winlen=0.025,winstep=0.01,numcep=39,
                 nfilt=40)

But when I am running other cells on my data,

    inputs = convert_wav_mfcc(wav_path, 16000)
    normalize_inputs = (inputs - m)/s

This throws an exception that shape doesn't match, so I changed the function convert_wav_mfcc to this

samplerate = 16000
winlen = 0.025
winstep = 0.01
numcep = 39
nfilt = 40

def convert_wav_mfcc(file, fs=16000):
    """Turn raw audio data into MFCC with sample rate=fs."""
    inputs = mfcc(read_audio_from_filename(file, fs), samplerate=fs, winlen=winlen, winstep=winstep, numcep=numcep, nfilt=nfilt)
    return inputs

Now everything works fine.

The text was updated successfully, but these errors were encountered:

chiachunfu · 2018-06-28T14:37:20Z

@manashmndl if you are looking for the lstm model, you could find the training script, lstm_ctc.py in the repo. For the wavenet model, I'm using a pretrained model that I got from here.

manashmandal · 2018-06-29T06:09:31Z

@chiachunfu thanks for your answer, but which MFCC implementation did you follow? The librosa one or the other library?

chiachunfu · 2018-07-02T14:53:04Z

@manashmndl I used the mfcc function implemented in python_speech_features module.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't Find the Training File #1

Can't Find the Training File #1

manashmandal commented Jun 28, 2018

chiachunfu commented Jun 28, 2018

manashmandal commented Jun 29, 2018

chiachunfu commented Jul 2, 2018

Can't Find the Training File #1

Can't Find the Training File #1

Comments

manashmandal commented Jun 28, 2018

chiachunfu commented Jun 28, 2018

manashmandal commented Jun 29, 2018

chiachunfu commented Jul 2, 2018