Preprocessing of Dataset to feed into LSTM #12

divyeshrajpura4114 · 2019-05-27T12:18:07Z

Can you please explain procedure or different steps to pre-process data before feed to LSTM. I am working on paper by Zhuo Chen on "Speaker-Independent Speech Separation With Deep
Attractor Network", but I am not able to create batches because each audio file have different no of frames. So how do you handle variable length input to LSTM? I know techniques like padding sequence, but I dont think that would be effective because difference of no of frames is much large.

aishoot · 2019-05-28T08:15:08Z

Hi, you can read those two files tfrecords_io.py and run_lstm.py.

divyeshrajpura4114 · 2019-05-28T09:22:34Z

Ok. I will look into that. Thank You...

nagasaibharath · 2019-06-13T12:45:22Z

If we are able to create our own mixed wav files, then is there any need for getting SNR Signals of the Audio files?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preprocessing of Dataset to feed into LSTM #12

Preprocessing of Dataset to feed into LSTM #12

divyeshrajpura4114 commented May 27, 2019

aishoot commented May 28, 2019

divyeshrajpura4114 commented May 28, 2019

nagasaibharath commented Jun 13, 2019 •

edited

Preprocessing of Dataset to feed into LSTM #12

Preprocessing of Dataset to feed into LSTM #12

Comments

divyeshrajpura4114 commented May 27, 2019

aishoot commented May 28, 2019

divyeshrajpura4114 commented May 28, 2019

nagasaibharath commented Jun 13, 2019 • edited

nagasaibharath commented Jun 13, 2019 •

edited