
question, performance not match with the paper and feature generation #6

Closed
meixitu opened this issue Jan 4, 2018 · 2 comments

meixitu commented Jan 4, 2018

Hi,
Thanks for your wonderful work; it has really helped me a lot.
I have several questions about this project:
1. I ran the code with your train_commands.txt and found the performance is slightly worse than the results in Table 7 of the paper: for the small DS-CNN model, the highest validation accuracy is 92.98% from the code versus 93.6% in the paper. Did you get the Table 7 performance with the same code settings?

2. In train.py, the test dataset is evaluated only after training is done, so it does not use the checkpoint where the validation accuracy is highest. Did you calculate the test accuracy in the paper with the same method?

3. Did you compare the performance of LFBE vs. MFCC? Google's paper uses LFBE, but MFCC allows a smaller feature set, and you use only 10 MFCC features. If we use more MFCC features, can we get higher accuracy?

4. Do you consider feature normalization to handle the different signal power ranges?

5. If the signal power of a frame is zero, how do you calculate log(LFBE)? I can't find it in the code. In general one uses log(LFBE + delta), where delta is a small constant; what delta value do you use?

6. Many papers use window_size_ms = 25 or 30 ms with window_stride_ms = 10 ms, but for DS-CNN you use window_size_ms = 40 ms and window_stride_ms = 20 ms. I understand that a larger window stride reduces the number of operations, but I don't see why a 40 ms window size is used: at a 16 kHz sample rate it requires a 1024-point FFT, which is power-hungry.

7. Running the training takes almost 4 hours on a GeForce 1080 Ti GPU with an E5-2650 CPU, but I saw in another of your replies that you need only 1 hour. Is there any way to speed it up? I found that feature generation takes most of the time.

Thanks
Jinhong

meixitu changed the title from "performance not match with the paper" to "question, performance not match with the paper and feature generation" on Jan 4, 2018
navsuda (Collaborator) commented Jan 8, 2018

Hi @zhangjinhong17

  1. Such a difference is expected because of differences in weight initialization.
  2. The Table 7 accuracies are obtained from the checkpoint with the highest accuracy on the validation set (i.e., the last saved checkpoint). Use test.py to evaluate the accuracy of your checkpoints.
  3. We did not compare LFBE vs. MFCC, and we did not observe higher performance (i.e., accuracy) using more MFCC features. It may depend on the number and type of output words you are classifying; for example, you may need a higher resolution (i.e., more features) to differentiate "light" vs. "flight".
  4. If you are asking about batch normalization to normalize the features across different inputs, it seems to work fine with this dataset. It would be interesting to see how well the batch norm parameters generalize to another dataset.
  5. MFCC computation is part of TensorFlow, where a delta of 1e-12 is used; see https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/kernels/mfcc.cc#L60 for details. A small sketch of this flooring is shown after this list.
  6. That's a good point: 40 ms gives 640 samples, which would have to be padded to 1024 samples to perform the FFT. However, the total number of operations in the neural network is typically much higher than the number of computations in the FFT, so it does not matter much; it might start to matter when the network is squeezed down to <1 MOps per inference. In our case, the 40 ms window size was the result of the initial hyperparameter search (the arithmetic is sketched after this list).
  7. Training time is a function of the network size; from what we have seen, for small networks you should get good enough accuracy within the first hour, with only incremental accuracy improvement (~1-2%) after that.
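
As a minimal sketch of the flooring mentioned in point 5 (illustrative NumPy only; the actual computation happens inside TensorFlow's C++ MFCC kernel linked above, and the function name here is made up):

```python
import numpy as np

# Small floor applied to the filterbank energies before taking the log,
# matching the 1e-12 delta referenced in the linked mfcc.cc.
FILTERBANK_FLOOR = 1e-12

def log_filterbank_energies(mel_energies):
    """Log of mel filterbank energies, flooring zeros so the log stays finite."""
    mel_energies = np.asarray(mel_energies, dtype=np.float64)
    return np.log(np.maximum(mel_energies, FILTERBANK_FLOOR))

# An all-zero (silent) frame maps to log(1e-12) ~= -27.6 instead of -inf.
print(log_filterbank_energies(np.zeros(40)))
```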
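
And the window-size arithmetic from point 6, as a quick check (assuming a power-of-two FFT, as in the question):

```python
import math

SAMPLE_RATE = 16000   # Hz
WINDOW_SIZE_MS = 40   # DS-CNN window size from train_commands.txt

samples_per_window = SAMPLE_RATE * WINDOW_SIZE_MS // 1000   # 640 samples
fft_size = 2 ** math.ceil(math.log2(samples_per_window))    # next power of two: 1024

print(samples_per_window, fft_size)  # 640 1024
```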

meixitu (Author) commented Jan 9, 2018

Hi @navsuda,

1. You are right. I ran the same code twice, and the results were slightly different.

Thanks for your other replies.

Thanks
Jinhong

navsuda closed this as completed on Jan 18, 2018