-
Notifications
You must be signed in to change notification settings - Fork 155
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The real-time speech enhance is poor #29
Comments
Hi Jinxue, thanks for your attention and feedback. I guess the main reason for this difference is the lack of hidden states and cell states of LSTM. If you pursue frame-wise processing, there are two things:
Note that changing from |
After these changes should we retrain the model ? Because Im using the pretrained model. Thanks for help. |
Hi, this week I will release a cumulative pre-trained model. |
Thanks. Can you push the code snippet for real-time frame wise processing ? |
Hi, @Spelchure. Q: After these changes should we retrain the model ? Because Im using the pretrained model. Thanks for help. Q: Can you push the code snippet for real-time frame wise processing ? |
Thanks for model and advice. |
I can't use pretrained cumulative model after changing LSTM to LSTMCell for frame - wise processing. It has error : missing arguments and unexpected arguments in model. It is possible to use cumulative model only inferencing without training ? If it is possible where i am doing wrong ? (Im changing LSTM to LSTMCell in sequence_model.py) |
I tested LSTM and LSTMCell , it did not help. Then, I tired to input the hidden states and cell states of the previous step to the current step, which works well. |
Thanks for your advice. |
Hi, Jinxue Generally speaking, the changing from
You could try to confirm these trivial things and if you have any further questions please contact me. Of course, if the problem still exists, directly contributing your frame-wise code to this project on GitHub is very welcome. |
Hi SongJinXue, can you share the real-time code? I would be so appreciated it. Many thanks for considering my request. |
I hope the inference code for real-time is this |
|
Hello author, Could you please share the revised content of this part of streaming inference? Thank you very much |
The block length of 32 ms and the block shift of 8 ms for real-time speech enhancement is poor,but a single audio speech enhancement works well.
What causes it?
How can I improve ?
Noisy:
The block length of 32 ms and the block shift of 8 ms for real-time speech enhancement :
A single audio speech enhancement :
The text was updated successfully, but these errors were encountered: