Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not an issue, but I wanted to show you some impressive results... #21

Closed
sjscotti opened this issue May 30, 2021 · 2 comments
Closed

Not an issue, but I wanted to show you some impressive results... #21

sjscotti opened this issue May 30, 2021 · 2 comments

Comments

@sjscotti
Copy link

Greetings!
I have been helping restore an old (1980) recording of a recording of an interview with an elderly person relaying stories of the early history of the Baha'i Faith in the U.S.. I have surveyed and tried a number of machine learning methods to denoise and enhance the recording. I just finished processing with your FullSubNet today, and it far surpassed the other ones I tried out in removing the recording noise to make the voice easier to understand. Enclosed is a graphic of the comparison of the frequency spectograms of the 3 files where the top one is the original recording, the middle is the result of using another method (that was dozens of times slower than yours and principally dealt with white noise) and at the bottom is the result of FullSubNet using your pretrained checkpoint. The reduction in noise going from the original recording to what your method produced was astonishing! I can check to see if the archivist would allow me to provide the recordings (so if you are interested in getting them, please let me know). Thanks so much for making the code available here!
Regards
-Steve

compare_orig_n2n_fullsubnet

@haoxiangsnr haoxiangsnr pinned this issue Jul 3, 2021
@haoxiangsnr
Copy link
Member

Thank you for your attention. I am glad to hear it.

If you have any questions, please feel free to contact me.

@faranaziz
Copy link

Greetings!
I have been helping restore an old (1980) recording of a recording of an interview with an elderly person relaying stories of the early history of the Baha'i Faith in the U.S.. I have surveyed and tried a number of machine learning methods to denoise and enhance the recording. I just finished processing with your FullSubNet today, and it far surpassed the other ones I tried out in removing the recording noise to make the voice easier to understand. Enclosed is a graphic of the comparison of the frequency spectograms of the 3 files where the top one is the original recording, the middle is the result of using another method (that was dozens of times slower than yours and principally dealt with white noise) and at the bottom is the result of FullSubNet using your pretrained checkpoint. The reduction in noise going from the original recording to what your method produced was astonishing! I can check to see if the archivist would allow me to provide the recordings (so if you are interested in getting them, please let me know). Thanks so much for making the code available here!
Regards
-Steve

compare_orig_n2n_fullsubnet

Hello,
How did you run inference on a single WAV file? Can you share the code?
Thanks you.

@haoxiangsnr haoxiangsnr unpinned this issue Nov 1, 2021
@haoxiangsnr haoxiangsnr pinned this issue Nov 1, 2021
@DiLiangWU DiLiangWU unpinned this issue Oct 23, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants