New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Any Ref about the MCEP post-processing ? #241
Comments
I believe @zhizhengwu wrote this piece of code -- he might have some reference. I once had it, but can't find now. |
This might be helpful #38 |
The main idea seems to be multiplying the MGC coefficients, excluding c(0) and c(1), with some post-filtering constant. I think the post-filter is similar to what Takaki et al. used in their recent paper: Both that and Baji's reference point to this paper: |
It is originally from HTS. Please check this paper: https://docslide.us/documents/incorporating-a-mixed-excitation-model-and-postfilter-into-hmm-based-text-to-speech.html |
Thanks a lot guys! (and sorry for the duplicate ...) The HTML link is dead though. |
I see some confusion, in the paper referenced by @zhizhengwu, they describe a postfilter on Mel frequency cepstral coefficients but in your code (in generate.py), the input file mgc is Mel generalized cepstral coenficients. So I think, it have a bit differences. |
Hi,
Does anybody has any bibliographic reference about the post-processing used in Merlin ?
Starting at
merlin/src/utils/generate.py
Line 199 in 7c458c7
I dare saying the SPTK code is a bit cryptic.
The text was updated successfully, but these errors were encountered: