Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to generate audio #3

Open
kily-wmz opened this issue Mar 24, 2024 · 2 comments
Open

How to generate audio #3

kily-wmz opened this issue Mar 24, 2024 · 2 comments

Comments

@kily-wmz
Copy link

Hi!

I have successfully run the code for the SFT model, but I was wondering how to convert the output into audio. Is this part of the code publicly available?
Thanks!

@Mark12Ding
Copy link
Collaborator

Hi,

Actually, we provide the code for parsing the generated string into the midi file at utils.py and an example in the inference.ipynb. You can check it out.

For the audio part, we use the commercial app ACE to convert the midi file into audio. Other vocal synthesizer apps are also capable of doing that such as synthesizerv.

Best,
Shuangrui

@kily-wmz
Copy link
Author

Hi!
I'm glad you could reply! I've successfully converted the output of the SFT model into a MIDI file and played it. But currently, it generates only pure audio. How can I generate an audio file that plays both the melody and the lyrics together? Is there any code available for this part? Looking forward to your reply.
Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants