Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

multichannel / stereo #54

Open
ediooo opened this issue Mar 29, 2023 · 1 comment
Open

multichannel / stereo #54

ediooo opened this issue Mar 29, 2023 · 1 comment

Comments

@ediooo
Copy link

ediooo commented Mar 29, 2023

I think I've heard some examples in stereo?
Is this possible using the CLI version?

@olaviinha
Copy link
Contributor

olaviinha commented Apr 30, 2023

In the Colab notebook stereo is simulated by separate generations of left and right channel and mashing them together into one stereo file. Left channel audio is generated by text prompt, right channel audio is generated by style-transferring the same prompt to the newly generated left channel audio with a low transfer strength. It's not very authentic, but it works pretty well most of the time.

I suppose you can do that in CLI version too, then just use a third party tool like ffmpeg or sox to merge the channels:
sox -M left.wav right.wav stereo.wav

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants