Skip to content

Trained TTS Vitsmodel Restricted to 11 Seconds of Audio Generation #6972

Answered by treacker
shuvohishab asked this question in Q&A
Discussion options

You must be logged in to vote

Sorry for late responce. The problem is in max_len parameter, which is set by default to 1000 and can't be changed from convert_text_to_waveform function. The workaround is either to use forward function which is called inside convert_text_to_waveform or add option to change max_len. @XuesongYang choose please what is preferred solution

Replies: 5 comments 19 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
19 replies
@PhilipAmadasun
Comment options

@shuvohishab
Comment options

@PhilipAmadasun
Comment options

@shuvohishab
Comment options

@PhilipAmadasun
Comment options

Answer selected by shuvohishab
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
5 participants