Skip to content
Discussion options

You must be logged in to vote

It's in the original code as well, so likely by design: https://github.com/neonbjb/tortoise-tts/blob/8a2563ecabe93c4fb626f876dd0c52c966edef2f/tortoise/models/arch_util.py#L64

I don't see any particular reason mentioned in the paper, but it probably helps to make training more stable.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by eginhard
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants