This kind of architecture is really good at low bitrates but fails to be competitive at higher bitrates. Specifically, unlike MP3, it will not reach a state where the audio is indistinguishable from the uncompressed audio as you increase the bandwidth.
That makes sense, thanks! Are you aware of any papers or architectures that can operate at higher quality levels while still having some sort of discrete representation internally (via RVQ or something else)?
❓ Questions
Hello! Do you intend to train models with a target bitrate above 24 kbps? I didn't see anything in the paper, but maybe I missed it.
I'd be curious to see how 48 and 96 kbps models compare to MP3s at higher bitrates.
Thanks for the great work!