Closed
Hi! I am a researcher working on audio, including audio compression, using AI.
I am wondering:
Would it be possible and feasible to replace the static range coding PDFs used in Opus with dynamically predicted ones from a deep recurrent net (similar to the VAD)? The idea is that a deep net estimating the probability distribution over the alphabet for each coded symbol can give a higher compression ratio than a fixed table.
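To make the concept concrete, here is a rough sketch (not Opus code) comparing the ideal code length of a symbol stream under one fixed PDF against a PDF that is re-predicted before every symbol. It only assumes that an ideal range coder spends about -log2(p) bits on a symbol of probability p. The function names, the toy symbol stream, and the previous-symbol count model standing in for the recurrent net are all hypothetical choices for illustration.

```python
import math
from collections import Counter, defaultdict


def static_pdf(symbols, alphabet_size):
    """Fit one fixed PDF (add-one smoothed) over the whole sequence."""
    counts = Counter(symbols)
    total = len(symbols) + alphabet_size
    return [(counts[s] + 1) / total for s in range(alphabet_size)]


def ideal_bits_static(symbols, pdf):
    """Ideal code length in bits when every symbol uses the same PDF."""
    return sum(-math.log2(pdf[s]) for s in symbols)


def ideal_bits_predicted(symbols, alphabet_size):
    """Ideal code length when the PDF is re-predicted for each symbol.

    Hypothetical stand-in for a recurrent net: the PDF depends only on the
    previous symbol, and the counts are updated after the symbol is coded,
    so a decoder running the same model would derive the same PDFs.
    """
    counts = defaultdict(lambda: [1] * alphabet_size)
    prev = 0  # assumed initial context, shared by encoder and decoder
    bits = 0.0
    for s in symbols:
        ctx = counts[prev]
        bits += -math.log2(ctx[s] / sum(ctx))
        ctx[s] += 1
        prev = s
    return bits


# Toy stream with strong structure that a single static PDF cannot exploit.
symbols = [0, 1, 2, 3] * 64
static_bits = ideal_bits_static(symbols, static_pdf(symbols, 4))
predicted_bits = ideal_bits_predicted(symbols, 4)
print(f"static PDF:    {static_bits:.1f} bits")
print(f"predicted PDF: {predicted_bits:.1f} bits")
```

On this toy stream the static PDF is uniform, so it cannot do better than 2 bits per symbol, while the predicted PDF quickly learns which symbol follows which. Whether real Opus symbols leave a comparable amount of redundancy is exactly the open question.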
I've read through the Opus spec and some of the source, and found a few places that use the range coder, so it seems hypothetically possible. However, I am having a hard time finding anyone truly familiar with the internals of the Opus codec who could help me determine whether this path is likely to be fruitful.
Thanks!