New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inverse Spectrogram and Mel-Spectrogram Layer? #40
Comments
Hi, good to know that Kapre is useful for you! I think that's good idea, although the development of Kapre has been paused for a while. I'll get back to work in a month and will make some update then. |
Maybe make a Griffin-Lim spectrogram inversion? BTW congrats, great work. |
Btw, librosa 0.7+ has Griffin-Lim and inverse melspectrogram. Not using GPU though. https://librosa.github.io/librosa/generated/librosa.feature.inverse.mel_to_audio.html#librosa.feature.inverse.mel_to_audio |
Wow! Is there a way to push a gradient through this? |
Forward pass probably not a problem. No idea about backpropagation though. Probably hard, as Griffin-Lim is a iterative algorithm. One-shot approximations for spectrogram inversion based on neural networks have been explored in the literature though, for example Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks. Those would be backprop-able by construction |
It's on my to-do now and will be added on |
Namaste!
kapre has become an integral part of all my audio Deep Learning experiments. Powerful! Thanks for providing such a great software!
I was thinking... I guess it would make sense to have layers for inverse spectrogram and inverse mel-spectrogram. Thinking about Autoencoders, this would be even more powerful. I know that reconstructing samples from spectrograms is not the best, but it is possible to a certain degree.
What do you think about that feature request?
Best,
Tristan
The text was updated successfully, but these errors were encountered: