Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inverse Spectrogram and Mel-Spectrogram Layer? #40

Closed
AI-Guru opened this issue Jun 11, 2018 · 7 comments
Closed

Inverse Spectrogram and Mel-Spectrogram Layer? #40

AI-Guru opened this issue Jun 11, 2018 · 7 comments

Comments

@AI-Guru
Copy link

AI-Guru commented Jun 11, 2018

Namaste!

kapre has become an integral part of all my audio Deep Learning experiments. Powerful! Thanks for providing such a great software!

I was thinking... I guess it would make sense to have layers for inverse spectrogram and inverse mel-spectrogram. Thinking about Autoencoders, this would be even more powerful. I know that reconstructing samples from spectrograms is not the best, but it is possible to a certain degree.

What do you think about that feature request?

Best,
Tristan

@keunwoochoi
Copy link
Owner

Hi, good to know that Kapre is useful for you! I think that's good idea, although the development of Kapre has been paused for a while. I'll get back to work in a month and will make some update then.

@douglas125
Copy link
Contributor

douglas125 commented Jul 23, 2018

Maybe make a Griffin-Lim spectrogram inversion? BTW congrats, great work.

@jonnor
Copy link

jonnor commented Dec 14, 2019

Btw, librosa 0.7+ has Griffin-Lim and inverse melspectrogram. Not using GPU though. https://librosa.github.io/librosa/generated/librosa.feature.inverse.mel_to_audio.html#librosa.feature.inverse.mel_to_audio

@AI-Guru
Copy link
Author

AI-Guru commented Dec 15, 2019

Wow! Is there a way to push a gradient through this?

@jonnor
Copy link

jonnor commented Dec 15, 2019

Forward pass probably not a problem. No idea about backpropagation though. Probably hard, as Griffin-Lim is a iterative algorithm. One-shot approximations for spectrogram inversion based on neural networks have been explored in the literature though, for example Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks. Those would be backprop-able by construction

@HudsonHuang
Copy link

Maybe we have STFT before this line ?
So, the inverse of Spectrogram and Mel-Spectrogram might be diffcult, but the inverse of STFT might be easy? A pytorch version is here

@keunwoochoi
Copy link
Owner

It's on my to-do now and will be added on 0.3.x.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants