The goal of the reconstruction loss here is simply to force the model to learn a good audio representation; we did not aim to make the model a strong reconstructor. If you want to convert the spectrogram back to a waveform, you will need a vocoder (not included in this repo).
I am not familiar with vocoders, but you can check this GitHub topic list: https://github.com/topics/vocoder. Note that most of these are built for TTS (speech) rather than general audio.
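As a rough, vocoder-free sanity check, one classical option is to invert the mel filterbank with a pseudo-inverse and then recover phase with the Griffin-Lim algorithm. The sketch below is a minimal NumPy/SciPy illustration, not SSAST's pipeline: the 16 kHz sample rate, 128 mel bins, 25 ms window, and 10 ms hop are assumed Kaldi-style fbank settings (adjust to your own config), the input `log_mel` is a random stand-in for a real model reconstruction, and audio quality will be far below a neural vocoder.

```python
import numpy as np
from scipy.signal import stft, istft

def mel_filterbank(n_mels, n_fft, sr):
    """Simplistic triangular mel filterbank, shape (n_mels, n_fft // 2 + 1)."""
    def hz_to_mel(f): return 2595.0 * np.log10(1.0 + f / 700.0)
    def mel_to_hz(m): return 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2))
    bins = np.floor((n_fft + 1) * hz_pts / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(1, n_mels + 1):
        lo, ctr, hi = bins[i - 1], bins[i], bins[i + 1]
        for k in range(lo, ctr):
            fb[i - 1, k] = (k - lo) / max(ctr - lo, 1)
        for k in range(ctr, hi):
            fb[i - 1, k] = (hi - k) / max(hi - ctr, 1)
    return fb

def griffin_lim(mag, n_fft, hop, sr, n_iter=32):
    """Estimate a waveform from a magnitude spectrogram by iterating
    istft -> stft and keeping only the updated phase each round."""
    rng = np.random.default_rng(0)
    angles = np.exp(2j * np.pi * rng.random(mag.shape))
    wav = None
    for _ in range(n_iter):
        _, wav = istft(mag * angles, fs=sr, nperseg=n_fft, noverlap=n_fft - hop)
        _, _, spec = stft(wav, fs=sr, nperseg=n_fft, noverlap=n_fft - hop)
        # The istft/stft round trip can change the frame count; pad or trim.
        if spec.shape[1] < mag.shape[1]:
            spec = np.pad(spec, ((0, 0), (0, mag.shape[1] - spec.shape[1])))
        else:
            spec = spec[:, :mag.shape[1]]
        angles = np.exp(1j * np.angle(spec))
    return wav

# Assumed fbank settings: 16 kHz, 128 mels, 25 ms window (400), 10 ms hop (160).
n_mels, n_fft, sr, hop = 128, 400, 16000, 160

# Hypothetical reconstructed log-mel frames, (time, n_mels); replace with the
# model's actual fbank reconstruction.
log_mel = 0.1 * np.random.default_rng(0).standard_normal((50, n_mels))

mel_power = np.exp(log_mel).T                                  # undo the log, (n_mels, time)
fb = mel_filterbank(n_mels, n_fft, sr)
power_spec = np.maximum(np.linalg.pinv(fb) @ mel_power, 0.0)   # back to linear-frequency bins
wav = griffin_lim(np.sqrt(power_spec), n_fft=n_fft, hop=hop, sr=sr)
print(wav.shape)  # 1-D waveform, ready to save with e.g. soundfile.write
```

If you are already using torchaudio, its `InverseMelScale` and `GriffinLim` transforms implement the same two steps and would be the more idiomatic route in this codebase.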
Given that the fbank features reconstructed by SSAST are not straightforward to interpret, how can they be transformed back into raw audio for further analysis?