Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NSNet 2 baseline ONNX model with 512-size STFT and 32ms window #48

Closed
hashemelezabi opened this issue May 16, 2021 · 2 comments
Closed
Assignees
Labels
wontfix This will not be worked on

Comments

@hashemelezabi
Copy link

Hello,

In the paper describing the NSNet 2 baseline, you use an STFT of size 512 with a 32ms square-root Hann window, but in the provided nsnet2 baseline model under NSNet2-baseline, you use a size 320 STFT with a 20ms window. Is there a link where I can find the pretrained ONNX model for the 512-size STFT with 32ms window? I'm working on hardware-accelerating NSNet 2 inference using Spatial, and the FFT algorithm I'm using is the efficient Cooley-Tukey algorithm which requires power-of-2 inputs. Right now I need to pad the input audio frame of size 320 by zeros to reach size 512, after which I discard redundant/useless information from the output frame DFT to get a 161-size feature vector to feed into the provided model (after computing the log-power spectrum). This wastes computation, so having access to the 512-size STFT model would be very helpful.

I can't train it myself because I don't have the compute resources or even the storage to store the training data available at the moment.

@motus motus self-assigned this Nov 11, 2021
@motus motus added help wanted Extra attention is needed question Further information is requested wontfix This will not be worked on and removed help wanted Extra attention is needed labels Nov 11, 2021
@motus
Copy link
Member

motus commented Nov 11, 2021

@hashemelezabi I'll check if we have that model laying around and if we can share it. Stay tuned :)

@motus motus removed the question Further information is requested label Nov 15, 2021
@vishakg
Copy link
Contributor

vishakg commented Dec 13, 2021

@hashemelezabi check if the new 48khz model uploaded recently works better for your needs.

@hdubey hdubey closed this as completed Jan 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

4 participants