small updates to match the paper's settings #2

Merged
merged 1 commit into from
Sep 22, 2023

Conversation

shenberg
Contributor

Section 4.4 specifies "We apply a Hann window size of 2048 and a hop size of 10 ms for STFT to compute the complex spectrogram..." and Section 4.1 says "All recordings are stereo with a sampling rate of 44.1k Hz", so I updated the constants accordingly.
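For reference, here is roughly how the 10 ms hop translates into samples at the quoted sampling rate. This is only a sketch of the arithmetic; `ms_to_samples` and the constant names are made up for illustration and are not the names used in the repository.

```python
# Hypothetical helper: convert a duration in milliseconds to a sample count.
def ms_to_samples(duration_ms, sample_rate):
    return round(duration_ms * sample_rate / 1000)

SAMPLE_RATE = 44100   # "44.1k Hz" from Section 4.1 of the paper
WINDOW_SIZE = 2048    # Hann window size from Section 4.4
HOP_MS = 10           # hop size from Section 4.4

hop_length = ms_to_samples(HOP_MS, SAMPLE_RATE)
print(hop_length)                 # hop in samples
print(WINDOW_SIZE / hop_length)   # how many hops fit in one window
```

The hop works out to 441 samples, which does not divide the 2048-sample window evenly.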

@lucidrains
Owner

@shenberg hey Roee! thank you! i don't know the audio hyperparameters well enough

hello from SF as well :)

@lucidrains lucidrains merged commit a002fd5 into lucidrains:main Sep 22, 2023
@shenberg
Contributor Author

Wow that was peak fast-response! Thanks!
Also, I'm just moving to Paris, my profile ain't up to date :)

@lucidrains
Owner

from one great city to another!

@faroit

faroit commented Sep 22, 2023

@shenberg @lucidrains I wouldn't recommend a hop size that isn't //2 or //4 though, as this hurts the perfect reconstruction abilities of the stft window
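One common way to read this concern is the constant-overlap-add (COLA) property: when shifted copies of the window sum to a constant, overlap-add resynthesis needs no position-dependent correction. The sketch below measures how far the summed Hann windows deviate from a constant for a few hop sizes. It assumes a periodic Hann window and is an illustration only, not code from this repository and not necessarily the exact criterion meant above; `hann_overlap_add_ripple` is a hypothetical name.

```python
import math

def hann_overlap_add_ripple(n_fft, hop, n_frames=64):
    """Relative peak-to-peak ripple of overlap-added periodic Hann windows."""
    # Periodic Hann window of length n_fft.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / n_fft) for n in range(n_fft)]

    # Overlap-add n_frames copies of the window, each shifted by `hop` samples.
    total = [0.0] * (hop * (n_frames - 1) + n_fft)
    for frame in range(n_frames):
        start = frame * hop
        for n, w in enumerate(window):
            total[start + n] += w

    # Drop one window length at each end so only fully overlapped samples remain.
    steady = total[n_fft:len(total) - n_fft]
    mean = sum(steady) / len(steady)
    return (max(steady) - min(steady)) / mean

for hop in (441, 512, 1024):
    print(hop, hann_overlap_add_ripple(2048, hop))
```

With a 2048-sample window, hops of 512 (`2048 // 4`) and 1024 (`2048 // 2`) should give a ripple at floating-point noise level, while 441 leaves a small but clearly nonzero ripple.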

@lucidrains
Owner

@faroit oh, so you would recommend 512?

@faroit

faroit commented Sep 22, 2023

Yeah. That's much safer

@lucidrains
Owner

ok, let's roll with that!

@lucidrains lucidrains mentioned this pull request Oct 19, 2023
3 participants