Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Statistics script for task embeddings #104

Closed
turian opened this issue Jul 24, 2021 · 1 comment
Closed

Statistics script for task embeddings #104

turian opened this issue Jul 24, 2021 · 1 comment

Comments

@turian
Copy link
Contributor

turian commented Jul 24, 2021

Summary statistics of task embedding output

Write a function for luigi that, for a particular directory, crawls it to find all WAV files and outputs stats to a JSON file:

  • number of files
  • mean, median, 25%, 75%, 10%, and 90% of audio file length

And then we want to run that both on the original WAV directories from extraction and then we want ALSO to run it at the end on the subsampled directories for each split. We will use this to sanity check stuff.

@jorshi
Copy link
Contributor

jorshi commented Jul 29, 2021

Completed in #123

@jorshi jorshi closed this as completed Jul 29, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants