Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Microphone style transfer (MicAugment) #88

Open
iver56 opened this issue Jul 16, 2021 · 4 comments
Open

Microphone style transfer (MicAugment) #88

iver56 opened this issue Jul 16, 2021 · 4 comments

Comments

@iver56
Copy link
Collaborator

iver56 commented Jul 16, 2021

À la https://arxiv.org/abs/2010.09658

@akashrajkn
Copy link
Contributor

I am curious - how do you plan on using micaugment? Is it the idea that we can submit a trained micaugment model as part of the transform?

@iver56
Copy link
Collaborator Author

iver56 commented Jan 5, 2022

I haven't thought it through, but yeah, we should ideally have a pretrained model that is ready to be used. The model can be uploaded as a binary in a github release, and can be downloaded and stored locally on demand (the first time the transform gets used). This approach is inspired by the way Keras did pretrained imagenet models.

Would https://github.com/akashrajkn/micaugment be suitable?

@akashrajkn
Copy link
Contributor

I think it is suitable - however, I still have to update the repo with a pretrained model.

@iver56
Copy link
Collaborator Author

iver56 commented Jan 7, 2022

It would be awesome if you could make that happen 🤩 But I guess the pretrained model would depend on a specific sample rate, right? Ideally, torch-audiomentations should be compatible with a wide range of sample rates 🤔 Maybe it could do some resampling to match the sample rate used in the model

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants