Skip to content
This repository has been archived by the owner on Sep 1, 2024. It is now read-only.

Audio/Video data augmentation #49

Closed
YUCHEN005 opened this issue May 15, 2022 · 2 comments
Closed

Audio/Video data augmentation #49

YUCHEN005 opened this issue May 15, 2022 · 2 comments

Comments

@YUCHEN005
Copy link

Hi Authors,

Thank you for good sharing. Since you have added noise on input speech with 0.25 probability, I wonder if you have deployed data augmentation on the input image? (like the image flip that CV community usually do) If yes, how can I use it on the avhubert?

Thank you~

@chevalierNoir
Copy link
Contributor

Hi,

Yes. We do data augmentation on images, including horizontal flipping and random cropping. You can find details here.

@YUCHEN005
Copy link
Author

See it, thank you~

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants