-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Open
Description
I have a large library of mp3 songs. What is the best way to process these songs in order to get a good result with wavenet?
So far, I tried the following approach which doesn't seem to work very well: convert mp3s to wav files (16 bits per sample) and then run the training script (with default parameters). My questions are as follows:
- The mp3 files range between 3mb to 10mb (3 mins to 8 mins songs). Should I chunk those into smaller files of e.g. 30 seconds?
- Do you think the songs should be instrumental only or is it okay to have voices? (the songs I am using right now have voices and the output sounds very noisy)
- Do you have any ideas on how many songs/samples are considered as bare minimum for the training data? (just a ballpark estimate)
Please feel free to add any comments that you think its relevant to get a good result. Thanks a lot!
Metadata
Metadata
Assignees
Labels
No labels