What is the rule of thumb for generating music with wavenet?

I have a large library of mp3 songs. What is the best way to process these songs in order to get a good result with wavenet?

So far, I tried the following approach which doesn't seem to work very well: convert mp3s to wav files (16 bits per sample) and then run the training script (with default parameters). My questions are as follows:

- The mp3 files range between 3mb to 10mb (3 mins to 8 mins songs). Should I chunk those into smaller files of e.g. 30 seconds?
- Do you think the songs should be instrumental only or is it okay to have voices? (the songs I am using right now have voices and the output sounds very noisy)
- Do you have any ideas on how many songs/samples are considered as bare minimum for the training data? (just a ballpark estimate)

Please feel free to add any comments that you think its relevant to get a good result. Thanks a lot!



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

What is the rule of thumb for generating music with wavenet? #195

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

What is the rule of thumb for generating music with wavenet? #195

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions