Skip to content

What is the rule of thumb for generating music with wavenet? #195

@pennygalaxi

Description

@pennygalaxi

I have a large library of mp3 songs. What is the best way to process these songs in order to get a good result with wavenet?

So far, I tried the following approach which doesn't seem to work very well: convert mp3s to wav files (16 bits per sample) and then run the training script (with default parameters). My questions are as follows:

  • The mp3 files range between 3mb to 10mb (3 mins to 8 mins songs). Should I chunk those into smaller files of e.g. 30 seconds?
  • Do you think the songs should be instrumental only or is it okay to have voices? (the songs I am using right now have voices and the output sounds very noisy)
  • Do you have any ideas on how many songs/samples are considered as bare minimum for the training data? (just a ballpark estimate)

Please feel free to add any comments that you think its relevant to get a good result. Thanks a lot!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions