YouTune makes it really easy to:
- fine-tune SDXL on images from YouTube videos
- fine-tune MusicGen on audio from YouTube videos
Just give it a URL and a model name on Replicate, and it’ll download the video, take screenshots of every 50 frames, remove near duplicates and very light/dark images, and create a training for you.
python tune.py <youtube-url>
With --audio
, it’ll download just the audio, convert it to mp3 and create a training for you.
python tune.py <youtube-url> --audio
Clone this repo, and setup and activate a virtualenv:
python3 -m pip install virtualenv
python3 -m virtualenv venv
source venv/bin/activate
Then, install the dependencies:
pip install -r requirements.txt
Make a Replicate account and set your token:
export REPLICATE_API_TOKEN=<token>
python tune.py <youtube-url>