Multiple GPU setup help #812
Comments
accelerate launch --num_processes=[NUM_YOUR_GPUS_PER_MACHINE] --num_machines=[NUM_YOUR_INDEPENDENT_MACHINES] --multi_gpu --gpu_ids=[GPU_IDS] "train_network.py" args... If you have 4 GPUs and one machine, give args as |
Thanks for the reply. I'm slowly learning everything as I go along. A friend and I spent hours trying to figure it out, and I read the previous posts before asking. So where do the args go? Into which file, train_network.py? |
When training on Paperspace Gradient with two A6000 GPUs, running "accelerate config" from the terminal was enough to get training with "bmaltais/kohya_ss" working. Also, when training with sd-scripts, I recall that multiple GPUs worked through accelerate without setting any specific arguments. What GPU(s) (by id) should be used for training on this machine as a comma-seperated list? [all]:all
|
You can list the arguments of train_network.py by running the following command in a terminal or command prompt from the sd-scripts directory.
And if you want to use multiple GPUs with sd-scripts, you need to understand what the accelerate library does. |
Does this look like I'm on the right path? D:\Kohya_ss\kohya_ss>accelerate launch --num_processes=2 --multi_gpu --num_machines=1 --gpu_ids=0,1 "train_network.py" -- --resolution 1024 [Dataset 0] [Dataset 0] |
Here are example command lines for training a LoRA.
If you want to fully fine-tune the model, use "fine_tune.py" instead of "train_network.py". |
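The example command lines themselves are not preserved in this thread. As a rough illustration only, the launcher flags quoted earlier (--num_processes, --num_machines, --multi_gpu, --gpu_ids) could be assembled for a single machine as in the Python sketch below. The helper name build_launch_command is hypothetical, and the only training argument shown is the --resolution 1024 from the question above; every other train_network.py argument is left out and would still need to be supplied.

```python
import shlex


def build_launch_command(gpu_ids, script="train_network.py", script_args=()):
    """Assemble an `accelerate launch` command for ONE machine.

    Hypothetical helper for illustration: it starts one process per listed
    GPU and passes `script_args` through to the training script unchanged.
    """
    if not gpu_ids:
        raise ValueError("gpu_ids must list at least one GPU id")
    cmd = [
        "accelerate", "launch",
        f"--num_processes={len(gpu_ids)}",  # one process per GPU
        "--num_machines=1",                 # single-machine assumption
    ]
    if len(gpu_ids) > 1:
        cmd.append("--multi_gpu")           # only needed with 2+ GPUs
    cmd.append("--gpu_ids=" + ",".join(str(i) for i in gpu_ids))
    cmd.append(script)
    cmd.extend(script_args)                 # training args go AFTER the script
    return cmd


# Two GPUs (ids 0 and 1), mirroring the command in the question above.
cmd = build_launch_command([0, 1], script_args=["--resolution", "1024"])
print(shlex.join(cmd))
```

The point of the sketch is the ordering: launcher flags come before the script name, and everything after the script name is handed to train_network.py itself, so the arguments are not edited into any file.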
What is the setup for two machines on the same network? I am failing to get that part set up. My second machine seems to be configured correctly, but on the main one I have no idea what to enter for the IP and port, because when I run a training it says the port is already in use (by the kohya UI itself running on the main machine). |
@Charmandrigo |
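For the "port is already in use" symptom described above, one way to narrow things down is to check whether a given port can be bound on the main machine before handing it to accelerate (the launcher accepts it via --main_process_port). The Python sketch below assumes the conflict really is another program, such as the UI, holding the same port; the function names port_is_free and find_free_port are hypothetical.

```python
import socket


def port_is_free(port, host="0.0.0.0"):
    """Return True if a TCP socket can be bound to (host, port) right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            # Typically "address already in use": something else holds the port.
            return False
        return True


def find_free_port(host="0.0.0.0"):
    """Ask the OS for a currently unused TCP port and return its number."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))  # port 0 lets the OS pick a free port
        return sock.getsockname()[1]


if __name__ == "__main__":
    candidate = find_free_port()
    print(f"port {candidate} is free: {port_is_free(candidate)}")
```

A port reported as free here can still be taken by another program a moment later, so this is a diagnostic aid rather than a guarantee. The same port number would then need to be used on both machines, together with the main machine's network address.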
Hello, I haven't found a guide for multiple GPU setup for Kohya. Does anyone have a step-by-step guide? I keep getting errors trying to work this out on my own, and there is no clear guide for it. It would be greatly appreciated if someone could point me in the right direction.