Which GPU did you use? #14

nessessence · 2021-01-26T03:10:25Z

Sorry, there is training-time show in your experiment. I wonder which GPU did you use, and how many of them?

GalSang17 · 2021-01-29T06:11:30Z

I am also using this implementation, is it impossible to call distributed training?

jeonsworld · 2021-02-03T06:12:40Z

Transfer learning was performed using v100. Check the relative time of the tensorboard for the learning time.

jeonsworld · 2021-02-03T06:16:17Z

Distributed training on a single node can be executed as follows.

python3 -m torch.distributed.launch --nproc_per_node=NUM_OF_GPU train.py --train_batch_size BATCH_SIZE_PER_GPU --name cifar10-100_500 --dataset cifar10 --model_type ViT-B_16 --pretrained_dir checkpoint/ViT-B_16.npz

TitaniumOne · 2021-02-21T10:12:37Z

@jeonsworld Hello, how can I use multi-GPUs?

jeonsworld · 2021-04-16T04:54:20Z

@jeonsworld Hello, how can I use multi-GPUs?

There are DataParallel and Distributed ways to use multi-gpu in pytorch.
The current code supports distributed learning and uses the following command.

python3 -m torch.distributed.launch --nproc_per_node=NUM_OF_GPU train.py --train_batch_size BATCH_SIZE_PER_GPU --name cifar10-100_500 --dataset cifar10 --model_type ViT-B_16 --pretrained_dir checkpoint/ViT-B_16.npz

superxiaoying mentioned this issue Feb 22, 2021

Errors when use custom data to retrain the Vit-transformer #17

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Which GPU did you use? #14

Which GPU did you use? #14

nessessence commented Jan 26, 2021

GalSang17 commented Jan 29, 2021

jeonsworld commented Feb 3, 2021

jeonsworld commented Feb 3, 2021

TitaniumOne commented Feb 21, 2021

jeonsworld commented Apr 16, 2021

Which GPU did you use? #14

Which GPU did you use? #14

Comments

nessessence commented Jan 26, 2021

GalSang17 commented Jan 29, 2021

jeonsworld commented Feb 3, 2021

jeonsworld commented Feb 3, 2021

TitaniumOne commented Feb 21, 2021

jeonsworld commented Apr 16, 2021