Pytorch-CGX-FFCV

This repository walks through training on CIFAR-10 with both the CGX and FFCV frameworks. CGX is a PyTorch extension that optimizes multi-GPU training, while FFCV speeds up data loading and so lowers training cost. Combining the two on an RTX 3090 Optimized instance from Genesis Cloud gave significant results: adding FFCV to CGX cut training time roughly in half (62.63 s to 32.84 s in the table below), which makes multi-GPU training considerably cheaper, especially compared with other cloud providers such as AWS and GCP.

To run this training, first follow the installation steps for both FFCV and CGX. We recommend building the FFCV environment first and then installing Pytorch-CGX inside it. Once both are installed, you should be able to import FFCV and CGX in your Python code. The last step is to run the code. Start by creating the dataset in an FFCV-compatible format with write_datasets.py:

python write_datasets.py --config-file default_config.yaml
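
If the command above fails on an import, it can help to confirm that both frameworks are visible from the active environment. The sketch below is only a sanity check, not part of the repository. The module names `ffcv` and `torch_cgx` are assumptions based on the projects' usual import names and may differ for your installation.

```python
import importlib.util

# Assumed import names; adjust if your installation exposes different modules.
REQUIRED_MODULES = ("torch", "ffcv", "torch_cgx")


def missing_modules(names=REQUIRED_MODULES):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

The check only looks the modules up without importing them, so it runs quickly and does not need a GPU.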

CGX is built on an MPI backend and requires an MPI implementation, so to run the script on multiple GPUs, use the following command in your terminal:

mpirun -np $NUM_GPUs python train_cifar_ffcv_cgx.py

Replace $NUM_GPUs with the number of GPUs you wish to use.
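
Each process started by mpirun needs to know which GPU it owns. As a minimal sketch, assuming Open MPI is the launcher, the rank information can be read from the environment variables that Open MPI exports to every process. The helper name `mpi_process_info` is hypothetical, other MPI implementations use different variable names, and train_cifar_ffcv_cgx.py may handle this differently.

```python
import os


def mpi_process_info(env=None):
    """Read the rank information that Open MPI's mpirun exports to each process.

    Falls back to a single-process layout when the variables are absent,
    for example when the script is started with plain `python`.
    """
    env = os.environ if env is None else env
    return {
        "rank": int(env.get("OMPI_COMM_WORLD_RANK", 0)),
        "world_size": int(env.get("OMPI_COMM_WORLD_SIZE", 1)),
        "local_rank": int(env.get("OMPI_COMM_WORLD_LOCAL_RANK", 0)),
    }


if __name__ == "__main__":
    info = mpi_process_info()
    # In a training script, local_rank would typically select the GPU,
    # e.g. torch.cuda.set_device(info["local_rank"]) (not executed here).
    print(
        f"rank {info['rank']} of {info['world_size']}, "
        f"local GPU index {info['local_rank']}"
    )
```

Running this file under `mpirun -np 4 python ...` should print one line per process, each with a different rank.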

On an RTX 3090 Optimized instance from Genesis Cloud, we ran train_cifar_ffcv_cgx.py on 4 GPUs. Training on CIFAR-10 for 20 epochs, enough to pass 90% accuracy, took about a sixth of the time the vanilla code needs (32.84 s versus 190 s). The table below shows the numbers:

Configuration	Avg. Training time (s)	Avg. Training Cost ($)
Vanilla Code	190	0.17
CGX	62.63	0.057
CGX + FFCV	32.84	0.03
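
The speedups behind these numbers can be recomputed directly from the table. The short sketch below does that, and also back-calculates the hourly rate implied by each row. That rate is only derived from the table's time and cost columns; it is not an official Genesis Cloud price.

```python
# Figures copied from the table above: (average time in s, average cost in $).
RESULTS = {
    "Vanilla Code": (190.0, 0.17),
    "CGX": (62.63, 0.057),
    "CGX + FFCV": (32.84, 0.03),
}


def speedup(name, baseline="Vanilla Code"):
    """Return how many times faster `name` is than the baseline row."""
    return RESULTS[baseline][0] / RESULTS[name][0]


def implied_hourly_rate(name):
    """Return the $/hour implied by a row's cost and time (a derived estimate)."""
    seconds, cost = RESULTS[name]
    return cost / seconds * 3600


if __name__ == "__main__":
    for name in RESULTS:
        print(
            f"{name}: {speedup(name):.2f}x vs. vanilla, "
            f"~${implied_hourly_rate(name):.2f}/hour implied"
        )
```

All three rows imply a similar hourly rate, so the cost savings in the table come from the shorter run time rather than from a cheaper instance.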
