Fast Forward Computer Vision for Pretraining

This library is derived from FFCV to optimize the memory usage and accelerate data loading.

Installation

Running Environment

conda create -y -n ffcv "python>=3.9" cupy pkg-config "libjpeg-turbo>=3.0.0" "opencv=4.10.0" numba -c conda-forge
conda activate ffcv
conda install pytorch-cuda=11.3 torchvision  -c pytorch -c nvidia
pip install .

Prepackaged Computer Vision Benchmarks

From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.

Make Dataset

We provide a script to make the dataset examples/write_dataset.py, which provides three mode:

jpg: The script will compress all the images to jpg format.
png: The script will compress all the images to png format. This format is too slow.
raw: The script will not compress the images.
smart: The script will compress the images larger than the threshold.
proportion: The script will compress a random subset of the data with size specified by the compress_probability argument.

python examples/write_dataset.py --cfg.write_mode=smart --cfg.threshold=206432  --cfg.jpeg_quality=90  \
    --cfg.num_workers=40 --cfg.max_resolution=500 \
    --cfg.data_dir=$IMAGENET_DIR/train \
    --cfg.write_path=$write_path

ImageNet

We provide a self-contained script for training ImageNet fast. Above we plot the training time versus accuracy frontier, and the dataloading speeds, for 1-GPU ResNet-18 and 8-GPU ResNet-50 alongside a few baselines.

TODO:

Link to Config	top_1	top_5	# Epochs	Time (mins)	Architecture	Setup
Link	0.784	0.941	88	77.2	ResNet-50	8 x A100
Link	0.780	0.937	56	49.4	ResNet-50	8 x A100
Link	0.772	0.932	40	35.6	ResNet-50	8 x A100
Link	0.766	0.927	32	28.7	ResNet-50	8 x A100
Link	0.756	0.921	24	21.7	ResNet-50	8 x A100
Link	0.738	0.908	16	14.9	ResNet-50	8 x A100
Link	0.724	0.903	88	187.3	ResNet-18	1 x A100
Link	0.713	0.899	56	119.4	ResNet-18	1 x A100
Link	0.706	0.894	40	85.5	ResNet-18	1 x A100
Link	0.700	0.889	32	68.9	ResNet-18	1 x A100
Link	0.688	0.881	24	51.6	ResNet-18	1 x A100
Link	0.669	0.868	16	35.0	ResNet-18	1 x A100

Train your own ImageNet models! You can use our training script and premade configurations to train any model seen on the above graphs.

CIFAR-10

We also include premade code for efficient training on CIFAR-10 in the examples/ directory, obtaining 93% top1 accuracy in 36 seconds on a single A100 GPU (without optimizations such as MixUp, Ghost BatchNorm, etc. which have the potential to raise the accuracy even further). You can find the training script here.

Features

Compared to the original FFCV, this library has the following new features:

crop decode: RandomCrop and CenterCrop are now implemented to decode the crop region, which can save memory and accelerate decoding.
cache strategy: There is a potential issue that the OS cache will be swapped out. We use FFCV_DEFAULT_CACHE_PROCESS to control the cache process. The choices for the cache process are:
- 0: os cache
- 1: process cache
- 2: Shared Memory
lossless compression: PNG is supported for lossless compression. We use RGBImageField(mode='png') to enable the lossless compression.
few memory: We optimize the memory usage and accelerate data loading.

Comparison of throughput:

img_size	112	160	192	224					512
batch_size	512	512	512	128	256	512			512
num_workers	10	10	10	10	10	5	10	20	10
loader
ours	23024.0	19396.5	16503.6	16536.1	16338.5	12369.7	14521.4	14854.6	4260.3
ffcv	16853.2	13906.3	13598.4	12192.7	11960.2	9112.7	12539.4	12601.8	3577.8

Comparison of memory usage:

img_size	112	160	192	224					512
batch_size	512	512	512	128	256	512			512
num_workers	10	10	10	10	10	5	10	20	10
loader
ours	9.0	9.8	11.4	5.8	7.7	11.4	11.4	11.4	34.0
ffcv	13.4	14.8	17.7	7.6	11.0	17.7	17.7	17.7	56.6

Name		Name	Last commit message	Last commit date
Latest commit History 977 Commits
.github/workflows		.github/workflows
assets		assets
docker		docker
docs		docs
examples		examples
ffcv		ffcv
libffcv		libffcv
test_data		test_data
tests		tests
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
ffcv-conda.yml		ffcv-conda.yml
install.sh		install.sh
mypy.ini		mypy.ini
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast Forward Computer Vision for Pretraining

Installation

Running Environment

Prepackaged Computer Vision Benchmarks

Make Dataset

ImageNet

CIFAR-10

Features

About

Releases

Packages

Languages

License

erow/ffcv

Folders and files

Latest commit

History

Repository files navigation

Fast Forward Computer Vision for Pretraining

Installation

Running Environment

Prepackaged Computer Vision Benchmarks

Make Dataset

ImageNet

CIFAR-10

Features

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages