Image Classification using PyTorch

Installation

Requirements

Python >=3.10
Poetry

Install

If simply using the package on cpu:

poetry install --with cpu

If developing you can add any of the following options:

poetry install --with dev,test,gpu

Then use poetry shell to enter the virtualenv.

Data

Get some data and format it:

You need to split the images into a validation and a train folders. For each class, place all the images in a folder with the class's name. You then need to create a classes.names file next to the train and validation folders, with the names of the classes (one per line).

Structure example

cifar-10/ ├── Train/ │ ├── airplaine │ ├── automobile │ ├── bird │ ├── cat │ └── ... ├── Validation/ │ ├── airplaine │ ├── automobile │ ├── bird │ ├── cat │ └── ... └── classes.names

CIFAR-10 instructions

The commands below will download, extract and format the cifar 10 dataset into the ./data/cifar_10_images folder.

wget https://www.cs.toronto.edu/\~kriz/cifar-10-python.tar.gz -P data
tar -xvf data/cifar-10-python.tar.gz -C data
python utils/cifar_10.py data/cifar-10-batches-py
rm data/cifar-10-python.tar.gz
rm -r data/cifar-10-batches-py/

Note: You'll need to modify a few values in config/model_config.py in the next step since cifar10's images are small.

    CROP_IMAGE_SIZES: tuple[int, int] = (32, 32)  # Center crop
    RESIZE_IMAGE_SIZES: tuple[int, int] = (32, 32)  # All images will be resized to this size
...
    CHANNELS: list[int] = field(default_factory=lambda: [3, 16, 32, 16])
    SIZES: list[int | tuple[int, int]] = field(default_factory=lambda: [3, 3, 3])   # Kernel sizes
    STRIDES: list[int | tuple[int, int]] = field(default_factory=lambda: [2, 2, 2])
    PADDINGS: list[int | tuple[int, int]] = field(default_factory=lambda: [1, 1, 1])
    BLOCKS: list[int] = field(default_factory=lambda: [1, 2, 1])

Imagenette instructions

The commands below will download, extract and format the cifar 10 dataset into the ./data/cifar_10_images folder.

wget https://s3.amazonaws.com/fast-ai-imageclas/imagenette2.tgz -P data
tar -xvf data/imagenette2.tgz -C data
python utils/preprocess_imagenette.py data/imagenette2
rm data/imagenette2.tgz

Config files

In the config folder of this repo you will find two config template files. You need to copy them and remove the "_template" part like this:

cp config/data_config_template.py config/data_config.py
cp config/model_config_template.py config/model_config.py

RecordConfig

Contains config for recording TensorBoard and checkpoints. You probably just want to modify _training_name.

TrainConfig

Contains the parameters that influence training. Most default values should work okayish, but you'll need to modify a few:

MAX_EPOCHS: usually around 400 epochs is enough, you will need to train at least once to get an idea for your particular dataset.
IMG_MEAN and IMG_STD: The defaults are the imagenet ones. You can keep them as long as they are not too different from the actual ones (especially if using a pretrained model).

Imagenette example

The default, gitted config should give decent-ish (~85% val acc) result.

Cifar-10 example

If training on Cifar-10, you'll need to modify the model in the config `src/classfication/configs/train_config.py` since cifar10's images are small. You'll also need to remove/modify the resize hardcoded in `src/classfication/train.py`. ```python MODEL: ModelHelper = ModelHelper.SmallDarknet CHANNELS: list[int] = field(default_factory=lambda: [3, 16, 32, 16]) SIZES: list[int | tuple[int, int]] = field(default_factory=lambda: [3, 3, 3]) # Kernel sizes STRIDES: list[int | tuple[int, int]] = field(default_factory=lambda: [2, 2, 2]) PADDINGS: list[int | tuple[int, int]] = field(default_factory=lambda: [1, 1, 1]) BLOCKS: list[int] = field(default_factory=lambda: [1, 2, 1]) ```

Train

Once you have the environment all set up and your two config files ready, training an AI is straightforward.

classification-train \
    --train_data_path <path to train dataset> \
    --val_data_path <path to val dataset> \
    --classes_names_path <path to classes.names file>

Imagenette example

classification-train \
    --train_data_path data/imagenette2/train/ \
    --val_data_path data/imagenette2/val/ \
    --classes_names_path data/imagenette2/classes.names

Results

The resulting checkpoints can be found in CHECKPOINTS_DIR (see the RecordConfig). The resulting checkpoints can be found in TB_DIR (see the RecordConfig).

Inference

classification-test \
    checkpoints/imagenette_resnet32/train_50.pt \
    data/imagenette2/val \
    --classes_names_path data/imagenette2/classes.names \
    --limit 100

Gradcam

classification-gradcam \
    checkpoints/imagenette_resnet32/train_50.pt \
    data/imagenette2/val \
    --classes_names_path data/imagenette2/classes.names \
    --limit 10

Name		Name	Last commit message	Last commit date
Latest commit History 237 Commits
.github/workflows		.github/workflows
data		data
src/classification		src/classification
utils		utils
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Image Classification using PyTorch

Installation

Requirements

Install

Data

Get some data and format it:

Config files

RecordConfig

TrainConfig

Train

Results

Inference

Gradcam

About

Uh oh!

Releases 4

Packages

Uh oh!

Languages

hoel-bagard/image_classification_pytorch

Folders and files

Latest commit

History

Repository files navigation

Image Classification using PyTorch

Installation

Requirements

Install

Data

Get some data and format it:

Config files

RecordConfig

TrainConfig

Train

Results

Inference

Gradcam

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Languages

Packages