Optical coherence tomography dataset classification

Framework for 2D classification of the OCT dataset.

Optical coherence tomography (OCT) is an emerging technology for performing high-resolution cross-sectional imaging. It utilizes the interferometry concept to create a cross-sectional map of the retina. OCT images are two-dimensional data sets that represent the optical backscattering in a cross-sectional plane through the tissue. OCT allows non-secondary visualization of various structures of the eye, including the retina, vitreous body, and choroid, and to detect pathological changes in them. The study of these images is essential for the diagnosis, treatment, and monitoring of various eye diseases.

The database consists of the following categories and images:

Age-related Macular Degeneration (AMD): 1231
Diabetic Macular Edema (DME): 147
Epiretinal Membrane (ERM): 155
Normal (NO): 332
Retinal Artery Occlusion (RAO): 22
Retinal Vein Occlusion (RVO): 101
Vitreomacular Interface Disease (VID): 76

For more information and details about the dataset see: https://rdcu.be/dELrE https://arxiv.org/abs/2312.08255

Requirements:

pytorch
torchvision
torcheval
timm
tqdm
munch
packaging
tensorboard
omegaconf
opencv-python
hydra-core
scikit-learn

How to use

In this work we use folder-form dataset structure:

├── dataset
    ├── train
        ├── class1
            ├── image1.jpg
            ├── ...
        ├── class2
        ├── class3
        ├── ...
    ├── val
    ├── test

Unzip the archive to the directory and use preprocessing.py for image preprocessing and dataset preparation:

python preprocessing.py

Optional arguments:

--dataset_folder', type=str, default='./OCT_dataset', help='path to dataset folder')
--labels_path', type=str, default='./OCTDL_dataset/labels.csv', help='path to labels.csv'
--output_folder', type=str, default='./dataset', help='path to output folder')
--crop_ratio', type=int, default=1, help='central crop ratio of image')
--image_dim', type=int, default=512, help='final dimensions of image')
--val_ratio', type=float, default=0.15, help='validation size')
--test_ratio', type=float, default=0.25, help='test size')
--padding', type=bool, default=False, help='padding to square')

Training:

python main.py

Optional arguments:

-c yaml_file      Specify the config file (default: configs/default.yaml)
-p                Print configs before training

Resources

Huang, Y., Lin, L., Cheng, P., Lyu, J., Tam, R. and Tang, X., 2023. Identifying the key components in ResNet-50 for diabetic retinopathy grading from fundus images: a systematic investigation. Diagnostics, 13(10), p.1664. [link]

Citation

@article{kulyabin2024octdl,
  title={OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods},
  author={Kulyabin, Mikhail and Zhdanov, Aleksei and Nikiforova, Anastasia and Stepichev, Andrey 
          and Kuznetsova, Anna and Ronkin, Mikhail and Borisov, Vasilii and Bogachev, Alexander 
          and Korotkich, Sergey and Constable, Paul A and Maier, Andreas},
  journal={Scientific Data},
  volume={11},
  number={1},
  pages={365},
  year={2024},
  publisher={Nature Publishing Group UK London},
  doi={https://doi.org/10.1038/s41597-024-03182-7}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
configs		configs
data		data
modules		modules
utils		utils
LICENSE		LICENSE
README.md		README.md
main.py		main.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optical coherence tomography dataset classification

How to use

Resources

Citation

About

Releases

Packages

Languages

License

MikhailKulyabin/OCTDL

Folders and files

Latest commit

History

Repository files navigation

Optical coherence tomography dataset classification

How to use

Resources

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages