Brain Tumor Classification: A Comparative Analysis of VGG16, ResNet50, Xception, and MobileNets Convolutional Neural Networks on MRI Images
-
Brain Tumor Classification: Analysis of VGG16, ResNet50, Xception, and MobileNets Convolutional Neural Networks on MRI ImagesMedium
-
Overview of VGG16, ResNet50, Xception and MobileNet Neural Networks Medium
Utilizing deep learning techniques, this project focuses on the classification of brain tumors in MRI scans, with an emphasis on comparing different transfer learning methods. Four specific deep neural network models - VGG16, MobileNet, Xception, and ResNet50 - are employed to classify brain tumors into four distinct categories: No Tumor, Glioma, Meningioma, and Pituitary. By analyzing these models, we aim to understand their effectiveness in predicting the presence of brain tumors based on MRI scans, providing valuable insights into their performance for tumor classification.
- Source of data Brain Tumor
The dataset consists of 3264 MRI images. 926 are “gliomas”, 937 are “meningiomas”, 901 are “pituitary” tumors, and 500 are images representing cases without tumors. Figure 1 illustrates sample images from each class, showing significant inter- and intra-class diversity.
The dataset was randomly divided into 2870 training and 394 test sets for the experiments. Figure below illustrates the statistics of the training and test sets, showing a notable difference in the distribution of classes between them. This difference poses a challenge for CNN models to generalize from the trained model to the test set. In addition, both sets exhibit class imbalances, presenting an additional challenge to the models. For training, a validation set consisting of 15% of the training set was randomly sampled with a class distribution similar to the training set.
During the training phase, the specified models are trained on the training dataset and validated on the validation set. A consistent batch size of 64, the Adam optimizer, and the Categorical Cross Entropy loss function are used for all models. Due to variations in structure and parameter count, different learning rates are assigned to ensure effective training, and different numbers of epochs are implemented to prevent overfitting as the models converge.
The trained models are then tested on the test dataset. Considering the training and validation curves, the best epoch is selected when the loss is the smallest, the accuracy is the largest for the training, and just before the model starts overfitting when the validation curves start to diverge. Therefore, for VGG16 epoch 15, for ResNet50 epoch 46, for Xception epoch 20, and for MobileNet epoch 50 are considered for evaluation.
- Satisfactory performance observed for VGG16, Xception, and MobileNet in the Brain Tumor Classification dataset.
- Exception for ResNet50, which exhibited suboptimal performance.
- Noteworthy success in classifying "meningioma" and "no-tumor" cases.
- Investigate methods for enlarging and diversifying datasets to bolster the resilience of the model.
- Implement data augmentation techniques to address class imbalances.
- Investigate the impact of hyperparameter tuning for improved model performance.
- Explore alternative optimizers and loss functions to optimize the training process.
- Address the challenges posed by misclassifications through further investigation.
- Future studies should focus on refining the model's ability to handle diverse and imbalanced datasets.
- Opportunities for enhancement involve refining algorithms and delving into sophisticated optimization methods.