# Model Card for ResNet-18 on DermaMNIST Image Classification

---

## Model Details

- **Person or Organization Developing Model**:  
  This implementation was developed by [Sai Madhavan](https://www.linkedin.com/in/sai-madhavan-g/) and [M Srinivasan](https://www.linkedin.com/in/srinivasan-m-668154228/) as part of an AI-839 project on skin lesion classification using the DermaMNIST dataset.

- **Model Date**:  
  November 2024

- **Model Version**:  
  ResNet-18

- **Model Type**:  
  Convolutional Neural Network (CNN)

- **Information about Training Algorithms, Parameters, Fairness Constraints, or Other Applied Approaches, and Features**:  
  ResNet-18 is an 18-layer residual network whose skip connections mitigate the vanishing gradient problem in deep networks. It was trained with the following settings:  
  - Optimizer: Adam  
  - Learning Rate: 0.001  
  - Batch Size: 32  
  - Number of Epochs: 20  
  - Loss Function: Cross-Entropy Loss

- **Fine-Tuning**:
  We started from a pre-trained ResNet-18 model from PyTorch and fine-tuned it for classification on the DermaMNIST dataset.
  - Modified the final layers to match the number of classes in the DermaMNIST dataset.
  - Trained the modified model on the DermaMNIST dataset.
  - Evaluated the performance of the fine-tuned model.

- **Paper or Other Resource for More Information**:  
  - [ResNet Original Paper (He et al., 2015)](https://arxiv.org/abs/1512.03385)  
  - [DermaMNIST Dataset Resource](https://medmnist.com)

- **Citation Details**:  
  - ResNet: *He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.*  
  - DermaMNIST: *Yang, Jiancheng, et al. "MedMNIST v2: A Large-Scale Lightweight Benchmark for 2D and 3D Biomedical Image Classification." arXiv preprint arXiv:2110.14795 (2021).*

- **License**:  
  Apache License 2.0 for the ResNet model implementation; refer to the MedMNIST dataset licensing for dataset usage.

 
- **Contact**:  
  Please reach out to [M Srinivasan](mailto:m.srinivasan@iiitb.ac.in) or [Sai Madhavan](mailto:g.saimadhavan@iiitb.ac.in) with any questions or comments about the model.

---




## Intended Use

- **Primary Intended Uses**:  
  - Early screening for skin cancer and other serious skin conditions in resource-constrained settings.  
  - Assisting non-specialist healthcare workers in making preliminary assessments of skin lesions.  
  - Identifying high-risk cases that require referral to dermatology specialists for further evaluation.

- **Primary Intended Users**:  
  - Non-specialist healthcare workers, such as general practitioners, nurses, and community health workers in rural India.  
  - Healthcare providers in underserved regions lacking access to specialized dermatological support.

- **Out-of-Scope Use Cases**:  
  - Providing a definitive medical diagnosis without specialist consultation.  
  - Use in advanced clinical settings where dermatologists and specialized diagnostic tools are available.  
  - Screening for skin conditions outside the scope of the model's training (e.g., non-cancerous cosmetic dermatology issues).


## Factors

- **Relevant Factors**:  
  - Demographic and phenotypic groups, including variations in skin tone and lesion types.  
  - Environmental conditions like lighting in image capture.

- **Evaluation Factors**:  
  - Classification accuracy, precision, recall, and F1 score across demographic groups.  
  - Sensitivity to image preprocessing and augmentation techniques.
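
Evaluating a metric "across groups" means computing it separately within each group rather than once over the whole test set. The sketch below shows this for recall on a single class. The `groups` annotations are hypothetical and would have to be supplied separately, and `recall_by_group` is an illustrative name rather than part of the project code.

```python
from collections import defaultdict


def recall_by_group(y_true, y_pred, groups, positive_class):
    """Recall for one class, computed separately within each group."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for t, p, g in zip(y_true, y_pred, groups):
        if t == positive_class:
            totals[g] += 1
            hits[g] += int(p == positive_class)
    # Groups with no positive samples are omitted rather than reported as 0.
    return {g: hits[g] / totals[g] for g in totals}


# Toy data, not model output: group "A" has 2 positives, group "B" has 3.
y_true = [1, 1, 1, 1, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1]
groups = ["A", "A", "B", "B", "A", "B"]
result = recall_by_group(y_true, y_pred, groups, positive_class=1)
print(result)  # {'A': 1.0, 'B': 0.3333333333333333}
```

A large gap between groups, as in this toy example, is the kind of disparity the evaluation factors are meant to surface.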

---



## Metrics

- **Model Performance Measures**:  
  - Accuracy: Overall percentage of correctly classified samples.  
  - Precision: Proportion of true positives among predicted positives.  
  - Recall (Sensitivity): Proportion of true positives among actual positives.  
  - F1 Score: Harmonic mean of precision and recall.
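
For a multi-class problem these measures are usually computed per class (one-vs-rest) and then averaged. The following is a small dependency-free sketch of that calculation with macro averaging; the function names are illustrative, and treating an undefined ratio (zero denominator) as 0.0 is an assumption about the convention used.

```python
def per_class_counts(y_true, y_pred, num_classes):
    """Return (tp, fp, fn) lists indexed by class."""
    tp = [0] * num_classes
    fp = [0] * num_classes
    fn = [0] * num_classes
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    return tp, fp, fn


def safe_div(num, den):
    """Division that returns 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def classification_metrics(y_true, y_pred, num_classes):
    tp, fp, fn = per_class_counts(y_true, y_pred, num_classes)
    precision = [safe_div(tp[c], tp[c] + fp[c]) for c in range(num_classes)]
    recall = [safe_div(tp[c], tp[c] + fn[c]) for c in range(num_classes)]
    f1 = [safe_div(2 * p * r, p + r) for p, r in zip(precision, recall)]
    return {
        "accuracy": safe_div(sum(tp), len(y_true)),
        "precision": precision,
        "recall": recall,
        "macro_f1": sum(f1) / num_classes,
    }


# Toy labels, not model output: class 0 is common, class 2 is rare and always missed.
metrics = classification_metrics([0, 0, 0, 0, 1, 1, 2], [0, 0, 0, 0, 1, 0, 0], 3)
print(metrics["recall"])  # [1.0, 0.5, 0.0]
```

Because macro averaging weights every class equally, a rare class with zero recall pulls the macro F1 well below the overall accuracy.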

---

## Evaluation Data

- **Datasets**:  
  The evaluation was conducted on a subset of DermaMNIST, which is split into training, validation, and test sets. Images were resized to `(224, 224)` and normalized using the ImageNet mean (`[0.485, 0.456, 0.406]`) and standard deviation (`[0.229, 0.224, 0.225]`).

- **Motivation**:  
  The evaluation data were prepared with the same preprocessing and transformations throughout so that the model is assessed fairly. Resizing and normalization match the input size and statistics expected by the pretrained ResNet-18 weights.

- **Preprocessing**:  
  - Pixel values were scaled to `[0, 1]` by dividing by 255.  
  - A `transforms.Compose` pipeline performed the resizing and the ImageNet mean/standard-deviation normalization.  
  - For additional evaluation, data augmentation techniques like horizontal and vertical flips, rotation, and color jitter were applied during testing to assess model robustness.

---





## Training Data

- **Datasets**:  
  Training data consisted of the DermaMNIST dataset, augmented to increase diversity and robustness. Images were resized to `(224, 224)` and normalized in the same way as the evaluation data.

- **Preprocessing**:  
  - Images were converted to tensors and resized to `(224, 224)`.  
  - Normalization used the ImageNet mean and standard deviation.  
  - Data augmentation techniques were extensively applied, including horizontal/vertical flips, random rotations, and color jittering.  
  - For each original image, multiple augmented versions were generated and added to the dataset using the provided augmentation function, with unique identifiers for each augmented sample.

---

## Quantitative Analyses

- **Unitary Results**:  
  - Accuracy: 75%  
  - Macro F1 Score: 0.539

- **Intersectional Results**:  
  - Performance variation observed across lesion types, with lower recall for rarer lesion classes.

---



## Ethical Considerations

- Potential bias in training data due to overrepresentation of certain demographics or lesion types.  
- The model is not a substitute for clinical expertise; until it has been clinically validated, it should be used only for research purposes.  
- Misclassifications could lead to inappropriate confidence in model predictions.

---



## Caveats and Recommendations

- The model's performance depends on the quality of the input images, and it generalizes poorly to out-of-distribution samples.  
- Clinical use requires thorough validation and expert oversight.  
- Future work should focus on improving robustness to demographic and environmental variations.