Computer Vision

Flatiron Data Science Program

Module 4 Project - Neural Networks

December 15th, 2020

Computer Vision

X-Ray Image Classification

Overview

Objective: Build a model that can classify whether a patient has pneumonia, given a chest x-ray image.

Repository Contents:

- images : images for display through this analysis
- project_4_notebook.ipynb : Google Colab notebook containing full analysis/modeling
- README.md : project summary and contents
- Mod04_presentation.pdf : presentation slides with comments

Data

"Chest X-ray images (anterior-posterior) were provided from pediatric patients of Guangzhou Women and Children’s Medical Center, Guangzhou, China. The diagnoses for the images were then graded by two expert physicians before being cleared for training the AI system."

In the above images, some opacity can be observed in the bottom row of pneumonia lungs. This is due to the X-ray picking up the fluid filled air sacs due to the infection.

Sources: original study source, data download source/Kaggle competition

Classifier

Full analysis: Colab Notebook

For this project I wanted to optimize the model to arrive at as few fasle predictions as possible. False positives could be costly in resources for the provider and patient, or result in unnecessary treatment. False negatives could allow illness to be missed and treatment delayed. Thus I selected accuracy and the AUC (area under the ROC curve) as target metrics for a balanced model, for both penalize false predictions.

The final model was trained on images with the following preprocessing:

- Images were resized to 124 x 124 pixels, with 3 RGB color channels
- Pixel values were normalized to a 0-1 scale
- To prepare the model to discern noise, four data augmentations were used: rotation, vertical/horizontal shifting, and zoom
- The imbalanced data set (75% pnemonia vs. 25% normal X-rays) was corrected by applying class weights

The final model arcitecture was a convolutional neural network (CNN) with 3 convolution blocks (convolution, drop out, pool).

Resulting in performances scores of:

- Accuracy - 89.77%
- Recall - 94 %
- Precision - 82 %
- AUC - 0.88

Reccomendations

- Continue collecting labeled images to progressively train the model.
- Store image data at 128 x 128 to conserve storage memory (this is up to a 10% reduction in original image size).
- Use the model to improve efficiency of Xray review.

Future Work

This is a supervised learning task and thus performance is based on the quality of the dataset used.

- Collect more labeled images or continue data augmentation to increase the quantity of images in the training set.
- Try transfer learning - use an established x-ray classifier and build model on top of that.
- Progressively resize the model input image size to find the smallest possible input size without sacrificing performance.

Thank you for viewing my project!

Please review the full analysis in the Colab Notebook or view my presentation slideshow.

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
images		images
.canvas		.canvas
.gitignore		.gitignore
Mod04_Presentation.pdf		Mod04_Presentation.pdf
README.md		README.md
project_4_notebook.ipynb		project_4_notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computer Vision

Overview

Data

Classifier

Reccomendations

Future Work

Thank you for viewing my project!

About

Contributors 2

Languages

anna-dang/mod04-computer_vision_CNN

Folders and files

Latest commit

History

Repository files navigation

Computer Vision

Overview

Data

Classifier

Reccomendations

Future Work

Thank you for viewing my project!

About

Topics

Resources

Stars

Watchers

Forks

Contributors 2

Languages