GitHub - data-pioneer/MLops-Chest-Disease-Classification-from-Chest-CT-Scan-Image-: Implementation of MLops pipeline for Chest Disease Classification from Chest CT Scan Images using computer vision Vgg16 pretrained Image classification model. further perform deployment on AWS EC2 using Docker, CI/CD Jenkins tool, using Flask as front end interface.

MLops-Chest-Disease-Classification-from-Chest-CT-Scan-Image

This repository implements an MLOps pipeline for classifying chest diseases from chest CT scan images using a VGG16 deep learning model.
The project leverages various tools to ensure efficient development, deployment, and monitoring:
Machine Learning Model: VGG16 pre-trained convolutional neural network (CNN) for image classification.
Download: Images are downloaded from Google Drive (https://drive.google.com/file/d/1zD6V2BSjrO-QWwFPpv7s3JaqtemFoL98/view?usp=sharing).

Key Technologies:

Version Control: DVC tracks data versions for reproducibility.
Experiment Tracking & Model Registry:
- MLflow: Logs experiments, tracks metrics, and stores models for comparison and deployment.
- Dagshub: Collaboratively manages the MLOps pipeline and integrates with MLflow for experiment tracking.
CI/CD Pipeline: GitHub Actions automates building, testing, and deployment upon code changes.
Deployment:
- Jenkins server running on EC2 instance 1
- Containerization with Docker running on Elastic Container Registry
- Deployment on AWS EC2 instances 2, application running using flask interface

Project Structure:

/src/cnnClassifier/components/ DataIngestion: During this phase CT scan images are downloaded from Google drive using gdown package. Images are preprocessed to normalize the pixel values and stored in local folder.
/src/cnnClassifier/components/ PrepareBaseModel: VGG16, a pre-trained model is used to prepared a base CNN model. Then I have customized VGG16 model to train on image dataset by dropping output dense layer, adding the custom dense layer as my dataset has two output classes eg: normal or adenocarcinoma.
/src/cnnClassifier/components/ Training: I have trained the custom CNN VGG15 base model on image dataset. Further, image dataset is split into train and testing dataset during this phase.
/src/cnnClassifier/components/ Evaluation: Model's performance is evaluated on a test dataset. Various evaluation metrics are also used to determine its performance like F1-score, accuracy, precision, recall.
Template/index.html: This file contains user interface using flask
Dvc.yaml: This file contains for DVC pipeline tracking code.
Params.yaml: This file contains model related code.
Config/ config.yaml: constants are defined inside this file
Dvc.yaml: This file contains code to control pipeline tracking.
.Jenkins/Jenkinsfile : This file handle the Jenkins pipeline with GitHub.
.github/workflows/main.yaml : it trigger Jenkins deployment on EC2 instance 1.

Flow Chart of Project Architecture

Workflows of each component updation

Update config.yaml
Update params.yaml
Update the entity
Update the configuration manager in src config
Update the components
Update the pipeline
Update the main.py
Update the dvc.yaml

Git commands

git add .

git commit -m "Updated"

git push origin main

How to run?

conda create -n chest python=3.8 -y

conda activate chest

pip install -r requirements.txt

python app.py

Mlflow dagshub connection uri

MLFLOW_TRACKING_URI="https://dagshub.com/data-pioneer/MLops-Chest-Disease-Classification-from-Chest-CT-Scan-Image-.mlflow"
MLFLOW_TRACKING_USERNAME="data-pioneer"
MLFLOW_TRACKING_PASSWORD="dfbf143408af6a87f453866fb9ee6b6fe0bba8a9"
python script.py

RUN from bash terminal or environment variable

export MLFLOW_TRACKING_URI=""""

export MLFLOW_TRACKING_USERNAME="""" 

export MLFLOW_TRACKING_PASSWORD=""

DVC cmd

dvc init
dvc repro
dvc dag

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.dvc		.dvc
.github/workflows		.github/workflows
.jenkins		.jenkins
config		config
model		model
research		research
scripts		scripts
src/cnnClassifier		src/cnnClassifier
templates		templates
.dockerignore		.dockerignore
.dvcignore		.dvcignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
demo.py		demo.py
docker-compose.yml		docker-compose.yml
dvc.lock		dvc.lock
dvc.yaml		dvc.yaml
inputImage.jpg		inputImage.jpg
main.py		main.py
params.yaml		params.yaml
requirements.txt		requirements.txt
scores.json		scores.json
setup.py		setup.py
template.py		template.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MLops-Chest-Disease-Classification-from-Chest-CT-Scan-Image

Key Technologies:

Project Structure:

Flow Chart of Project Architecture

Workflows of each component updation

Git commands

How to run?

Mlflow dagshub connection uri

RUN from bash terminal or environment variable

DVC cmd

About

Releases

Packages

Contributors 2

Languages

License

data-pioneer/MLops-Chest-Disease-Classification-from-Chest-CT-Scan-Image-

Folders and files

Latest commit

History

Repository files navigation

MLops-Chest-Disease-Classification-from-Chest-CT-Scan-Image

Key Technologies:

Project Structure:

Flow Chart of Project Architecture

Workflows of each component updation

Git commands

How to run?

Mlflow dagshub connection uri

RUN from bash terminal or environment variable

DVC cmd

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages