You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repository contains the codes for reproducibility of the results using the ROI Hide and Seek Protocol
The new coronavirus unleashed a worldwide pandemic in early 2020, and a fatality rate roughly 10 times that of the flu. As the number of infections soared, and the capabilities for testing lagged behind, chest imaging became essential in the early diagnosis and treatment planning for patients with suspected or confirmed COVID-19 infection. In a few weeks, new methods for lung screening using machine learning (ML) exploded, while quality assurance discussions remain timid. This paper discusses crucial assessments missing in such tools proposed in the recent literature, and shows several blind-spots by running a set of systematic tests over automated methods that offer a more realistic perspective on the advantages and limitations of ML when using a few heterogeneous data sources.
run COVID-Net/Create_COVIDx.ipynb making sure to change paths of source folders to ~/base/data_sources/ and change destination path to ~/base/datasets/data
we will be creating two test sets, one from the cohen dataset masks availible and another from the NIH dataset provided by the XLSor paper
run ~/base/COVID_19_CXR_CLASSIFICATION/create_LungSegData.ipynb (make sure directories in .ipynb are correct and point to the location that you downloaded these sources)
step 6: train segmentation model
mkdir segmentation_models
run train_unet.py, make sure that the TRAIN_PATH and TEST_PATH and MODEL_PATH point to the correct locations
step 6.5 evaluate segmentation model (optional but recommended)
run /base/COVID_19_CXR_CLASSIFICATION/segmentation_scripts/eval_unet.py to get the performance metrics of the trained UNET model on the NIH test set and the Cohen Test set for segmentation
expected performance:
NIH test data set
metric
class 0
class 1
recall
.981
.968
precision
.991
.934
jaccard
.973
.906
f1
.986
.95
accuracy
.979
.979
Cohen test data set
metric
class 0
class 1
recall
.971
.919
precision
.971
.923
jaccard
.945
.854
f1
.971
.918
accuracy
.958
.958
step 7: use unet to create modified COVIDx5 datasets
run /base/COVID_19_CXR_CLASSIFICATION/segmentation_scripts/create_modified_COVIDx5.ipynb, make sure in folder points to base/datasets/classification/data and out folders point to folders listed above