<a href="https://colab.research.google.com/github/locastre/pyCERR/blob/main/autosegment_CT_Heart_OARs.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Introduction

In this tutorial, we will demonstrate how to apply a pre-trained AI model to segment the Heart sub-structures on a lung CT scan using pyCERR.

## Requirements
* Python>=3.8
* Applying this model requires access to a GPU.  

## AI model
* The segmentation model was trained and validated on CT scans used for RT planning. It does not work optimally on diagnostic CTs or scans in positions other than Head First Supine.
* The trained model is distributed along with python libraries and other dependencies via a conda package.

## Required input data
* RT planning DICOM CT
* Lung segmentation in RTSTRUCT

## Running the model

Update the locations of the input data and model directories in section 2 of this notebook.
* Conda package location: condaEnvDir
* Inference script location: wrapperPath
* Inference script args
```python
!python {wrapperPath} {input_nii_directory} {output_nii_directory}
```
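
The same call can be assembled from Python before it is handed to a shell. The following is a minimal sketch, assuming the wrapper takes the input and output NIfTI directories as its two positional arguments, as shown above. `build_inference_command` is a hypothetical helper name, and the paths in the usage line are placeholders, not locations from this notebook.

```python
import shlex

def build_inference_command(activate_script, wrapper_path, input_nii_dir, output_nii_dir):
    """Return a bash command that activates the conda env and runs the wrapper.

    Hypothetical helper for illustration; each path is shell-quoted so that
    spaces or special characters do not split the arguments.
    """
    quoted = [shlex.quote(p) for p in
              (activate_script, wrapper_path, input_nii_dir, output_nii_dir)]
    return "source {} && python {} {} {}".format(*quoted)

# Placeholder paths, chosen only to show the quoting behaviour
cmd = build_inference_command("/opt/env/bin/activate", "/opt/run.py",
                              "/data/in dir", "/data/out")
print(cmd)
```

Quoting matters mainly when the session directory name contains spaces; for paths without special characters the command is unchanged.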

## License
By downloading the software you are agreeing to the following terms and conditions as well as to the Terms of Use of CERR software.

THE SOFTWARE IS PROVIDED "AS IS" AND CERR DEVELOPMENT TEAM AND ITS COLLABORATORS DO NOT MAKE ANY WARRANTY, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE, NOR DO THEY ASSUME ANY LIABILITY OR RESPONSIBILITY FOR THE USE OF THIS SOFTWARE.
    
This software is for research purposes only and has not been approved for clinical use.

Software has not been reviewed or approved by the Food and Drug Administration, and is for non-clinical, IRB-approved Research Use Only. In no event shall data or images generated through the use of the Software be used in the provision of patient care.

You may publish papers and books using results produced using software provided that you reference the appropriate citations:
*  Heart sub-structures model: https://doi.org/10.1016/j.phro.2020.05.009
*  CERR library of model implementations: https://doi.org/10.1016/j.ejmp.2020.04.011
*  CERR software: https://doi.org/10.1118/1.1568978
*  CERR radiomics: https://doi.org/10.1002/mp.13046


YOU MAY NOT DISTRIBUTE COPIES of this software, or copies of software derived from this software, to others outside your organization without specific prior written permission from the CERR development team except where noted for specific software products.

All Technology and technical data delivered under this Agreement are subject to US export control laws and may be subject to export or import regulations in other countries. You agree to comply strictly with all such laws and regulations and acknowledge that you have the responsibility to obtain such licenses to export, re-export, or import as may be required after delivery to you.



##  Define paths for input DICOM, output segmentation and session directories

Specify paths to the DICOM input data, the desired output directory, and the temporary (session) directory used to store intermediate results. The input data is structured such that the DICOM files for each patient scan are in their own directory.

&nbsp;&nbsp;Input Directory  
&nbsp;&nbsp;&nbsp;&nbsp;Pat1  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;img1.dcm  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;img2.dcm  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;....  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;....  
&nbsp;&nbsp;&nbsp;&nbsp;Pat2  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;img1.dcm  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;img2.dcm  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;....  
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;....  
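
Before running the pipeline, it can help to confirm that the input directory follows this layout. The sketch below is an illustration only, using the standard library; `list_patient_dirs` is a hypothetical helper, and the throwaway tree it is run on is synthetic (empty files with a `.dcm` extension), not real DICOM data.

```python
import os
import tempfile

def list_patient_dirs(input_dir):
    """Map each patient sub-directory name to the number of .dcm files it holds."""
    summary = {}
    for name in sorted(os.listdir(input_dir)):
        patient_dir = os.path.join(input_dir, name)
        if not os.path.isdir(patient_dir):
            continue  # skip stray files at the top level
        dcm_files = [f for f in os.listdir(patient_dir)
                     if f.lower().endswith(".dcm")]
        summary[name] = len(dcm_files)
    return summary

# Demo on a temporary directory tree that mimics the layout above
with tempfile.TemporaryDirectory() as input_dir:
    for patient, n_files in (("Pat1", 2), ("Pat2", 3)):
        os.makedirs(os.path.join(input_dir, patient))
        for i in range(n_files):
            open(os.path.join(input_dir, patient, f"img{i + 1}.dcm"), "wb").close()
    summary = list_patient_dirs(input_dir)

print(summary)
```

A patient directory that reports zero files is a sign that the download or copy step did not place the DICOM files where the loop over `inputDicomPath` expects them.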

In [1]:
import os
# work dir
#workDir = r'/home/jupyter' #
workDir = r'/content'  # Colab; for a local run, use e.g. r'AI_workshop/heart_model'

# Location of dicom data
dataDownloadDir = os.path.join(workDir,'input')

os.makedirs(dataDownloadDir,exist_ok=True)

In [2]:
wrapperInstallDir = os.path.join(workDir, 'CT_cardiac_structures_deeplab')
wrapperPath = os.path.join(wrapperInstallDir, 'model_wrapper','runSegmentation.py')
#modelWeightZipPath = os.path.join(wrapperInstallDir,'model.gz')
modelWeightPath = os.path.join(wrapperInstallDir,'model')
#modelHash = 'H4sIAAAAAAAAAwXBUQqAIBAFwP8O46YlZLexLTVELN9S0umbSSIXVqKCzKy22hXXQki+HTtBvJxM1Zq5I1o4fY+hvdnrMTwIi5mcRYwqfsMPzAV1z0gAAAA='

# Location of the conda environment packaged with the model
#heartCondaEnvDir = os.path.join(workDir, 'pretrainedHeartModel')
condaEnvDir = os.path.join(wrapperInstallDir, 'cpack')

# Path to conda environment activate script
condaEnvActivateScript = os.path.join(condaEnvDir, 'bin', 'activate')

# Downloads

## Install the model and download planning CT DICOM from XNAT to ***inputDicomPath***

In [3]:
model_installer_git = 'https://github.com/cerr/model_installer.git'
os.chdir(workDir)
!git clone  {model_installer_git}

Cloning into 'model_installer'...
remote: Enumerating objects: 49, done.
remote: Counting objects: 100% (49/49), done.
remote: Compressing objects: 100% (44/44), done.
remote: Total 49 (delta 15), reused 26 (delta 5), pack-reused 0
Receiving objects: 100% (49/49), 14.14 KiB | 329.00 KiB/s, done.
Resolving deltas: 100% (15/15), done.


In [5]:
os.chdir(os.path.join(workDir,'model_installer'))
!source installer.sh -m 1 -d {workDir} -p C

      ___           ___           ___           ___     
     /  /\         /  /\         /  /\         /  /\    
    /  /:/        /  /:/_       /  /::\       /  /::\   
   /  /:/        /  /:/ /\     /  /:/\:\     /  /:/\:\  
  /  /:/  ___   /  /:/ /:/_   /  /:/~/:/    /  /:/~/:/  
 /__/:/  /  /\ /__/:/ /:/ /\ /__/:/ /:/___ /__/:/ /:/___
 \  \:\ /  /:/ \  \:\/:/ /:/ \  \:\/:::::/ \  \:\/:::::/
  \  \:\  /:/   \  \::/ /:/   \  \::/~~~~   \  \::/~~~~ 
   \  \:\/:/     \  \:\/:/     \  \:\        \  \:\     
    \  \::/       \  \::/       \  \:\        \  \:\    
     \__\/         \__\/         \__\/         \__\/    
 
Medical Physics Department, Memorial Sloan Kettering Cancer Center, New York, NY
 
Welcome to the CERR segmentation model installer! For usage information, run with -h flag
 
git clone https://github.com/cerr/CT_cardiac_structures_deeplab.git
Cloning into 'CT_cardiac_structures_deeplab'...
remote: Enumerating objects: 109, done.
remote: Counting objects: 100% (109/1

In [4]:
%%capture
pip install pyxnat

In [7]:
#Paths to input data and conda env with pre-trained models
#inputDicomPath = os.path.join(dataDownloadDir,'rtog-0617','0617_test')  # Replace with the appropriate path to your dataset

inputDicomPath = os.path.join(workDir,'input')
outputDicomPath = os.path.join(workDir, 'output')
sessionPath = os.path.join(workDir, 'session')

# exist_ok=True makes a separate existence check unnecessary
os.makedirs(outputDicomPath, exist_ok=True)
os.makedirs(sessionPath, exist_ok=True)


In [6]:
from pyxnat import Interface
import urllib3, shutil
from glob import glob
urllib3.disable_warnings()

xhost = 'https://pixnat.mskcc.org'
user = 'locastre'
scandict = [{'proj':'RTOG0617','subj':'SPI_XNAT_S00196','exp':'SPI_XNAT_E00348', 'scan_list':['1','2']},{'proj':'RTOG0617','subj':'SPI_XNAT_S00195','exp':'SPI_XNAT_E00347','scan_list':['1','2']}]

In [10]:
def getXNATData(xhost,user,scandict,downloadDir):
  xnat = Interface(xhost, user, verify=False)
  os.makedirs(downloadDir, exist_ok=True)
  for scan_entry in scandict:
    proj = scan_entry['proj']
    subj = scan_entry['subj']
    exp = scan_entry['exp']
    scan_list = scan_entry['scan_list']
    expdir = os.path.join(downloadDir,exp)
    os.makedirs(expdir, exist_ok = True)
    xexp = xnat.select.project(proj).subject(subj).experiment(exp)
    for scan in scan_list:
      try:
        xexp.scan(scan).resource('DICOM').get(downloadDir,extract=True)
      except Exception:
        # Fall back to the 'secondary' resource when 'DICOM' cannot be fetched
        xexp.scan(scan).resource('secondary').get(downloadDir,extract=True)
    for dcmfolder in ['DICOM','secondary']:
      dcmlist = glob(os.path.join(downloadDir,dcmfolder,'*.dcm'))
      print(dcmlist)
      for dcm in dcmlist:
        shutil.move(dcm, expdir)
  for dcmfolder in ['DICOM','secondary']:
    if os.path.exists(os.path.join(downloadDir,dcmfolder)):
      os.rmdir(os.path.join(downloadDir,dcmfolder))
    if os.path.exists(os.path.join(downloadDir,dcmfolder + '.zip')):
      os.remove(os.path.join(downloadDir,dcmfolder + '.zip'))
  xnat.disconnect()

In [11]:
getXNATData(xhost,user,scandict,inputDicomPath)

['/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-59-x5tmwa.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-48-1t7idsf.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-32-1cemjsq.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-40-155ln4r.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-45-1cncsqf.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-64-88rrvo.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-55-g514yj.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-58-7gj0b1.dcm', '/content/input/DICOM/1.3.6.1.4.1.14519.5.2.1.6329.6468.109218621610560682044382179111-1-92-1mwrbch.dcm', '/content/input/DICOM/1.3.6.1.4.1.1451

# Install pyCERR

pyCERR is used for pre- and post-processing of DICOM data as required by the model.

In [12]:
! pip install "pyCERR[napari] @ git+https://github.com/cerr/pyCERR.git@testing"

Collecting pyCERR[napari]@ git+https://github.com/cerr/pyCERR.git@testing
  Cloning https://github.com/cerr/pyCERR.git (to revision testing) to /tmp/pip-install-f4ml8om8/pycerr_722a8eca98db47e1af614f5434b2949c
  Running command git clone --filter=blob:none --quiet https://github.com/cerr/pyCERR.git /tmp/pip-install-f4ml8om8/pycerr_722a8eca98db47e1af614f5434b2949c
  Running command git checkout -b testing --track origin/testing
  Switched to a new branch 'testing'
  Branch 'testing' set up to track remote branch 'testing' from 'origin'.
  Resolved https://github.com/cerr/pyCERR.git to commit 42633b9a107dec55f0c0bd79ea60d2dc68aaa60a
  Installing build dependencies ... done
  Getting requirements to build wheel ... done
  Installing backend dependencies ... done
  Preparing metadata (pyproject.toml) ... done
Collecting pydicom (from pyCERR[napari]@ git+https://github.com/cerr/pyCERR.git@testing)
  Downloading pydicom-2.4.4-py3-none-any.whl (

# Functions for data pre- and post-processing

## Crop scan to Lung extents

In [13]:
from cerr.dataclasses import structure as cerrStr
from cerr.contour import rasterseg as rs
from cerr.utils import aiPipeline, mask
import numpy as np

def processInputData(scanNum, planC, lungNameList=('Lung_total', 'Lung_L', 'Lung_R')):

    if isinstance(lungNameList, str):
        lungNameList = [lungNameList]

    # Extract scanArray
    scan3M = planC.scan[scanNum].getScanArray()
    mask3M = np.zeros(scan3M.shape, dtype=bool)

    # List of Structure names
    strNames = [s.structureName for s in planC.structure]
    numOrigStructs = len(strNames)

    # Get total lung mask
    for lungName in lungNameList:
        lungInd = cerrStr.getMatchingIndex(lungName, strNames, 'exact')
        if len(lungInd) > 0:
            # Get lung extents
            mask3M = mask3M | rs.getStrMask(lungInd[0], planC)

    if not np.any(mask3M):
        raise Exception('Lung contour name did not match any structures in planC')

    # Create cropped scan from the bounding box of the lung mask,
    # using the same scan (scanNum) that the mask was built on
    rmin,rmax,cmin,cmax,smin,smax,_ = mask.compute_boundingbox(mask3M)
    x,y,z = planC.scan[scanNum].getScanXYZVals()
    xCropV = x[cmin:cmax]
    yCropV = y[rmin:rmax]
    zCropV = z[smin:smax]
    scanCrop3M = scan3M[rmin:rmax,cmin:cmax,smin:smax]

    return scanCrop3M, (xCropV, yCropV, zCropV)
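
The cropping step can be tried in isolation on synthetic data. The following is a minimal NumPy sketch of bounding-box cropping; `bounding_box` is a hypothetical stand-in written for this example and is not pyCERR's `mask.compute_boundingbox`. In particular, the inclusive upper bounds used here are an assumption of the sketch and may differ from the convention pyCERR follows.

```python
import numpy as np

def bounding_box(mask3M):
    """Return inclusive (rmin, rmax, cmin, cmax, smin, smax) of the True voxels."""
    rows = np.any(mask3M, axis=(1, 2))   # rows containing any mask voxel
    cols = np.any(mask3M, axis=(0, 2))   # columns containing any mask voxel
    slcs = np.any(mask3M, axis=(0, 1))   # slices containing any mask voxel
    rmin, rmax = np.where(rows)[0][[0, -1]]
    cmin, cmax = np.where(cols)[0][[0, -1]]
    smin, smax = np.where(slcs)[0][[0, -1]]
    return int(rmin), int(rmax), int(cmin), int(cmax), int(smin), int(smax)

# Synthetic 10x10x10 "scan" with a box-shaped "lung" mask
scan3M = np.arange(1000).reshape(10, 10, 10)
mask3M = np.zeros(scan3M.shape, dtype=bool)
mask3M[2:5, 3:8, 1:4] = True

rmin, rmax, cmin, cmax, smin, smax = bounding_box(mask3M)
# +1 because the bounds returned by this sketch are inclusive
scanCrop3M = scan3M[rmin:rmax + 1, cmin:cmax + 1, smin:smax + 1]
print(scanCrop3M.shape)
```

Cropping to the lung extents reduces the volume passed to the model while keeping the heart, which lies between the lungs, inside the field of view.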


## Import AI segmentations to planC and retain only the largest connected component for each structure

In [14]:
#Import label map to CERR
import glob
from cerr import plan_container as pc

atriaLabelDict = {1: 'DL_Atria'}
heartSubSegDict = {2: 'DL_AORTA', 3: 'DL_LA',
                   4: 'DL_LV', 5: 'DL_RA',
                   6: 'DL_RV', 7: 'DL_IVC',
                   8: 'DL_SVC', 9: 'DL_PA'}
heartSegDict = {1: 'DL_heart'}
periLabelDict = {1: 'DL_Pericardium'}
ventriLabelDict = {1: 'DL_Ventricles'}

def postProcAndImportSeg(outputDir,procScanNum,scanNum,planC):
    niiGlob = glob.glob(os.path.join(outputDir,'*.nii.gz'))
    for segFile in niiGlob:
        print('Importing ' + segFile + '...')
        # Select the label dictionary that matches the output file name
        if 'heart.nii.gz' in segFile:
            strToLabelMap = heartSubSegDict
        elif 'heartStructure.nii.gz' in segFile:
            strToLabelMap = heartSegDict
        elif 'atria.nii.gz' in segFile:
            strToLabelMap = atriaLabelDict
        elif 'pericardium.nii.gz' in segFile:
            strToLabelMap = periLabelDict
        elif 'ventricles.nii.gz' in segFile:
            strToLabelMap = ventriLabelDict
        else:
            # Skip unrecognized files rather than reuse a stale label dictionary
            print('Skipping ' + segFile + ': no matching label dictionary')
            continue
        numLabel = len(strToLabelMap)
        numStrOrig = len(planC.structure)
        planC = pc.load_nii_structure(segFile, scanNum, planC, \
                                  labels_dict = strToLabelMap)
        numStructs = len(planC.structure)
        cpyStrNumV = np.arange(numStrOrig,numStructs)
        numConnComponents = 1
        for structNum in cpyStrNumV:
            _, planC = cerrStr.getLargestConnComps(structNum, numConnComponents, planC, \
                                            saveFlag=True, replaceFlag=True)

    return planC
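
To illustrate what keeping the largest connected component does to a label map, here is a small sketch on synthetic data. It uses `scipy.ndimage.label` as a swapped-in technique and is not the implementation behind `cerrStr.getLargestConnComps`; `largest_component` is a hypothetical helper, and the label value 4 is taken from `heartSubSegDict` above (`DL_LV`).

```python
import numpy as np
from scipy import ndimage

def largest_component(mask3M):
    """Keep only the largest connected component of a binary mask."""
    labels3M, num = ndimage.label(mask3M)  # default structure: face connectivity
    if num == 0:
        return np.zeros(mask3M.shape, dtype=bool)
    sizes = np.bincount(labels3M.ravel())
    sizes[0] = 0  # ignore the background label
    return labels3M == sizes.argmax()

# Synthetic label map: one 27-voxel blob plus a stray voxel, both labelled 4
labelMap3M = np.zeros((6, 6, 6), dtype=np.uint8)
labelMap3M[0:3, 0:3, 0:3] = 4
labelMap3M[5, 5, 5] = 4

lvMask3M = largest_component(labelMap3M == 4)
print(int(lvMask3M.sum()))
```

The stray voxel is dropped because it is not face-connected to the main blob, which is the kind of isolated false positive this post-processing step is meant to remove.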

# Segment OARs for all the CT scans located at *inputDicomPath*

In [16]:
#%%capture
import os
import subprocess
import numpy as np
import cerr
from cerr import plan_container as pc
from cerr.dataclasses import scan as cerrScn
from cerr.utils.aiPipeline import createSessionDir
from cerr.dcm_export import rtstruct_iod

# Loop over the per-scan DICOM directories in inputDicomPath
fileList = os.listdir(inputDicomPath)
numFiles = len(fileList)
modality = 'CT SCAN'
lungNameList = ['LUNG_TOTAL', 'LUNG_CNTR', 'LUNG_IPSI']

for iFile in range(numFiles):

    dcmDir = os.path.join(inputDicomPath,fileList[iFile])

    # Create session dir to store temporary data
    modInputPath, modOutputPath = createSessionDir(sessionPath, inputDicomPath)

    # Import DICOM to planC
    planC = pc.load_dcm_dir(dcmDir)

    # Identify scan index in  planC
    scanIdS = {"imageType": modality}
    matchScanV = aiPipeline.getScanNumFromIdentifier(scanIdS, planC, False)
    scanNum = matchScanV[0]

    # Pre-process data
    procScan3M, resizeGridS = processInputData(scanNum, planC, lungNameList)
    planC = pc.import_scan_array(procScan3M, resizeGridS[0], \
            resizeGridS[1], resizeGridS[2], modality, scanNum, planC)
    procScanNum = len(planC.scan) - 1

    # Export inputs to NIfTI
    scanFilename = os.path.join(modInputPath, f"{fileList[iFile]}_scan_3D.nii.gz")
    planC.scan[procScanNum].save_nii(scanFilename)

    numOrigStructs = len(planC.structure)

    # Apply model
    runScript = "source " + condaEnvActivateScript + " && python " + wrapperPath \
                + " " + modInputPath + " " + modOutputPath
    print(runScript)
    subprocess.run(runScript,
                   capture_output=False,
                   shell=True,
                   executable="/bin/bash")

    # Import results to planC
    planC = postProcAndImportSeg(modOutputPath,procScanNum,scanNum,planC)

    numStructs = len(planC.structure)

    # Export segmentations to DICOM
    structFileName = fileList[iFile] + '_AI_seg.dcm'
    structFilePath = os.path.join(outputDicomPath,structFileName)
    structNumV = np.arange(numOrigStructs, numStructs)
    indOrigV = np.array([cerrScn.getScanNumFromUID(planC.structure[structNum].assocScanUID, planC) for structNum in structNumV], dtype=int)
    origIndsToExportV = structNumV[indOrigV == scanNum]
    seriesDescription = "AI Generated"
    exportOpts = {'seriesDescription': seriesDescription}
    rtstruct_iod.create(origIndsToExportV,structFilePath,planC,exportOpts)


('0617-292370', '0617-292370', '1.3.6.1.4.1.14519.5.2.1.6329.6468.180596963996965825122162035932', '1.3.6.1.4.1.14519.5.2.1.6329.6468.314188004903460989626820333570', 'CT', 'CT', 'CT', 'CT', 'CT', 'CT', 'CT')
('0617-292370', '0617-292370', '1.3.6.1.4.1.14519.5.2.1.6329.6468.180596963996965825122162035932', '1.3.6.1.4.1.14519.5.2.1.6329.6468.880278915833505391469023528647', 'RTSTRUCT', 'RTSTRUCT', 'RTSTRUCT', 'RTSTRUCT', 'RTSTRUCT', 'RTSTRUCT', 'RTSTRUCT')
source /content/CT_cardiac_structures_deeplab/cpack/bin/activate && python /content/CT_cardiac_structures_deeplab/model_wrapper/runSegmentation.py /content/session/sessioninput183139128.788/inputNii /content/session/sessioninput183139128.788/outputNii


IndexError: index 0 is out of bounds for axis 0 with size 0

# Optional - Copy the contents of outputDicomPath to the Workspace bucket

In [None]:
# workspaceBucket = os.environ['WORKSPACE_BUCKET']
# !gcloud storage cp -r {outputDicomPath} {workspaceBucket}

# Display results for the last CT scan

## Display using matplotlib

In [None]:
from cerr.viewer import showMplNb

showMplNb(scanNum, origIndsToExportV, planC,\
          windowCenter=-400, windowWidth=2000)
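
The `windowCenter` and `windowWidth` arguments describe a display window over CT intensities. As a rough sketch of the conventional linear window, and assuming the viewer applies something similar internally (this is not taken from the pyCERR source), a center of -400 and a width of 2000 map the range from -1400 to 600 onto the displayed grayscale. `apply_window` is a hypothetical helper.

```python
import numpy as np

def apply_window(scan3M, windowCenter, windowWidth):
    """Clip intensities to the display window and rescale them to [0, 1]."""
    lower = windowCenter - windowWidth / 2
    upper = windowCenter + windowWidth / 2
    clipped = np.clip(scan3M, lower, upper)
    return (clipped - lower) / (upper - lower)

# Sample intensities below, inside, and above the window
hu = np.array([-2000.0, -1400.0, -400.0, 600.0, 1000.0])
print(apply_window(hu, windowCenter=-400, windowWidth=2000))
```

Values outside the window saturate to black or white, so a wide window such as this one keeps both lung and soft tissue visible at the cost of soft-tissue contrast.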
