MultiTalent: A Multi-Dataset Approach to Medical Image Segmentation

Please cite the following work if you find this model useful for your research:

Ulrich, C., Isensee, F., Wald, T., Zenk, M., Baumgartner, M., & Maier-Hein, K.(2023). 
MultiTalent: A Multi-Dataset Approach to Medical Image Segmentation. arXiv preprint arXiv:2303.14444.

Please also cite the following work if you use this pipeline for training:

Isensee, F., Jaeger, P. F., Kohl, S. A., Petersen, J., & Maier-Hein, K. H. (2020). 
nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nature Methods, 1-9.

Requirements

This repository is based on nnU-Net V1. Please get familiar with it first.

Installation

The repository can be cloned and installed using the following commands.

git clone https://github.com/MIC-DKFZ/MultiTalent.git MultiTalent
cd MultiTalent
pip install -U .

Set the Paths of your enviroment according to nnU-Net V1

Note: We are working on an update to nnU-NetV2.
In nnU-Net V1, the project was still very experimental. In V2 we will make the project more user-friendly.

Apply MultiTalent

You can download a trained Multitalent (U-Net and Residual U-Net) model here using the following command:

nnUNet_download_pretrained_model Task100_MultiTalent

Please use the command above, because it makes some modifications to your plan files!

After that, you could run the following script for inference:

CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch --master_port=1224 --nproc_per_node=1 ./nnunet/inference/predict_MultiTalent.py -i inputpath -o outputpath -m path_to_model_folds

Fine-tuning MultiTalent

We recommend to preprocess your target dataset as we did for the MultiTalent dataset collection. We expect, that the dataset is already in the raw data folder as expected by nnU-Net V1.

nnUNet_plan_and_preprocess -t TASK_ID -pl3d ExperimentPlanner3D_v21_Pretrained --planner2d None -overwrite_plans Path_To_MultiTalent_Plan -overwrite_plans_identifier New_Plans_Name --verify_dataset_integrity

Path_To_MultiTalent_Plan should point to the plans you downloaded above.

You can fine-tune a MultiTalent model on a new task using the following command:

nnUNet_train 3d_fullres nnUNetTrainerV2_warmupsegheads TASK_ID Fold -p New_Plans_Name -pretrained_weights  path_to_pretrained_model/model_final_checkpoint.model

For fine-tuning a residual encoder, use the trainer nnUNetTrainerV2_warmupsegheads_resenc.
Note, that these trainers only train the new segmentation heads for the first 10 epochs, followed by a warm-up for the whole network.

Training with MultiTalent:

Dataset collection
In the following, we will give you an instruction how to get and preprocess our partially labelled dataset collection. You need to download the raw data individually from each dataset origin:
Dataset 1-6: MSD (only CT datasets)
Dataset 7 & 8: BTCV
Dataset 9: BTCV2
Dataset 10: StructSeg
Dataset 11: SegThor
Dataset 12: NIH-Pan
Dataset 13: KiTS19

After downloading, you have to save the files as expected by nnU-Net in your nnUNet_raw_data_base directory and to use the right folder structure (nnU-Net dataset conversion).
It is important that you choose the same folder names as we did:
'Task003_Liver'
'Task006_Lung'
'Task007_Pancreas'
'Task008_HepaticVessel'
'Task009_Spleen'
'Task010_Colon'
'Task017_AbdominalOrganSegmentation'
'Task046_AbdOrgSegm2'
'Task051_StructSeg2019_Task3_Thoracic_OAR'
'Task055_SegTHOR'
'Task062_NIHPancreas'
'Task064_KiTS_labelsFixed'
'Task018_PelvicOrganSegmentation'

Preprocessing for training
First, we need to generate our raw dataset for the multi-class training. Run the following script to copy all the images in the right folder, convert the labelmaps, generate the dataset.jason file and transpose the images if needed:

/nnunet/dataset_conversion/Task100_MultiTalent.py

It can take around 3 hours, depending on your system. If you want to extend or change the base dataset collection, you would need to adapt this file!

Now, we can use the nn-UNet preprocessing function with a specialized preprocessor:

nnUNet_plan_and_preprocess -t 100 -pl3d ExperimentPlanner3D_v21_MultiTalent -pl2d None -tf 16 --verify_dataset_integrity -overwrite_plans_identifier multitalent_bs4

Again, this takes some time.

This also generates a training plan file that we need for the following network training. By default, this plan generates a batchsize of 2. It is very easy to change the batchsize (see extending nnU-Net), but you could also use the plans that we provide.

Next, copy the ./splits_custom.pkl file in your preprocessed MultiTalent folder

We are almost done, but we need to add the information about the valid labels for each image to the .pkl files:

python /nnunet/dataset_conversion/Task100_MultiTalent_addregions.py

This takes only a few seconds.

Training of the Multi-Class network
First, you should take a look at the nnU-Net V1 distributed training instructions. To train a MultiTalent network, run the following command:

CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.launch --master_port=1234 --nproc_per_node=2 ./nnunet/run/run_training_DDP.py 3d_fullres MultiTalent_trainer_ddp 100  0 -p MultiTalent_bs4 --dbs

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
MultiTalent_plans		MultiTalent_plans
documentation		documentation
nnunet		nnunet
tests		tests
.gitignore		.gitignore
HI_Logo.png		HI_Logo.png
LICENSE		LICENSE
__init__.py		__init__.py
boxplot_mean_final.svg		boxplot_mean_final.svg
overview_figure.svg		overview_figure.svg
readme.md		readme.md
setup.cfg		setup.cfg
setup.py		setup.py
splits_custom.pkl		splits_custom.pkl

License

MIC-DKFZ/MultiTalent

Folders and files

Latest commit

History

Repository files navigation

MultiTalent: A Multi-Dataset Approach to Medical Image Segmentation

Requirements

Installation

Apply MultiTalent

Fine-tuning MultiTalent

Training with MultiTalent:

About

Resources

License

Stars

Watchers

Forks

Languages