RP3D-Diag (code is still being updated and completed...)

The official code for the paper "Large-scale Long-tailed Disease Diagnosis on Radiology Images".

In this paper, we build an academically accessible, large-scale diagnostic dataset, present a knowledge-enhanced model architecture that can process an arbitrary number of input scans from various imaging modalities, and establish a new benchmark for multi-modal, multi-anatomy, long-tailed diagnosis. Our method shows superior results on this benchmark. In addition, our final model can serve as a pre-trained model and be fine-tuned to benefit diagnosis on various external datasets.

Dataset

Overview of RP3D-DiagDS. There are 39,026 cases (192,675 scans) across 7 human anatomy regions and 9 diverse modalities covering 930 ICD-10-CM codes.

The images used in our dataset can be downloaded from BaiduYun or OneDrive.

The train/test split strategy and label CSV files can be found in the HuggingFace repository RP3D-DiagDS.

Model

The architecture of our proposed visual encoder and fusion module, together with the knowledge enhancement strategy. (a) shows the details of the vision encoder. We design two variants to fit the two main visual backbones, i.e., ResNet and ViT. (b) shows the transformer-based fusion module, which enables case-level information fusion. (c) shows the knowledge enhancement strategy. We first pre-train a text encoder, termed the knowledge encoder, on extra medical knowledge (i.e., synonyms, descriptions and hierarchy) with contrastive learning. We then treat its text embeddings as a natural classifier to guide the diagnosis classification.
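
The idea of using text embeddings as a classifier can be illustrated with a small sketch. This is not the repository's implementation: it assumes that each class is scored by the dot product between a fused case embedding and that class's text embedding, followed by a sigmoid for multi-label output. The function names, class names and vectors below are hypothetical toy values.

```python
import math


def sigmoid(x: float) -> float:
    """Map a logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def classify_with_text_embeddings(case_embedding, class_text_embeddings):
    """Score every class by the dot product between the case embedding
    and the class's text embedding (assumed scoring rule, see lead-in)."""
    scores = {}
    for name, text_embedding in class_text_embeddings.items():
        logit = sum(c * t for c, t in zip(case_embedding, text_embedding))
        scores[name] = sigmoid(logit)
    return scores


# Toy values only: a 3-dimensional case embedding and two hypothetical classes.
case_embedding = [0.5, -1.0, 2.0]
class_text_embeddings = {
    "pneumonia": [1.0, 0.0, 1.0],   # logit = 2.5
    "fracture": [0.0, 1.0, -1.0],   # logit = -3.0
}
probs = classify_with_text_embeddings(case_embedding, class_text_embeddings)
for name, prob in probs.items():
    print(f"{name}: {prob:.3f}")
```

Because the classifier weights come from text, a class with few training images still gets an embedding informed by its synonyms, description and position in the hierarchy, which is the motivation given above for the long-tailed setting.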

The model checkpoint can also be found in the HuggingFace repository RP3D-DiagModel (uploading).

Setup

Environment

python = 3.9
pytorch(gpu) >= 2.0
transformers = 4.32.1
scikit-learn = 1.3.2
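
A quick way to confirm that an environment matches the versions listed above is a small check script. The sketch below is not part of the repository; it assumes the usual PyPI distribution names (torch, transformers, scikit-learn) and treats the PyTorch entry as a minimum version and the others as exact pins.

```python
import sys
from importlib import metadata

# Versions as listed above; distribution names are assumed PyPI names.
REQUIRED = {
    "torch": ("min", "2.0"),
    "transformers": ("exact", "4.32.1"),
    "scikit-learn": ("exact", "1.3.2"),
}


def version_tuple(text, width=3):
    """Turn '2.1.0+cu118' into (2, 1, 0); pad short versions with zeros."""
    parts = []
    for piece in text.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    parts += [0] * max(0, width - len(parts))
    return tuple(parts)


def satisfies(installed, mode, wanted):
    """Compare an installed version string against a requirement."""
    have, want = version_tuple(installed), version_tuple(wanted)
    return have == want if mode == "exact" else have >= want


def check_environment(required=REQUIRED):
    """Return a {package: status} report for the current interpreter."""
    report = {"python": "ok" if sys.version_info[:2] == (3, 9) else
              f"found {sys.version_info[0]}.{sys.version_info[1]}, want 3.9"}
    for name, (mode, wanted) in required.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        ok = satisfies(installed, mode, wanted)
        report[name] = "ok" if ok else f"found {installed}, want {mode} {wanted}"
    return report


if __name__ == "__main__":
    for name, status in check_environment().items():
        print(f"{name}: {status}")
```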

Data Preparation

See the readme.txt in src/DataPath and download the files from HuggingFace.
Modify all the paths (in the files in DataPath, all the Python files and the bash files) to your own.
The DataPath folder should look like this:

src
|   DataPath
|   |    RP3D_train.json
|   |    RP3D_test.json
|   |    disorder_label_dict.json
|   |    icd10_label_dict.json
|   |    xxxCheckpoint
|   |    |    pytorch.model.bin 
|   |    |    ......
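
Before running evaluation or training, it can help to verify that the folder matches the layout above. The following sketch is not part of the repository: the helper name check_datapath is hypothetical, and the checkpoint folder is not checked because its name (xxxCheckpoint) is a placeholder in the tree.

```python
import json
from pathlib import Path

# JSON files named in the layout above.
EXPECTED_FILES = [
    "RP3D_train.json",
    "RP3D_test.json",
    "disorder_label_dict.json",
    "icd10_label_dict.json",
]


def check_datapath(root):
    """Return a list of problems found under `root`.

    An empty list means every expected file exists and parses as JSON.
    """
    root = Path(root)
    problems = []
    for name in EXPECTED_FILES:
        path = root / name
        if not path.is_file():
            problems.append(f"missing: {path}")
            continue
        try:
            with path.open("r", encoding="utf-8") as handle:
                json.load(handle)
        except json.JSONDecodeError as err:
            problems.append(f"not valid JSON: {path} ({err})")
    return problems


if __name__ == "__main__":
    # "src/DataPath" follows the tree above; adjust to your own location.
    for problem in check_datapath("src/DataPath"):
        print(problem)
```

Note that parsing the training split may take a while if the file is large; drop the json.load step if only the existence check is needed.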

Evaluation

Modify eval.sh (OUTDIR, CHECKPOINT)
Run in terminal: bash eval.sh

Training

From Scratch

Modify train.sh (OUTDIR, LEVEL, DEPTH, FUSE, KE...)
Run in terminal: bash train.sh

Load Checkpoint

In train.sh, set SAFETENSOR to the path of pytorch.model.bin and set CHECKPOINT to "None".
Run in terminal: bash train.sh

Benchmark

Classification results at the Disorders and ICD-10-CM levels. In this table, the Fusion Module and the Knowledge Enhancement strategy are both used. We report the results on the Head/Medium/Tail class sets separately.

Granularity	Class	AUC	AP	F1	MCC	R@0.01	R@0.05	R@0.1
Disorders	Head	94.24	15.13	25.68	26.71	37.06	66.55	81.37
Disorders	Medium	94.69	12.38	20.64	24.07	31.52	65.34	78.73
Disorders	Tail	90.64	9.25	9.89	14.38	10.98	27.98	43.53
ICD-10-CM	Head	90.89	14.37	22.67	24.29	26.11	53.82	69.16
ICD-10-CM	Medium	91.67	9.56	18.32	20.77	25.68	52.85	66.63
ICD-10-CM	Tail	86.55	4.81	8.69	12.74	8.86	22.75	37.97
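
For readers less familiar with the F1 and MCC columns, the sketch below computes both for a single class from binary labels and predictions, using the standard definitions. It is only an illustration with toy data: the repository's own evaluation code may rely on scikit-learn and average over classes differently, and the table reports values scaled to percentages.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def f1_score(tp, fp, fn):
    """F1 = 2TP / (2TP + FP + FN); defined as 0 when there are no positives."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mcc(tp, fp, fn, tn):
    """Matthews correlation coefficient; defined as 0 when undefined."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy data: 3 positive cases and 5 negative cases for one class.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
tp, fp, fn, tn = confusion_counts(y_true, y_pred)
print(f"F1  = {100 * f1_score(tp, fp, fn):.2f}")
print(f"MCC = {100 * mcc(tp, fp, fn, tn):.2f}")
```

Unlike F1, MCC also takes true negatives into account, which makes it a useful complement when positives are rare, as in the tail classes.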

ROC curves on Disorders and ICD-10-CM, for the head/medium/tail parts respectively. The shaded area in the figure shows the 95% CI (confidence interval); FM and KE are short for Fusion Module and Knowledge Enhancement.

Comparison

AUC score comparison on various external datasets. SOTA denotes the best performance of prior works (cited with the corresponding references) on each dataset. Scratch means using our model but training it from scratch. Ours means fine-tuning from our checkpoint.

Dataset	Scratch	Ours	SOTA
VinDr-Mammo	76.25	78.53	77.50
CXR14	79.12	83.38	82.50
VinDr-Spine	87.35	87.73	88.90
MosMedData	71.24	75.39	68.47
ADNI	82.40	84.21	79.34
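
To make the differences in the table easier to read, the short sketch below computes the AUC gain of fine-tuning from our checkpoint over training from scratch and over the reported SOTA. The numbers are copied from the table; the helper name gains is hypothetical.

```python
# (Scratch, Ours, SOTA) AUC values copied from the table above.
RESULTS = {
    "VinDr-Mammo": (76.25, 78.53, 77.50),
    "CXR14": (79.12, 83.38, 82.50),
    "VinDr-Spine": (87.35, 87.73, 88.90),
    "MosMedData": (71.24, 75.39, 68.47),
    "ADNI": (82.40, 84.21, 79.34),
}


def gains(results):
    """Return {dataset: (Ours - Scratch, Ours - SOTA)} rounded to 2 decimals."""
    return {
        name: (round(ours - scratch, 2), round(ours - sota, 2))
        for name, (scratch, ours, sota) in results.items()
    }


deltas = gains(RESULTS)
for name, (vs_scratch, vs_sota) in deltas.items():
    print(f"{name:12s} vs Scratch: {vs_scratch:+.2f}   vs SOTA: {vs_sota:+.2f}")
```

On these numbers, fine-tuning improves over training from scratch on all five datasets and exceeds the reported SOTA on four of them; VinDr-Spine is the exception.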

Acknowledgment

We sincerely thank all the contributors who uploaded the relevant data in our dataset online. We appreciate their willingness to make these valuable cases publicly available.

Contact

If you have any questions, please feel free to contact three-world@sjtu.edu.cn.
