-
- Proposes a collaborative cross-modal fusion network for radiology report generation.
-
- Constructs visual collaborative enhancement module to accurately identify abnormal lesions.
-
- Designs cross-modal semantic fusion to perform information alignment between different modalities.
Python >= 3.6Pytorch >= 1.7torchvison
Download IU X-Ray and MIMIC-CXR datasets, and place them in data folder.
- config : setup training arguments and data path
- data : store IU and MIMIC dataset
- models: our model
- modules:
- the layer define of our model
- dataloader
- loss function
- metrics
- tokenizer
- some utils
- preprocess: data preprocess
- pycocoevalcap: Microsoft COCO Caption Evaluation Tools
The source code for training can be found here:
Run train.py to train a model on the IU X-Ray data and MIMIC-CXR data.
Run test.py to test a model on the IU X-Ray data and MIMIC-CXR.
To run the command, you only need to specify the config file and the GPU ID and iteration version of the model to be used
We appreciate for all code providers, especially for R2GenCMN.