GitHub - haifangong/HCP-MIC-at-ImageCLEF-VQA-Med-2020

HCP-MIC at ImageCLEF VQA-Med 2020

This repository is the official implementation of the paper "HCP-MIC at VQA-Med 2020: Effective Visual Representation for Medical Visual Question Answering".

Citing this repository

If you find this code useful in your work, please consider citing us:

@inproceedings{chen2020hcp-mic,
  author    = {Guanqi Chen and
               Haifan Gong and
               Guanbin Li},
  title     = {{HCP-MIC} at VQA-Med 2020: Effective Visual Representation for Medical Visual Question Answering},
  booktitle = {Working Notes of {CLEF} 2020 - Conference and Labs of the Evaluation Forum, Thessaloniki, Greece, September 22-25, 2020},
  series    = {{CEUR} Workshop Proceedings},
  volume    = {2696},
  year      = {2020},
}

Main requirements

torch == 1.4.0
torchvision == 0.5.0
tensorboardX == 2.0
Python 3

Pretrained models for VQA-Med 2020

We provide pretrained models for both the medical image classifier and the medical question classifier used for VQA-Med 2020. Place these models under the BBN-BioBert-Inference/pretrain folder.

Baidu Cloud code: 93nw

The BBN code is mainly modified from BBN, and the pretrained BioBERT weights are obtained from BioBERT. The pickle data should be placed under the BBN-BioBert-Inference/data/ folder.

Usage

# To train long-tailed abnormal image classification with BBN-ResNeSt-50:
python BBN/main/train.py --cfg BBN/configs/BBN-ResNeSt-50.yaml

# To train medical question classification with BioBERT:
cd BioBert
python train.py

# To validate with the best model
cd BBN-BioBert-Inference
python inference.py

You can change the experimental settings by modifying the parameters in the YAML file.

Tools

# Get json from the original format of VQA-MED 2020
python BBN/txt2json.py

# Create cache for VQA
python BBN/create_cache4VQA.py

# Create pickle for inference
python BBN/bbn_create_pickel.py

# Expanding dataset via KL divergence
python BBN/expand_dataset.py

# Create feature dict
python BBN-BioBert-Inference/create_feature_dict.py

Data format

The annotation of a dataset is a dict consisting of two fields: annotations and num_classes. The annotations field is a list of dicts, each with image_id, fpath, im_height, im_width and category_id.

Here is an example.

{
    'annotations': [
                    {
                        'image_id': 1,
                        'fpath': '/home/data/train1920/images/synpic593.jpg',
                        'im_height': 600,
                        'im_width': 800,
                        'category_id': 74
                    },
                    ...
                   ],
    'num_classes': 330
}
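
The layout above can also be built and checked programmatically. The following is a minimal sketch, not code from this repository: the helper names make_entry and validate are hypothetical, and the check that category_id is zero-based and smaller than num_classes is an assumption that the README does not state.

```python
import json

# The five per-image fields listed in the data format description.
REQUIRED_KEYS = {"image_id", "fpath", "im_height", "im_width", "category_id"}


def make_entry(image_id, fpath, im_height, im_width, category_id):
    """Build one annotation record (hypothetical helper)."""
    return {
        "image_id": image_id,
        "fpath": fpath,
        "im_height": im_height,
        "im_width": im_width,
        "category_id": category_id,
    }


def validate(dataset):
    """Raise ValueError if the dict does not follow the described layout."""
    if not {"annotations", "num_classes"} <= set(dataset):
        raise ValueError("expected the fields 'annotations' and 'num_classes'")
    for i, ann in enumerate(dataset["annotations"]):
        missing = REQUIRED_KEYS - set(ann)
        if missing:
            raise ValueError(f"annotation {i} is missing {sorted(missing)}")
        # Assumption: category ids are zero-based and below num_classes.
        if not 0 <= ann["category_id"] < dataset["num_classes"]:
            raise ValueError(f"annotation {i} has an out-of-range category_id")
    return True


# Rebuild the single record shown in the example above.
dataset = {
    "annotations": [
        make_entry(1, "/home/data/train1920/images/synpic593.jpg", 600, 800, 74),
    ],
    "num_classes": 330,
}
validate(dataset)

# Serialize to JSON text; json always emits double-quoted keys and strings.
text = json.dumps(dataset, indent=4)
```

Note that the example above is written as a Python dict literal with single quotes; if the annotation is stored as a JSON file, the quotes must be double quotes, which json.dumps produces automatically.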

You can use BBN/txt2json.py (listed under Tools) to convert from the original format of VQA-Med. The images and annotations can be downloaded at VQA-Med 2020.

Contacts

If you have any questions about our work, please do not hesitate to contact us by email.

Haifan Gong: haifangong@outlook.com
