Please download the I3D features from Baidu disk first, then unzip the archive and put the *.npy files in /code/data/features/I3D.
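Before training, it can help to confirm that the features were extracted into the expected folder. The snippet below is a minimal sketch, not part of this repository: the helper name `find_feature_files` is hypothetical, and the default path assumes the script is run from the repository root, so that the features live under `code/data/features/I3D`.

```python
from pathlib import Path


def find_feature_files(feature_dir):
    """Return the sorted list of .npy files directly inside feature_dir.

    Returns an empty list if the directory does not exist.
    """
    feature_dir = Path(feature_dir)
    if not feature_dir.is_dir():
        return []
    return sorted(feature_dir.glob("*.npy"))


if __name__ == "__main__":
    # Assumption: run from the repository root; adjust the path otherwise.
    files = find_feature_files("code/data/features/I3D")
    print(f"Found {len(files)} .npy feature file(s)")
    if not files:
        print("No features found; check the download and unzip step.")
```

If the count is zero, the archive was most likely unzipped into a different directory or into a nested subfolder.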
The DeBERTa-v3-large model is used by default; it can be replaced with the BigBird model.
After training, the model and the experiment records are saved in the /code/paperlog folder.
cd code
python Train_One.py
cd code
python Train_Two_1.py
python Train_Two_2.py
cd code
python Train_One_TV.py
cd code
python Train_Two_1_TV.py
python Train_Two_2_TV.py
Our scheme achieved state-of-the-art (SOTA) performance; see the paper cited below for the reported results.
Please feel free to cite our [paper](https://aclanthology.org/2022.bionlp-1.21/).
@inproceedings{li2022vpai_lab,
title={Vpai\_lab at medvidqa 2022: A two-stage cross-modal fusion method for medical instructional video classification},
author={Li, Bin and Weng, Yixuan and Xia, Fei and Sun, Bin and Li, Shutao},
booktitle={Proceedings of the 21st Workshop on Biomedical Language Processing},
pages={212--219},
year={2022}
}