[Pattern Recognition] The implementation of MoCA

xufangzhi/MoCA

MoCA

This is the official implementation of MoCA, a model for the task of Textbook Question Answering.

MoCA: Incorporating Multi-stage Domain Pretraining and Cross-guided Multimodal Attention for Textbook Question Answering
(Pattern Recognition) [paper]

External Corpus and Model Checkpoints

For the external corpus and some important checkpoints, please download them from here.

Main Model

  • Multi-stage pretraining for the text part
  • Dense layer of text-guided visual attention for the diagram part
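As a rough illustration of what text-guided visual attention means in general, the sketch below weights diagram region features by their similarity to a text vector. It is not the code from this repository: the function names, the scaled dot-product scoring, and the toy dimensions are all assumptions made for the example, and the dense layer mentioned above is left out.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def text_guided_visual_attention(text_vec, region_feats):
    """Attend over diagram regions using a text vector as the query.

    Hypothetical helper for illustration only; the scoring function
    (scaled dot product) is an assumption, not taken from MoCA.
    text_vec:     list of d floats (e.g. a pooled question encoding)
    region_feats: list of n regions, each a list of d floats
    Returns (weights, attended), where weights has n entries summing
    to 1 and attended is the weighted sum of the region features.
    """
    d = len(text_vec)
    scores = [
        sum(t * r for t, r in zip(text_vec, region)) / math.sqrt(d)
        for region in region_feats
    ]
    weights = softmax(scores)
    attended = [
        sum(w * region[j] for w, region in zip(weights, region_feats))
        for j in range(d)
    ]
    return weights, attended


# Toy usage: three 2-d regions; the first one aligns with the text vector.
weights, attended = text_guided_visual_attention(
    [1.0, 0.0],
    [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]],
)
```

In this toy input the region most similar to the text vector receives the largest weight, which is the behaviour the "text-guided" label refers to.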

Experiments

All experiments were conducted on a single Tesla V100 GPU.

Citation

If you find this work helpful, please cite the paper.

@article{xu2021moca,
  title   =   {MoCA: Incorporating Multi-stage Domain Pretraining and Cross-guided Multimodal Attention for Textbook Question Answering},
  author  =   {Xu, Fangzhi and Lin, Qika and Liu, Jun and Zhang, Lingling and Zhao, Tianzhe and Chai, Qi and Pan, Yudai},
  journal =   {arXiv preprint arXiv:2112.02839},
  year    =   {2021}
}
