This repository contains the models implemented for the Biomedical causal relationship extraction.
All the models are implemented using Python version 3.7+ with the libraries listed in requirement.txt . All the hyperparameters of the models can be changed inside the configure.yaml file.
All the experiments have been condcating on two freely available datasets:
- Data used in Hahn-Powell, G., Bell, D., Valenzuela-Escárcega, M. A., & Surdeanu, M. (2016). This before that: Causal precedence in the biomedical domain. arXiv preprint arXiv:1606.08089. It is available here.
- BioCause (Biomedical Discourse Causality Corpus)
This repository consists of the codes for different tasks and models:
- crossValidation
- Fine tuning the BioBERT models
- Random oversampling
- Layer Visualization
- ruleBased relation Extraction
- etc.
For the all implemented models, a pretarine biomedical word emebedding model is used. For the ElMo the option and weight files for the allennlp module must be downloaded first from here.