Relation-Extraction---DrugProt

This reposetory cosists of three folder contains three models to perform the task of relation extraction between chemical and protien entities.

Each folder contains train file. To train the models:

CNN : 1.1 Download the pre-traind embedding model: BioWordVec vector 13GB (200dim, trained on PubMed+MIMIC-III, word2vec bin format) from https://github.com/ncbi-nlp/BioSentVec 2.2 Build vocabulary: python3 build_vocab.py 3.3 Train the model
```
   python3 train.py  const_vec high_num_sent 
   dec_vec: stands for padding the position vectors 
```

2.SciBERT+CNN and SciBERT+LSTM+CNN:

      python3 train.py 16 50 no_cv 0.00002

Parameters are: batch size, number of epochs, training with or without cross validation(CV) set to Y_CV if you want to train it with CV, learning rate.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
CNN-model		CNN-model
SciBERT+CNN		SciBERT+CNN
SciBERT+LSTM+CNN		SciBERT+LSTM+CNN
checkpoints		checkpoints
data		data
predictions		predictions
presentations		presentations
M2_Thesis_BERT Model and Convolutional Neural Networks for Relation Extraction.pdf		M2_Thesis_BERT Model and Convolutional Neural Networks for Relation Extraction.pdf
README.md		README.md
generate_samples.py		generate_samples.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Relation-Extraction---DrugProt

About

Releases

Packages

Languages

FatimaHabib/Relation-Extraction---DrugProt

Folders and files

Latest commit

History

Repository files navigation

Relation-Extraction---DrugProt

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages