Skip to content

nguyenvanhoang7398/NEAT

Repository files navigation

Neural Side Effect Discovery from User Credibility and Experience-Assessed Online Health Discussions

Full workshop paper is available here

The neural architecture of our proposed NEAT. The blue boxes denoted general text encoders with attention. The highlighted words in red denoted the text segments that are being attended by the encoder. The yellow oval denotes contextual information of User Experience (UE), Credibility Weight (CW) and Cluster Attention (CA). alt text

Getting started

Download People-On-Drugs dataset here, unzip and move to data/pod

Download Glove here, unzip and move to data/glove.840B.300d.txt

Neural ADR extractor can be found here

Extract side effects

python side_effects.py

Preprocess People-On-Drugs into docs

python pod.py

Run model training

python train.py --model_name=<neat_lstm|neat_wpe|neat_wpeu|neat_full|neat_cnn|neat_cnn_wpe|neat_cnn_wpeu|neat_fulll>

Run ADR extraction benchmark

python adr_extraction.py

Output of 3 approaches: UMLS tagging, ADR extractor - Ding et al. (2018), and NEAT's Attention for 5 drugs Alprazolam, Ibuprofen, Levothyroxine, Metoformin, Omeprazole are available in test_alprazolam.json, test_ibuprofen.json, test_levothyroxine.json, test_metoformin.json, test_omeprazole.json under the keys umls, neural and neat respectively.

Bag-of-word Random Forest implementation is available at baseline.py

Environments

python==3.7
nltk==3.2.1
keras==2.1.3
tensorflow==1.13.1
spacy==2.2.3
pytorch=1.3.1

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages