Adverse Polypharmacy Reaction Intelligent Learner and Explainer (APRILE) is an explainable framework that reveals the mechanisms underlying adverse drug reactions (ADRs) caused by polypharmacy therapy. After learning from massive biomedical data, APRILE generates a small pharmacogenomic knowledge graph (i.e. drug targets and protein interactions) as a mechanistic explanation for a drug-drug interaction (DDI) associated with an ADR, or for a set of such interactions.
APRILE is able to answer the following example questions:
- Why does the combined use of a pair of drugs (nicotine, ondansetron) cause anxiety?
- When taking fexofenadine, hydroxyzine and loratadine simultaneously, what side effects may occur, and why?
- Which genes are associated with infectious diseases?
- What are the common mechanisms among peptic ulcers (such as duodenal ulcer, gastric ulcer and esophageal ulcer)?
We have demonstrated the viability of discovering polypharmacy side effect mechanisms by learning from an AI model trained on massive biomedical data (see [paper]):
- APRILE predicts side effects for drug combinations and explains the reasons behind each prediction
- APRILE delineates non-intuitive mechanistic associations between {genes, proteins, biological processes} and {symptoms, diseases, mental disorders ∈ ADRs}
- Using our pre-trained model, molecular mechanisms for 843,318 (learned) + 93,966 (novel) side effect–drug pair events, spanning 861 side effects (472 diseases, 485 symptoms and 9 mental disorders) and 20 disease categories, have been suggested.
Prerequisites:
Before installing aprile, install the PyTorch and PyTorch Geometric builds that match your hardware.
We recommend using torch 1.4.0 (python3.7+cuda10.1), torch-cluster 1.5.4, torch-scatter 2.0.4, torch-sparse 0.6.1, torch-spline-conv 1.2.0 and torch-geometric 1.4.2.
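One way to check an existing environment against these recommendations is sketched below. The names RECOMMENDED, installed_versions and find_version_mismatches are illustrative and not part of APRILE. The version numbers are taken from the recommendation above, and the keys are assumed to be the PyPI distribution names. The standard-library importlib.metadata module needs Python 3.8 or later; on Python 3.7 the importlib_metadata backport provides the same call.

```python
from importlib import metadata

# Versions taken from the recommendation above (keys: PyPI distribution names).
RECOMMENDED = {
    "torch": "1.4.0",
    "torch-cluster": "1.5.4",
    "torch-scatter": "2.0.4",
    "torch-sparse": "0.6.1",
    "torch-spline-conv": "1.2.0",
    "torch-geometric": "1.4.2",
}


def installed_versions(packages):
    """Return {package: installed version, or None if it is not installed}."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def find_version_mismatches(installed, recommended=RECOMMENDED):
    """Return {package: (installed, recommended)} for every differing entry."""
    return {
        name: (installed.get(name), wanted)
        for name, wanted in recommended.items()
        # Ignore local build tags such as "+cu101" when comparing.
        if (installed.get(name) or "").split("+")[0] != wanted
    }


if __name__ == "__main__":
    mismatches = find_version_mismatches(installed_versions(RECOMMENDED))
    for name, (have, want) in mismatches.items():
        print(f"{name}: installed {have}, recommended {want}")
```

A mismatch is only a hint: other version combinations may work, but they are not the ones recommended here.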
Install APRILE and its environment dependencies using pip:
pip install aprile
Firstly, download the data file kgdata.pkl using this link, and put it into your working directory.
Secondly, load the data and the APRILE model.
from aprile.model import *
gdata = AprileQuery.load_from_pkl('kgdata.pkl')
aprile = Aprile(gdata, device='cuda') # device='cpu' if using CPUs
Next, let us get familiar with the data object gdata. Its data type is torch_geometric.data.data.Data, and its attribute list can be obtained using vars(gdata).keys(). It mainly contains four parts:
- a pharmacogenomic knowledge graph: gdata.pp_index and gdata.pd_index
- the ADRs caused by polypharmacy: gdata.dd_edge_index
- the data for training and testing APRILE-Pred: gdata.train_idx, gdata.train_et, gdata.test_idx and gdata.test_et
- the index mappings for drugs, genes, proteins and ADRs:
  - gdata.side_side_effect_idx_to_name: mapping from side effect aprile index to side effect name
  - gdata.drug_idx_to_id: mapping from drug aprile index to CID
  - gdata.prot_idx_to_id: mapping from protein aprile index to GeneID
  - gdata.geneid2symbol: mapping from GeneID to gene symbol
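As a sketch of how these mappings chain together, the snippet below translates protein indices into gene symbols. It runs on a tiny stand-in object rather than the real gdata: the index assignments are invented for illustration, the attribute names follow the list above, and treating the mappings as dict-like objects keyed by integers is an assumption about the contents of kgdata.pkl. The helper protein_indices_to_symbols is hypothetical and not part of the aprile package.

```python
from types import SimpleNamespace

# Stand-in for gdata; the index assignments are invented toy values.
toy_gdata = SimpleNamespace(
    prot_idx_to_id={0: 1956, 1: 7157},           # aprile index -> GeneID
    geneid2symbol={1956: "EGFR", 7157: "TP53"},  # GeneID -> gene symbol
)


def protein_indices_to_symbols(gdata, indices):
    """Map aprile protein indices to gene symbols via their GeneIDs."""
    symbols = []
    for idx in indices:
        gene_id = gdata.prot_idx_to_id[idx]
        # Fall back to the GeneID itself when no symbol is recorded.
        symbols.append(gdata.geneid2symbol.get(gene_id, str(gene_id)))
    return symbols


# List the attributes of the stand-in, as one would for the real gdata.
print(sorted(vars(toy_gdata).keys()))
print(protein_indices_to_symbols(toy_gdata, [1, 0]))
```

With the real data, the same two-step lookup should apply once gdata has been loaded, provided the mappings behave as assumed here.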
Finally, use APRILE to predict ADRs caused by polypharmacy and generate explanations (e.g. molecular mechanisms of the ADRs). Here is an example:
# a list of DDIs in the format of (D1, D2, SE)
d1, d2, se = [19, 37, 192], [37, 192, 19], [452]*3
# get predictions
query = aprile.predict(d1, d2, se)
# get prediction result table
pred_df = query.get_pred_table()
# get explanations --> proteins and GO terms
query = aprile.explain_query(query, regularization=2, if_auto_tuning=True)
# save query to file
query.to_pickle('tmp.pkl')
# load query from file
query = AprileQuery.load_from_pkl('tmp.pkl')
# print query summary
print(query)
# get detailed prediction and explanation results
prediction_df = query.get_pred_table()
go_df = query.get_GOEnrich_table()
# visualize explained query and save
subgraph_fig = query.get_subgraph(if_show=True, save_path='test.pdf')
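A question about several drugs taken together can be posed by enumerating every pair in the drug set, which is what the example above does for drugs 19, 37 and 192. The helper below sketches that step. build_pairwise_query is a hypothetical name, not part of aprile, and the sketch assumes that the order of the two drugs within a pair does not matter to the model.

```python
from itertools import combinations


def build_pairwise_query(drugs, side_effect):
    """Return (d1, d2, se) lists covering every unordered pair of drugs."""
    pairs = list(combinations(drugs, 2))
    d1 = [first for first, _ in pairs]
    d2 = [second for _, second in pairs]
    se = [side_effect] * len(pairs)
    return d1, d2, se


# Same drug set and side effect index as in the example above.
d1, d2, se = build_pairwise_query([19, 37, 192], 452)
# The three lists can then be passed on, e.g. aprile.predict(d1, d2, se).
```

The pairs come out in a different order from the hand-written lists, but they cover the same three drug combinations.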
If you find this work useful, please cite us:
@article{aprile,
title={APRILE: Exploring the Molecular Mechanisms of Drug Side Effects with Explainable Graph Neural Networks},
author={Hao Xu and Shengqi Sang and Herbert Yao and Alexandra I. Herghelegiu and Haiping Lu and Laurence Yang},
journal={bioRxiv preprint},
year={2021}
}