GraphCare

Code for the paper: GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs in ICLR'24.

Requirements:

pip install torch==1.12.0
pip install torch-geometric==2.3.0
pip install pyhealth==1.1.2
pip install scikit-learn==1.2.1
pip install openai==0.27.4

We follow the flow of methodology section (Section 3) to explain our implementation.

1. Concept-specific Knowledge Graph (KG) Generation

1.1 LLM-based KG extraction via prompting

The jupyter notebook to prompt KG for EHR medical code:

/graphcare_/graph_generation/graph_gen.ipynb

We place sample KGs generated by GPT-4 as

/graphs/{condition/CCSCM,procedure/CCSPROC,drug/ATC3}/{code_id}.txt

1.2 Subgraph sampling from existing KGs

The script for subgraph sampling from UMLS:

/KG_mapping/umls_sampling.py

We place 2-hop sample KGs randomly subsampled from UMLS as

/graphs/umls_2hop.csv

1.3 Word Embedding Retrieval for Nodes & Edges

The jupyter notebooks for word embedding retrieval:

/graphcare_/graph_generation/{cond,proc,drug}_emb_ret.ipynb

Due to the large size of word embedding, we do not include them in the repo. You can use our script to retrieve it and store it in either

/graphs/cond_proc/{entity_embedding.pkl, relation_embedding.pkl}
or
/graphs/cond_proc_drug/{entity_embedding.pkl, relation_embedding.pkl}

depending on the features used for the prediction tasks.

1.4 Node & Edge Clustering

The function for node & edge clustering:

clustering() in data_prepare.py

We place some clustering results (only "_inv" as cluster embedding has large size) in

/clustering/

2. Personalized Knowledge Graph Composition

process_sample_dataset() and process_graph() in data_prepare.py
&
get_subgraph() in graphcare.py

3. Bi-attention Augmented (BAT) Graph Neural Network

The implementation of our proposed BAT model is in

/graphcare_/model.py

4. Training and Prediction

The creation of task-specific datasets (using PyHealth) is in

data_prepare.py

The training and prediction details are in

graphcare.py

Run Baseline Models

The scripts running baseline models are placed in

ehr_models.py

Cite GraphCare

@inproceedings{jiang2023graphcare,
  title={GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs},
  author={Jiang, Pengcheng and Xiao, Cao and Cross, Adam Richard and Sun, Jimeng},
  booktitle={The Twelfth International Conference on Learning Representations},
  year={2023}
}

Thanks for your interest in our work! 😊

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
KG_mapping		KG_mapping
clustering		clustering
graphcare_		graphcare_
graphs		graphs
resources		resources
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data_prepare.py		data_prepare.py
ehr_models.py		ehr_models.py
graphcare.py		graphcare.py
graphcare_analysis.py		graphcare_analysis.py
graphcare_analysis_cpd.py		graphcare_analysis_cpd.py
kg_ratio_data_gen.py		kg_ratio_data_gen.py
old_scripts.zip		old_scripts.zip
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GraphCare

Requirements:

1. Concept-specific Knowledge Graph (KG) Generation

1.1 LLM-based KG extraction via prompting

1.2 Subgraph sampling from existing KGs

1.3 Word Embedding Retrieval for Nodes & Edges

1.4 Node & Edge Clustering

2. Personalized Knowledge Graph Composition

3. Bi-attention Augmented (BAT) Graph Neural Network

4. Training and Prediction

Run Baseline Models

Cite GraphCare

About

Languages

License

pat-jj/GraphCare

Folders and files

Latest commit

History

Repository files navigation

GraphCare

Requirements:

1. Concept-specific Knowledge Graph (KG) Generation

1.1 LLM-based KG extraction via prompting

1.2 Subgraph sampling from existing KGs

1.3 Word Embedding Retrieval for Nodes & Edges

1.4 Node & Edge Clustering

2. Personalized Knowledge Graph Composition

3. Bi-attention Augmented (BAT) Graph Neural Network

4. Training and Prediction

Run Baseline Models

Cite GraphCare

About

Topics

Resources

License

Stars

Watchers

Forks

Languages