Skip to content

AI-SysBio/RA-drug-discovery

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 

Repository files navigation

Cell-type Specific Gene Networks and Drivers in Rheumatoid Arthritis

The increasing number of available large RNA-seq datasets, combined with genome-wide association studies (GWAS), differential gene expression (DEG) studies, and gene regulatory networks (GRN) analyses have to potential to lead to the discovery of novel therapeutics. Yet, despite this progress, our ability to translate GWAS and DEG analyses into an improved mechanistic understanding of many diseases remains limited, as different analyses often disregard information about the cell-types mediating the disease.

In this repository, We compute sample-specific GRNs which enables the use of statistical techniques to compare network properties between phenotypic groups. Then, we leverage this collection of networks to rank transcription factors (TFs) according to their contribution to the observed differential gene expression between RA and control.

This repository support our publications:

[1] Pelissier A, Laragione T, Martinez MR, & Gulko PS. Cell-Specific Gene Networks and Drivers in Rheumatoid Arthritis Synovial Tissues (2023). bioRxiv (2023): 2023-12.

[2] Pelissier A*, Laragione T*, Harris C, Gulko PS & , Martinez MR. Gene Network Analyses Identifies Co-regulated Transcription Factors and BACH1 as a Key Driver in Rheumatoid Arthritis Fibroblast-like Synoviocytes (2023). bioRxiv (2023): 2023-12.

 

Constructing cell-type specific gene regulatory network

In this work, Gene regulatory networks are bipartite graphs, with edges connecting TF and their target gene (TG). Each edge has a weight representing the probability of a regulatory interaction between the connected nodes. Briefly, we use PANDA [3] that integrates gene expression data with prior knowledge about TF-binding motif and protein-protein interactions by optimizing the weights of edges in the networks with iterative steps. PANDA's input consists of:

  • A gene expression matrix. In this study we used bulk RNA expression, available in Data/RA_gene_expression
  • Prior knowledge about TF-TF interactions and TF binding motif, available in Data/PANDA_prior_knowledge Run the script src/PANDA_network.py to compute the network and analyse their edges.

Applied to our data, PANDA produced fully connected and directed networks of TFs to their target genes, comprising 644 TFs and 18992 genes. Then, we used LIONESS [4] to estimate an individual gene regulatory network for each sample in the population, which we utilized to make differential analysis of their edges and identify key TF regulators.

 

Key driver analysis

We use mergeomics [5]. You need: Some network for your analysis. We used the GIANT network, downloaded at https://giant-v2.princeton.edu/download/.

  • Run the script src/KDA_analysis.py

 

Experimental validation

In our article we focused on Synovial fibroblast and detected FOSL1, THBS1 and CFH as potential novel key regulators. We performed silencing experiment on RA cell line and provide the results in Data/silencing_experiments/silencing_data.xlsx

  • Run the script src/silencing_analysis.py to run the statistical test and combine the p-values with the Brown-Fisher method.

 

Reference

[3] Glass, Kimberly, et al. "Passing messages between biological networks to refine predicted interactions." PloS one 8.5 (2013): e64832.

[4] Kuijjer, Marieke Lydia, et al. "Estimating sample-specific regulatory networks." Iscience 14 (2019): 226-240.

[5] Shu, Le, et al. "Mergeomics: multidimensional data integration to identify pathogenic perturbations to biological systems." BMC genomics 17.1 (2016): 1-16.

About

Cell-type Specific Gene Networks and Drivers in Rheumatoid Arthritis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages