Biomarker prioritisation and power estimation using ensemble gene regulatory network inference

F. Aziz, A. Acharjee, J.A. Williams, D. Russ, L. Bravo-Merodio and G.V. Gkoutos.

This repository contains the code to implement the above paper.

This code provides a modified implementation of the following:

MIDER: Network Inference with Mutual Information Distance and Entropy Reduction (https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0096732f)
Gene regulatory network inference using PLS-based methods (PLSNET) (https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1398-6)
A probabilistic graphical model for system-wide analysis of gene regulatory networks (https://academic.oup.com/bioinformatics/article-abstract/36/10/3192/5756207?redirectedFrom=fulltext)

Note that this folder does not conatin the full implementation of the codes. Full implmentation of MIDER and PLSNET are publicly avaliable and can be downloaded from the respctive links above.

Our biomarker identification strategy is based on the paper “-Omics biomarker identification pipeline for translational medicine” (https://translational-medicine.biomedcentral.com/articles/10.1186/s12967-019-1912-5). A github library for this work can be accessed at https://github.com/jaw-bioinf/Biomarker_Identification.

Code

To execute the code run main.m

Basic requirements

The code is implemented using MATLAB R2020b. The customized implementations of MIDER and PLSNET are already provided in the code, so those packages do not need to be downloaded.

Input

The code prompts for the following input:

Input file: If you want to test one of the datasets in the paper, then type a number from 1 to 5. Otherwise, if you want to test your own data type 6. In the later case, your data file must be stored in the data directory as a CSV file with the name data.csv. The CSV file should contian the gene expression data with gene names as header.
Method: The second input corresponds to the algorithm that you want to execute. If you want to execute plsnet once, enter 1. If you wnat to execute plsnet multiple times and compute the frequencies enter 2. If you want to execute MIDER enter 3. If you want to execute MIDER and also verify the GRN using LBP, press 4.
Fraction : Enter the fraction of edges that you want to be considered in the final GRN. PLSNET reurns all the edges, while MIDER implicitly computes a threshold and reutrns only a subset of edges. For MIDER, the value of the threhold further discards the edges.
For running plsnet multiple times, we need additional parameter, which is the number of times you want to execute PLSNET.

Output

The following output files are stored in the output folder:

The file Genes.txt stores the list of the genes
The file GRN.txt is the adjacency matrix of the graph regulatory network (For method 1,2,4 and 5 only)
The file frequencies.txt contains the frequencies of genes (For method 2 only)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
MIDERv2		MIDERv2
PLSNET		PLSNET
PgmGRNs		PgmGRNs
data		data
output		output
.DS_Store		.DS_Store
.gitattributes		.gitattributes
README.md		README.md
main.m		main.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Biomarker prioritisation and power estimation using ensemble gene regulatory network inference

F. Aziz, A. Acharjee, J.A. Williams, D. Russ, L. Bravo-Merodio and G.V. Gkoutos.

Code

Basic requirements

Input

Output

About

Releases

Packages

Languages

azizfurqan/PGM

Folders and files

Latest commit

History

Repository files navigation

Biomarker prioritisation and power estimation using ensemble gene regulatory network inference

F. Aziz, A. Acharjee, J.A. Williams, D. Russ, L. Bravo-Merodio and G.V. Gkoutos.

Code

Basic requirements

Input

Output

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages