Skip to content

This repository contains the code for the creation of gene regulatory networks presented in the publication "Widening the landscape of transcriptional regulation of green algal photoprotection"

License

Notifications You must be signed in to change notification settings

arendma/GRN_code

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Code to generate a Gene regulatory network for Chlamydomonas reinhardtii

The code included in the ../Program subfolder recreates the gene regulatory network based on the read count data from

Steps

1. Setup

Install R according to the installation guidelines and then source ../Program/setup.r

Before running the code make sure to have set

options(stringsAsFactors = FALSE)

2. Aggregating and processing read counts - some info graphics (tested on R.3.6.1)

The script 2-CountProcessing.R reads the raw read count data from ..\Data and aggregates it into one large matrix with genes as rows and samples as columns. Next the normalization steps of the edgeR wirkflow using TMM and voom normalization are applied to this matrix to derive at estimates for the log2 count values. In the second part of the script PCA plots and a heatmap of correlation values are generated to give a first overview of the data.

3. network from elastic net regression (tested on R4.2.2)

The script 3-Netreg_elnet_TF_G_me1519am17v2.R zscore transformes the log values gene wise and aggregates replicates to median values if applicable. Then it uses a implementation fo the elastic net algorithm to generate a gene regulatory network (Runtime on Core-i7 16GB RAM was 5 days)

4. network from other GRN inference methods (tested on R4.2.2)

The script 4-Netreg_bignet_TF_G_me1519am17v2 zscore transformes the log values gene wise and aggregates replicates to median values if applicable. It then generates GRNs inferred using the approaches

During later checks it turned out that ARACNE and SILENCING did not produce results of usabel quality. The script then aggregates the obtaine GRNs together with the output from 3-Netreg_elnet_TF_G_me1519am17v2.Rand saves them into one R object. (Runtime on Core-i7 16GB RAM was 12h)

5. Aggregating network predictions into consensus (tested on R.3.6.1)

The script 5-Gen_consensus.R uses the previously generated GRNs and aggregates them into a consensus network considering a maximum of 10% of all possible edges per approach. One consensus of all approaches is generated, and a second consensus omitting GRNS of ARACNE and Silencing is generated. The final consensus networks are exported as edge lists in a tab seperated text file: The first column is the node out node, second column the in node after that a column with the rank of the edge in each integrated approach is added and a final column that gives the mean of all ranks. (Ranks of edges not included in a certain approach are imputed as 10% of all possible edges +1). Similar to the process described in Marbach et al.

About

This repository contains the code for the creation of gene regulatory networks presented in the publication "Widening the landscape of transcriptional regulation of green algal photoprotection"

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Languages