A total of 55 tumor scRNA-seq datasets, corresponding to 563 different patients and covering over two million cells in 44 tumor types, were incorporated for constructing these networks. We have provided all the main codes and parameters for constructing the cell-type-specific interactome networks. Moreover, detailed elucidations have also been provided regarding the execution of statistical calculations within each CellNet function module. CellNetdb is publicly and freely available at http://www.bioailab.com:3838/CellNetdb to all users without any login or registration restrictions. The main processes of network function analysis were described below.
We utilized the workflow of SCINET implemented in ACTIONet along with four widely-used reference interactome networks, namely STRING, HumanNet, ConsensusPathDB, and Reactome, to construct cell-type-specific interactome networks for each cell type.
The data in Mutation module of each solid tumor type was collected from COSMIC. Then, users could browse or search somatic mutations in gene which are involved in the queried subnetwork.
To facilitate users in acquiring functional insights into the network, we conducted enrichment analysis on Gene Ontology (GO) and disease-associated gene sets. The GO gene sets were obtained from the Gene Ontology database (release 2022-06-15), while the disease-associated gene sets were sourced from the DisGeNET database (v7.0). The statistical significance of the enrichment of genes in GO terms or disease-associated gene sets within each queried local network was determined using the hypergeometric test.
In the Survival module, we gathered clinical data from several large-scale cohorts, including TCGA, MMRF and TARGET. Then, users could browse or search genes involved in the queried subnetwork whose expression level are associated with patients' overall survival.
In the Communication module, users could browse or search ligand-receptor pairs involved in the queried subnetwork. We only showed the significant interactions (P < 0.05). The Communication score in web table was the overall strength of cell-cell communication.
We have implemented the random walk with restart (RWR) algorithm to prioritize interested genes based on the cell-type-specific interactome networks. Specifically, the random walk with restart is mathematically defined as follows:
We evaluated the performance of various malignant cell networks generated by four reference networks in terms of their capacity to recover DisGeNET disease genes associated with specific tumor type. In this part, we utilized two network performance metrics, which were previously defined by Huang et al., as follows:
The area under the precision-recall curve (
To assess the functional application of cell-type-specific networks in understanding the context-specific role of genes, we utilized a metric called topological specificity (
Transcriptional specificity of genes pertains to their degree of specificity in expression within a particular cell type. To determine this, we employed the gene expression profile to calculate the average expression of various genes in a given cell type
We ranked the top 500 prognostic genes for each tumor type based on their statistical significance using clinical data from the TCGA project. The normalized within-group connectivity of each cancer prognostic signature in all cell-type-specific networks was calculated. Furthermore,
We employed two distinct metrics, namely shared-edge similarity and topology similarity, to assess the degree of similarity between networks. Initially, shared nodes were identified between any pair of networks. To quantify the shared-edge similarity, the edges connecting these nodes were extracted from both networks, resulting in the creation of subgraphs for each network. The shared-edge similarity was subsequently determined by calculating the Spearman correlation coefficient between the weights assigned to the shared edges in the respective subgraphs of both networks. To evaluate the topology similarity, the Spearman correlation coefficient was computed for the transformed topological specificity (
To ensure the reproducibility and usability of our work, we have included a comprehensive code notebook (refer to CellNetdb_notebook.ipynb), which provides guidance for users to replicate all steps on a small dataset.
Zekun Li^, Gerui Liu^, Xiaoxiao Yang^, Meng Shu, Wen Jin, Yang Tong, Xiaochuan Liu, Yuting Wang, Jiapei Yuan*,Yang Yang* (2023) An atlas of cell-type-specific interactome networks across 44 human tumor types. Genome Medicine 16(1):30