ConDecon
is a clustering-independent deconvolution method for estimating cell abundances in bulk tissues using single-cell RNA-seq data. The aim of ConDecon is to infer a probability distribution across a reference single-cell RNA-seq dataset that represents the likelihood for each cell in the reference data to be present in the query bulk tissue. To that end, ConDecon requires three inputs:
- Single-cell gene expression count data
- Single-cell latent space
- Normalized bulk data
With this information, ConDecon learns a relationship that explains the similarity between the gene expression profile of bulk and single-cell data as a function of changes in cell abundances, without relying on cluster labels or cell-type specific gene expression signatures at any step. ConDecon enables previously elusive analyses of dynamic cellular processes in bulk tissues and represents an increase in functionality and phenotypic resolution with respect to current methods for gene expression deconvolution. Additionally, ConDecon can be applied to other omics data modalities including spatial transcriptomics and chromatin accessibility data. Overall, we anticipate that these features will improve our understanding of tissue cell composition by facilitating the inference of cell state abundances within complex bulk tissues, particularly in the context of evolving systems like development and disease progression.
Aubin, R. G., Montelongo, J., Hu, R., Gunther, E., Nicodemus, P., Camara, P. G. Clustering-independent estimation of cell abundances in bulk tissue using single-cell RNA-seq data. Biorxiv (2023).
devtools::install_github("CamaraLab/ConDecon")
library(ConDecon)
A complete guide of ConDecon's tutorials and API is available here.
- Tutorial applying ConDecon to simulated transcriptomic data: Quick set up and example
- Tutorial applying ConDecon to B-cells from scRNA-seq data: B-cells example
- Tutorial applying ConDecon spatial transcriptomic data: Spatial RNA example
- Tutorial applying ConDecon to chromatin accessibility data: ATAC example
A list of the main user functions in the ConDecon package:
- RunConDecon
- PlotConDecon
- TransferFeatures
- BuildTrainingSet
- Map2Latent
- BuildModel
- PredictCellProb
- CalcRelativeCellProb
- ComputeProximityScore
A list of example data included in the ConDecon package:
- counts_gps: Simulated single-cell RNA-seq counts data
- latent_gps: PCA representation of 'counts_gps'
- variable_genes_gps: Variable genes associated with 'counts_gps'
- meta_data_gps: Meta data associated with 'counts_gps'
- bulk_gps: Simulated bulk RNA-seq data
- true_prop_gps: True cell type proportion of simulated bulk RNA-seq data