Overview BayesImpute is a statistical model that first identifies likely dropout events through the expression rate and coefficient of variation of gene in cell subpopulation, then imputes these values by constructing posterior distribution for each gene, and treats posterior mean as recovered expression value. As shown in Figure, the workflow of BayesImpute algorithm consists of four steps. The first step is preprocessing, which includes gene filtering, expression normalization and logarithmization. The second step is dimensionality reduction and preliminary clustering, which selects similar cells for subsequent identification and imputation step. During the identification step, BayesImpute distinguishes between dropout events and true zeros. Doing so reduces unwanted biases to the rest data and improves imputation accuracy. Finally, BayesImpute assumes that the actual gene expression values follow Gaussian distribution. It then follows Bayes rule to output the posterior distribution of true expression, and the posterior mean is used as recovered dropout values.
-
Notifications
You must be signed in to change notification settings - Fork 0
CSQxiaogongju/BayesImpute
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
BayesImpute: a Bayesian imputation method for single-cell RNA-seq data
Resources
Stars
Watchers
Forks
Releases
No releases published