scalablePCA

This analysis wants to benchmark 7 differt PCA's methods. This repository contains code for reproducing the benchmark of the PCA's methods.
The dataset is available on https://support.10xgenomics.com/single-cell-gene-expression/datasets/1.3.0/1M_neurons.

In the "scanpy10x.py" script we normalize with total UMI count per cell, we filter genes with more than 1 count and select highly-variable genes, we log-tranform the data and then scale to unit variance and shift to zero mean. Finally we save the preprocessed object using "adata.write()"

Next, in the file "time_7_metodi/time_subset.R" we create downsample sizes of datasets (sizes 100k,500k, 1M) from the preprocessed object described above.

We use seven different methods to compute PCA:

BiocSingular_Random
BiocSingular_Irlba
BiocSingular_Exact
Scanpy_in_R
Scanpy_in_Python
BiocSklearn_in_R
BiocSklearn_in_Python

In the folder "time_7_metodi" you can find the script to reproduce e

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
davide		davide
mem_7_metodi		mem_7_metodi
time_3_metodi		time_3_metodi
time_7_metodi		time_7_metodi
.gitignore		.gitignore
README.md		README.md
scalablePCA.Rproj		scalablePCA.Rproj
sce2adataPCA.py		sce2adataPCA.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

scalablePCA

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

scalablePCA

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages