# Cell Cycle Scoring

This Jupyter notebook uses the Seurat single-cell RNA-seq analysis package to compute cell cycle scores for each cell in the Gasperini et al. 2019 dataset. The code is based on the Seurat cell-cycle scoring and regression vignette, available at: https://satijalab.org/seurat/articles/cell_cycle_vignette.html

Author: Karthik Guruvayurappan

In [1]:
library('Seurat')
library('Matrix')

Attaching SeuratObject

Attaching sp



In [None]:
# load in UMI count (expression) matrix
expression.matrix <- readMM('/iblm/netapp/data1/external/Gasperini2019/suppl/GSE120861_at_scale_screen.exprs.mtx')
head(expression.matrix)

In [None]:
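# convert expression matrix to column-compressed sparse format
# (a dense data frame of the full screen would be very memory-intensive;
# Seurat accepts sparse matrices directly)
expression.matrix <- as(expression.matrix, 'CsparseMatrix')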
head(expression.matrix)

In [None]:
# read in column names and add to expression matrix
cell.barcodes <- read.delim('/iblm/netapp/data1/external/Gasperini2019/suppl/GSE120861_at_scale_screen.cells.txt', header = FALSE)
cell.barcodes <- cell.barcodes$V1
colnames(expression.matrix) <- cell.barcodes
head(expression.matrix)

In [None]:
# read in row names and add to expression matrix
genes <- read.delim('/iblm/netapp/data1/external/Gasperini2019/suppl/GSE120861_at_scale_screen.genes.txt', header = FALSE)
genes <- genes$V1
rownames(expression.matrix) <- genes
head(expression.matrix)
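
The loading steps above (read a MatrixMarket file, then attach gene and cell names) can be tried on a toy example before running them on the full dataset. The following is a minimal sketch, not the notebook's actual data: the file names `toy.exprs.mtx`, `toy.genes.txt`, and `toy.cells.txt` and the object names are hypothetical, and the toy name files have a single column, so `readLines` is used in place of `read.delim`.

```r
library(Matrix)

# Write three tiny stand-in files to a temporary directory (hypothetical names)
toy.dir <- tempfile('toy_mtx_')
dir.create(toy.dir)
toy.counts <- sparseMatrix(i = c(1, 2, 3, 3), j = c(1, 1, 2, 3),
                           x = c(4, 1, 7, 2), dims = c(3, 3))
writeMM(toy.counts, file.path(toy.dir, 'toy.exprs.mtx'))
writeLines(c('geneA', 'geneB', 'geneC'), file.path(toy.dir, 'toy.genes.txt'))
writeLines(c('cell1', 'cell2', 'cell3'), file.path(toy.dir, 'toy.cells.txt'))

# Read the matrix back and keep it sparse (readMM returns triplet format)
toy.matrix <- readMM(file.path(toy.dir, 'toy.exprs.mtx'))
toy.matrix <- as(toy.matrix, 'CsparseMatrix')

# Rows are genes and columns are cells, matching the layout used above
rownames(toy.matrix) <- readLines(file.path(toy.dir, 'toy.genes.txt'))
colnames(toy.matrix) <- readLines(file.path(toy.dir, 'toy.cells.txt'))

# The number of names must match the matrix dimensions
stopifnot(nrow(toy.matrix) == 3, ncol(toy.matrix) == 3)
print(toy.matrix)
```

Checking that the number of gene and cell names matches the matrix dimensions before assigning them is a cheap way to catch a mismatched or truncated file.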

In [None]:
# build a Seurat object from the labeled UMI count matrix
gene.expression <- CreateSeuratObject(counts = expression.matrix)
gene.expression

In [None]:
# normalize counts (Seurat default: log-normalization with a scale factor of 10,000)
gene.expression <- NormalizeData(gene.expression)
gene.expression
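
For reference, with its default settings (`normalization.method = "LogNormalize"`, `scale.factor = 10000`) `NormalizeData` divides each count by the cell's total counts, multiplies by the scale factor, and applies `log1p`. The sketch below reproduces that arithmetic on a toy matrix using only the Matrix package; the object names are hypothetical, and it is an illustration of the formula rather than Seurat's internal implementation.

```r
library(Matrix)

# Toy count matrix (hypothetical): 3 genes x 3 cells
counts <- sparseMatrix(i = c(1, 2, 3, 1, 3), j = c(1, 1, 2, 3, 3),
                       x = c(2, 8, 5, 1, 9), dims = c(3, 3),
                       dimnames = list(paste0('gene', 1:3), paste0('cell', 1:3)))

# Total UMI count per cell (column sums)
lib.size <- colSums(counts)

# Divide each column by its library size, rescale, then log1p;
# multiplying by a diagonal matrix keeps the result sparse
scale.factor <- 1e4
normalized <- log1p((counts %*% Diagonal(x = 1 / lib.size)) * scale.factor)
dimnames(normalized) <- dimnames(counts)

# gene1 in cell1: 2 of 10 total counts, so log1p(2 / 10 * 10000)
stopifnot(isTRUE(all.equal(normalized['gene1', 'cell1'], log1p(2000))))
print(normalized)
```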

In [None]:
# identify highly variable genes using the variance-stabilizing transformation (vst) method
gene.expression <- FindVariableFeatures(gene.expression, selection.method = "vst")
gene.expression

In [None]:
# scale and center all genes, not only the variable features
gene.expression <- ScaleData(gene.expression, features = rownames(gene.expression))
gene.expression

In [None]:
# run PCA on the variable features, printing the top 10 genes for the first 5 PCs
gene.expression <- RunPCA(gene.expression, features = VariableFeatures(gene.expression), ndims.print = 1:5, nfeatures.print = 10)
gene.expression
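
In the Seurat vignette this notebook follows, the scoring itself is done with `CellCycleScoring`, using the S and G2/M marker lists shipped with Seurat as `cc.genes.updated.2019`. The sketch below shows that call on simulated toy counts rather than on the Gasperini data; the toy matrix and all object names are hypothetical. It assumes feature names are gene symbols that match the marker lists. If the genes file contains Ensembl IDs instead, they would first need to be mapped to symbols (or the marker lists mapped to IDs).

```r
library(Seurat)
library(Matrix)

set.seed(1)

# Marker gene symbols bundled with Seurat
s.genes <- cc.genes.updated.2019$s.genes
g2m.genes <- cc.genes.updated.2019$g2m.genes

# Toy data (hypothetical): 3000 genes x 100 cells of Poisson counts.
# The scoring bins genes by average expression and samples control genes
# from each bin, so the toy matrix needs a few thousand genes.
n.genes <- 3000
n.cells <- 100
cc.markers <- unique(c(s.genes, g2m.genes))
filler <- paste0('toygene', seq_len(n.genes - length(cc.markers)))
toy.counts <- Matrix(rpois(n.genes * n.cells, lambda = 2),
                     nrow = n.genes, ncol = n.cells, sparse = TRUE,
                     dimnames = list(c(cc.markers, filler),
                                     paste0('cell', seq_len(n.cells))))

toy <- CreateSeuratObject(counts = toy.counts)
toy <- NormalizeData(toy)

# Adds S.Score, G2M.Score, and Phase columns to the cell metadata
toy <- CellCycleScoring(toy, s.features = s.genes, g2m.features = g2m.genes,
                        set.ident = TRUE)
head(toy@meta.data[, c('S.Score', 'G2M.Score', 'Phase')])
```

Because the toy counts are random, the resulting scores and phase labels carry no biological meaning; the example only shows the shape of the call and of its output.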