subset(downsample= X) #3033

chrismahony · 2020-05-19T07:57:26Z

When I use the downsample argument of subset(), how does Seurat choose which cells to keep and which to exclude?

Thanks
Chris

yuhanH · 2020-05-22T15:38:33Z

Hi,
You can set invert = TRUE, and the input cells will then be excluded.
For example:

    # Cells belonging to identity class 0
    select.cell <- WhichCells(pbmc_small, idents = 0)
    # invert = TRUE keeps every cell except the selected ones
    pbmc_small <- subset(x = pbmc_small, cells = select.cell, invert = TRUE)

chrismahony · 2020-06-30T12:55:08Z

Thanks for this, but I really want to understand how the downsample function actually works. If I have an input of 2000 cells and downsample to 500, how are the other 1500 cells excluded? What parameters determine which cells are excluded? Thanks

yuhanH · 2020-06-30T14:47:54Z

downsample is an input parameter of WhichCells. Its documentation describes it as:

Maximum number of cells per identity class, default is Inf; downsampling will happen after all other operations, including inverting the cell selection
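As a rough illustration of the "per identity class" part, here is a minimal sketch of calling subset with downsample on the pbmc_small example object. It assumes Seurat 4 or later, where pbmc_small ships with the SeuratObject package; the cap of 10 and the name pbmc_down are arbitrary choices for the example.

```r
library(Seurat)
library(SeuratObject)  # assumption: pbmc_small is provided here (Seurat >= 4)

# Cells per identity class before downsampling
table(Idents(pbmc_small))

# Keep at most 10 cells from each identity class;
# classes that already have 10 or fewer cells are left untouched
pbmc_down <- subset(x = pbmc_small, downsample = 10)

# Cells per identity class after downsampling
table(Idents(pbmc_down))
```

Note that the cap applies to each identity class separately, so the total number of cells kept depends on how many classes there are and how large each one is.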

bug1303 · 2021-06-11T08:53:56Z

It's a closed issue, but I stumbled across the same question as well, and went on to find the answer.

You can inspect the code that is actually called by printing SeuratObject:::subset.Seurat, which in turn calls SeuratObject:::WhichCells.Seurat (as @yuhanH mentioned).

It first does all the selection and potential inversion of cells, and then this is the bit concerning downsampling:

    cells <- CellsByIdentities(object = object, cells = cells)
    cells <- lapply(X = cells, FUN = function(x) {
        if (length(x = x) > downsample) {
            x <- sample(x = x, size = downsample, replace = FALSE)
        }
        return(x)
    })

So indeed, it groups the cells by identity class (e.g. clusters, or whichever idents are currently set), and then for each group calls sample if that group contains more than the requested number of cells. Groups at or below the limit are kept whole. So, within each identity class, it's just a random selection without replacement.
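To make this concrete for the 2000-cell question above, here is a minimal base R sketch of the same logic on toy data. The helper downsample_by_identity, its seed argument, and the cluster sizes are all made up for illustration and are not part of Seurat; split stands in for CellsByIdentities. I believe WhichCells also has a seed argument of its own, but check the documentation of your installed version.

```r
# Hypothetical helper (not part of Seurat): per-identity downsampling
# on a named factor of identity labels.
downsample_by_identity <- function(idents, downsample = Inf, seed = NULL) {
  # Optionally fix the RNG state so the selection is reproducible
  if (!is.null(seed)) {
    set.seed(seed)
  }
  # Group cell names by identity class
  cells <- split(x = names(idents), f = idents)
  # Randomly thin only the classes that exceed the limit
  cells <- lapply(X = cells, FUN = function(x) {
    if (length(x) > downsample) {
      x <- sample(x = x, size = downsample, replace = FALSE)
    }
    x
  })
  unlist(cells, use.names = FALSE)
}

# Toy data: 2000 cells in three uneven clusters
idents <- factor(rep(c("A", "B", "C"), times = c(1200, 700, 100)))
names(idents) <- paste0("cell_", seq_along(idents))

kept <- downsample_by_identity(idents, downsample = 500, seed = 1)
table(idents[kept])
length(kept)
```

With these toy sizes, clusters A and B are each cut to 500 cells and cluster C keeps all 100, so 1100 cells remain rather than 500. A request to downsample to 500 therefore only yields 500 cells in total when all cells share a single identity.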

yuhanH closed this as completed May 22, 2020

evanbiederstedt mentioned this issue Dec 23, 2021

Downsample from each cluster kharchenkolab/conos#115

Open
