Subsetting Dataset #40

sylvia-science · 2023-10-20T10:03:22Z

Hello,

If I'm particularly interested in a subset of cells in my dataset, is it valid to run SAVER on just that subset?

Furthermore, I'm working on a dataset compiled from many different scRNA sources, where I suspect batch effects may be relevant. Would it be a good idea to run SAVER separately on each source instead of on the data combined from all sources?

Thank you!

mohuangx · 2023-10-23T02:06:00Z

Hi,

I would recommend running SAVER on the entire dataset, so that the prediction is performed on more cells, and then looking at the subset of the SAVER output. If computation is an issue, though, it's perfectly fine to run SAVER on just the subset of cells.

I agree that it's probably better to split the SAVER runs by source and then batch correct afterwards if needed. This way, SAVER won't pick up on differences between sources when performing the prediction.
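
To make the bookkeeping for the two suggestions above concrete, here is a minimal Python sketch: run the denoising step once per source, put the results back in the original cell order, and only then pull out the cells of interest. Everything named here is an assumption for illustration. In particular, `denoise` is a hypothetical placeholder that only rescales each cell by its relative library size; it is not SAVER and does not call the SAVER package, and the toy counts are made up.

```python
from collections import defaultdict


def denoise(cells):
    """Hypothetical placeholder for one SAVER run (NOT SAVER itself).

    `cells` is a list of per-cell count lists. Each cell is rescaled by its
    library size relative to the mean library size of the cells passed in,
    so the result depends on which cells are run together.
    """
    libs = [sum(c) for c in cells]
    mean_lib = sum(libs) / len(libs)
    return [[x * mean_lib / lib for x in c] for c, lib in zip(cells, libs)]


def denoise_per_source(cells, sources):
    """Run `denoise` separately on each source, preserving cell order."""
    groups = defaultdict(list)
    for i, s in enumerate(sources):
        groups[s].append(i)
    out = [None] * len(cells)
    for idx in groups.values():
        result = denoise([cells[j] for j in idx])
        for j, row in zip(idx, result):
            out[j] = row
    return out


# Toy data: 6 cells x 3 genes from two sources (made-up values).
cells = [[1, 4, 0], [0, 2, 5], [3, 0, 2], [2, 1, 2], [5, 1, 0], [1, 3, 6]]
sources = ["A", "B", "A", "B", "A", "B"]
of_interest = [True, True, False, False, True, False]

# One run per source, then look at the subset of the output.
per_source = denoise_per_source(cells, sources)
subset_view = [row for row, keep in zip(per_source, of_interest) if keep]
```

Any batch correction would then be applied to `per_source` as a separate step. The subset is taken last so that each run still sees every cell from its source.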

sylvia-science · 2023-10-23T16:05:28Z

Thank you for the fast response!

sylvia-science closed this as completed Oct 23, 2023
