Experiment with downsampling one sample in one cell type #14

adamgayoso · 2022-11-22T00:19:29Z

As a function of downsampling:

- tree distance
- uncertainty

PierreBoyeau · 2022-12-01T15:53:11Z

I conducted the analysis; here are the setup and results.

Setup: we consider the semi-synthetic dataset. In this dataset, clusters A and B have a known sample stratification, and there is no stratification for the rest of the cells. To mimic differences in composition, we apply the following preprocessing steps to generate the data:

1. Subcluster A (resp. B) into two subclusters.
2. Assign a random binary value $c_s$ to each sample $s$; this value designates one of the two subclusters.
3. In each sample $s$, discard q% of the cells belonging to subcluster $c_s$.

This is meant to mimic potentially strong differences in cell composition per sample.
The aim of the experiment is to (i) verify that the distance matrix is not affected by these compositional differences and (ii) inspect the pertinence of the uncertainty estimates (see #11).
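
For concreteness, the discard step could be sketched as below. This is only an illustration under assumptions, not the code used for the experiment: cells are represented as hypothetical `(sample, subcluster)` tuples, with subcluster `0` or `1` inside the stratified cluster and `None` elsewhere, and the function name `downsample` and the `target_samples` argument are made up for this sketch. Passing `target_samples` restricts the discard to a subset of samples (for example, a single one).

```python
import random


def downsample(cells, q, target_samples=None, seed=0):
    """Discard q% of the cells of subcluster c_s in each targeted sample s.

    cells: list of (sample, subcluster) tuples; subcluster is 0, 1, or None
           (None marks cells outside the stratified cluster).
    q: percentage of cells to discard, between 0 and 100.
    target_samples: samples to downsample; None means all samples.
    Returns the kept cells and the mapping sample -> c_s.
    """
    rng = random.Random(seed)
    samples = sorted({sample for sample, _ in cells})

    # Random binary value c_s for every sample s.
    c = {s: rng.randint(0, 1) for s in samples}

    if target_samples is None:
        targets = samples
    else:
        targets = sorted(set(target_samples) & set(samples))

    drop = set()
    for s in targets:
        # Indices of the cells of sample s that fall in subcluster c_s.
        idx = [i for i, (sample, sub) in enumerate(cells)
               if sample == s and sub == c[s]]
        n_drop = round(len(idx) * q / 100)
        drop.update(rng.sample(idx, n_drop))

    kept = [cell for i, cell in enumerate(cells) if i not in drop]
    return kept, c
```

Calling `downsample(cells, 50)` affects every sample, while `downsample(cells, 50, target_samples=["s1"])` leaves all samples other than `s1` untouched.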

Results

The two figures below show, respectively, the Robinson-Foulds (RF) distance of the estimated matrices and the median of the propensity scores (y-axes) against the value of q (x-axis).
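
As a reminder of what the RF distance measures, here is a minimal sketch of its rooted (clade-based) variant. It assumes the trees have already been built from the distance matrices (for instance by hierarchical clustering) and are given as nested tuples of sample names; the thread does not say which variant or library was actually used, and the names `clades` and `rf_distance` are hypothetical.

```python
def clades(tree):
    """Return (leaves, clades) for a rooted tree given as nested tuples.

    A leaf is any non-tuple value; a clade is the frozenset of leaves
    found under one internal node.
    """
    if not isinstance(tree, tuple):
        return frozenset([tree]), set()
    leaves = frozenset()
    found = set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        leaves |= child_leaves
        found |= child_clades
    found.add(leaves)
    return leaves, found


def rf_distance(tree_a, tree_b):
    """Count the clades that appear in exactly one of the two trees."""
    leaves_a, clades_a = clades(tree_a)
    leaves_b, clades_b = clades(tree_b)
    if leaves_a != leaves_b:
        raise ValueError("trees must be defined over the same leaves")
    return len(clades_a ^ clades_b)
```

Two identical trees are at distance 0, and the distance grows by one for every grouping of samples that one tree contains and the other does not.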

adamgayoso · 2022-12-01T17:01:03Z

> In each sample $s$, discard q% of the cells belonging to subcluster $c_s$.

Do you do this for all samples, or just one sample at a time? It makes sense to me to do one sample, since we are interested in seeing whether that one sample becomes out of distribution.

adamgayoso assigned PierreBoyeau Nov 22, 2022

adamgayoso added the experiments label Nov 22, 2022

justjhong closed this as completed Feb 20, 2024
