Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Experiment with downsampling one sample in one cell type #14

Closed
adamgayoso opened this issue Nov 22, 2022 · 2 comments
Closed

Experiment with downsampling one sample in one cell type #14

adamgayoso opened this issue Nov 22, 2022 · 2 comments
Assignees

Comments

@adamgayoso
Copy link
Member

as a function of downsampling:

  1. tree distance
  2. uncertainty
@PierreBoyeau
Copy link
Collaborator

I conducted the analysis, here are the setup and results

Setup: we consider the semi-synthetic dataset. In this dataset, clusters A and B have a known sample stratification, and there is no stratification for the rest of the cells. To mimick differences in composition, we conduct the following preprocessing step to generate the data:

  1. subcluster A (resp B) into two subclusters
  2. attribute a random binary value $c_s$ to each sample $s$, which is going to characterize one of the two subclusters
  3. In each sample $s$, discard q% of the cells belonging to subcluster $c_s$.

This aimed to mimick potentially strong differences in cell composition per sample.
The aim of the experiment is to (i). verify that the distance matrix is not affected by these compositional differences (ii). inspect the pertinence of the uncertainty estimates, see #11.

Results
Untitled
Untitled (1)

The two figures below respectively show the evolution of the RF distance of the estimated matrices and the median of the popensity scores (y-axes) against the value of q (x axis)

@adamgayoso
Copy link
Member Author

In each sample , discard q% of the cells belonging to subcluster .

Do you do this for all samples? Or just one sample at a time? It makes sense to me to do one sample as we are interested to see if that one sample becomes out of distribution

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants