Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about dealing with snATAC-seq data with multipel samples. #205

Closed
dmsalsgh97 opened this issue Apr 25, 2023 · 2 comments
Closed
Labels
question Further information is requested

Comments

@dmsalsgh97
Copy link

dmsalsgh97 commented Apr 25, 2023

Hi,
Thanks for developing this wonderful tool!

I wanna ask about dealing with snATAC_seq data with multiple samples.
I'm trying to follow the workflow in this. #137

I have a conditional snATAC-seq with 10 samples. (6 normal, 4 dis), and I've clustered cells using 3rd party program (ArchR).
Now, If I want to do TOBIAS analysis in a condition-specific manner, should I make separate .bam files per sample and merge them?

For example, If I have sample1, sample2 and cell-type A, B,
Then make an input .bam file by merging sample1_celltypeA.bam and sample2_celltypeA.bam?

Thanks!
Minho

@dmsalsgh97 dmsalsgh97 changed the title Questions about dealing with snATAC-seq data. Questions about dealing with snATAC-seq data with multipel samples. Apr 25, 2023
@msbentsen
Copy link
Member

Hi @dmsalsgh97

Thank you for your question - yes, I would do exactly like you suggest, and make:

sample1_celltypeA.bam + sample2_celltypeA.bam -> merged_celltypeA.bam
sample1_celltypeB.bam + sample2_celltypeB.bam -> merged_celltypeB.bam
(etc)

And use merged_celltypeA.bam and merged_celltypeB.bam as input for TOBIAS.

If the cells in each cluster have roughly the same number of reads, it should be fine to merge it. If the clusters are small (<100 cells) and one cell has a lot of reads compared to all other cells (this might be part of your QC in ArchR), you just have to keep in mind that this cell might dominate the TOBIAS analysis. However, if you have enough cells per cluster, this effect should be negligible.

I can't say what the minimum number of cells needed is, so that requires a little bit trial-and-error. I hope it works out!

@msbentsen msbentsen added the question Further information is requested label Apr 28, 2023
@dmsalsgh97
Copy link
Author

Thanks for your comments!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants