New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Decide on minimum read count for exon expression #88
Comments
20x |
Is this per exon or mean across all exons per gene ? |
I think we want to keep exon with low / no expression. .. Lets set a threshold as sum of read counts for first 6 exons (as this is what we are looking at) to be 1000. |
Okay, when I do this (sum read coverage across first 6 exons per gene), I end up with only 2,497 genes having a sum of >= 1000.
|
Can check out code here: ceabigr/code/65-exon-coverage.qmd Lines 357 to 375 in a6a5f24
|
what would the gene count be if reduced sum to 100. |
|
Greater than 500? |
|
lets go forward with > 100 |
Alrighty, we may need to make further adjustments. Those numbers above were just from a single sample that I was using for code testing. I've managed to write code to look at all the files and do the threshold filtering for all samples on a per gene basis. I.e. All samples must have an exon coverage sum threshold of
|
Relevant code section: ceabigr/code/65-exon-coverage.qmd Lines 437 to 486 in 6b4bf89
|
lets go with threshold of 10 |
No description provided.
The text was updated successfully, but these errors were encountered: