Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More information - How to calculate meanFragmentLength #1

Closed
sagarutturkar opened this issue Mar 29, 2019 · 9 comments
Closed

More information - How to calculate meanFragmentLength #1

sagarutturkar opened this issue Mar 29, 2019 · 9 comments

Comments

@sagarutturkar
Copy link

Hello,

Thank you for providing this as a package and will help to streamline the calculations across users. I have seen some students performing wrong calculation.

  1. Can you please include the example data files (counts, annotations, sample_metric)?

  2. Picard CollectInsertSizeMetrics has columns:

MEDIAN_INSERT_SIZE
MODE_INSERT_SIZE
MEDIAN_ABSOLUTE_DEVIATION
MIN_INSERT_SIZE
MAX_INSERT_SIZE
MEAN_INSERT_SIZE
....and so on

For the meanFragmentLength are you referring to MEAN_INSERT_SIZE column? Just need to confirm since there could be different interpretations for fragment/insert.

Thank you.

@AAlhendi1707
Copy link
Owner

AAlhendi1707 commented Mar 30, 2019

Hi there,

I have added an example RNA-seq dataset to the package that can be used to validate your work. See
README.md for the usage example.

The fragment length is determined by your sequencing protocol. It’s more or less the distance between the two paired reads (including the read lengths, minus any overlap). For a typical long RNA-seq library prep it’s often in the order of 300–600 bp. You can run Picard’s CollectInsertSizeMetrics tool on your aligned BAM files to find the mean fragment length.

Also read this paper for more info about the fragment lenght calculation.

@sagarutturkar
Copy link
Author

Thank you! This is helpful!!

@AAlhendi1707 AAlhendi1707 pinned this issue May 4, 2019
@AAlhendi1707 AAlhendi1707 changed the title More information How to calculate meanFragmentLength Jan 4, 2020
@AAlhendi1707 AAlhendi1707 changed the title How to calculate meanFragmentLength More information - How to calculate meanFragmentLength Jan 4, 2020
@JonDo352
Copy link

JonDo352 commented May 27, 2020

Hi AAlhendi1707,

I have searched this problem over and over, and I keep getting routed back to this page. How does one convert from MEAN_INSERT_SIZE to meanFragmentLength? I have seen your posted example, but I am still having trouble understanding this. Any help would be appreciated.

Thank you.

@Counts-Xin
Copy link

Thank you! This is helpful!!

so,that means mean INSERT_SIZE plus 2*adapter?

@sherry308
Copy link

Can you please explain in detail how to get fragments size from Piccard. If it it insert size + adapter length, Where I will get information related rot adapter length?

@Sanataniitb
Copy link

The [CollectInsertSizeMetrics(Picard)] tool requires input BAM/SAM file for finding out the mean fragment length. Where as I have only the count matrix file. Is there any way to find the mean fragment length if I only have the count matrix file.

@Nairobi-2020
Copy link

https://www.biostars.org/p/83901/
for whole transcriptome, this thread is helpful.

@Nairobi-2020
Copy link

Nairobi-2020 commented Oct 7, 2022

I do not think we need to consider insertion size, etc. It evens out in large number.
Just the panel that selects regions, and how long of the panel is on the gene.

@camilla0801
Copy link

Hi, I don't understand the output of Picard. How do I get meanFragmentLengthfrom MEAN_INSERT_SIZE? in my case it's a number and it does not make any sense.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants