report abundance in summary when total number of reads is included in the merged profile #853

ShaiberAlon · 2018-06-08T19:53:16Z

When the total number of reads is reported in the misc data, we can use it to estimate the relative abundance of bins when creating the summary.

Basically, there should be a function that uses the following information:
size of each bin
number of reads mapped to each bin per sample
total number of reads per sample

It would then compute some kind of relative abundance estimate for each bin in each sample.
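To make the request concrete, here is a minimal sketch of such a function. It is not anvi'o code: the function name, the dictionary layout of the inputs, and the two reported metrics (percent of reads mapped, and reads per kilobase per million reads) are all assumptions chosen for illustration.

```python
def summarize_bin_abundance(bin_sizes, reads_mapped, total_reads):
    """Hypothetical helper, not part of anvi'o.

    bin_sizes:    {bin_name: bin length in nucleotides}
    reads_mapped: {sample_name: {bin_name: number of reads mapped}}
    total_reads:  {sample_name: total number of reads in the sample}
    """
    summary = {}
    for sample, per_bin in reads_mapped.items():
        total = total_reads[sample]
        if total <= 0:
            raise ValueError(f"total number of reads for {sample} must be positive")
        summary[sample] = {}
        for bin_name, n_reads in per_bin.items():
            size_kb = bin_sizes[bin_name] / 1000
            summary[sample][bin_name] = {
                # share of the sample's reads that mapped to this bin
                "percent_reads_mapped": 100 * n_reads / total,
                # the same signal, normalized by bin size and sequencing depth
                "reads_per_kb_per_million": n_reads / size_kb / (total / 1_000_000),
            }
    return summary


example = summarize_bin_abundance(
    bin_sizes={"Bin_1": 2000},
    reads_mapped={"S01": {"Bin_1": 500}},
    total_reads={"S01": 1_000_000},
)
```

Percent of reads mapped alone favors larger bins, which is why the sketch also reports a size-normalized value.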

brymerr921 · 2018-06-08T20:13:03Z

+1 !!!

ShaiberAlon · 2018-06-15T22:05:51Z

@meren, I'll be offline until next Thursday (6/21), and according to Evan, you plan to finish v5 by 6/24. This is my only v5 related open issue, but I wonder if someone else wants to take a stab at adding this step to the summary.

meren · 2018-06-16T03:05:52Z

I will take this over.

ShaiberAlon · 2018-06-16T04:22:02Z

Thank you!

meren · 2018-06-23T00:23:38Z

With the commit above, the summary reports all data stored in the misc additional data tables. The new section in the summary output looks like this:

ShaiberAlon · 2018-07-27T19:50:32Z

Re-opening this.

We should come up with a normalization that takes genome length and read length into account when estimating relative abundance from the percentage of reads mapped.
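One possible normalization, sketched below purely as an illustration and not as an agreed-upon design: estimate the mean coverage of each bin as (reads mapped × mean read length) / bin length, then report each bin's share of the summed coverage in the sample. The function name and input layout are hypothetical.

```python
def coverage_based_abundance(bin_sizes, reads_mapped, mean_read_length):
    """Hypothetical helper, not part of anvi'o.

    bin_sizes:        {bin_name: bin length in nucleotides}
    reads_mapped:     {bin_name: number of reads mapped} for ONE sample
    mean_read_length: mean read length (nucleotides) for that sample
    """
    # Estimated mean coverage: mapped nucleotides divided by bin length.
    coverage = {
        bin_name: n_reads * mean_read_length / bin_sizes[bin_name]
        for bin_name, n_reads in reads_mapped.items()
    }
    total_coverage = sum(coverage.values())
    # Share of the summed coverage; 0.0 everywhere if nothing mapped.
    shares = {
        bin_name: (cov / total_coverage if total_coverage else 0.0)
        for bin_name, cov in coverage.items()
    }
    return coverage, shares


coverage, shares = coverage_based_abundance(
    bin_sizes={"Bin_A": 1000, "Bin_B": 4000},
    reads_mapped={"Bin_A": 100, "Bin_B": 100},
    mean_read_length=100,
)
```

Within a single sample the read length cancels out of the shares, so it mainly matters for the coverage estimate itself and for comparing samples sequenced with different read lengths.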

We can look to see if this review is relevant: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5080976/

To recover read length, we will randomly select X(=1000?) reads and compute the mean length. This will be done in anvi-profile.
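A rough sketch of the sampling step, assuming the reads are available as an iterable of sequences. It uses reservoir sampling so that the reads only need to be streamed once; how this would hook into anvi-profile is not decided here, and the function name and parameters are hypothetical.

```python
import random


def estimate_mean_read_length(reads, sample_size=1000, seed=None):
    """Hypothetical helper, not part of anvi'o.

    reads: any iterable of read sequences (anything that supports len()).
    Returns the mean length of up to `sample_size` uniformly sampled reads.
    """
    rng = random.Random(seed)
    reservoir = []
    for i, read in enumerate(reads):
        if i < sample_size:
            # Fill the reservoir with the first `sample_size` reads.
            reservoir.append(len(read))
        else:
            # Keep read i with probability sample_size / (i + 1).
            j = rng.randint(0, i)
            if j < sample_size:
                reservoir[j] = len(read)
    if not reservoir:
        raise ValueError("no reads to sample from")
    return sum(reservoir) / len(reservoir)


mean_length = estimate_mean_read_length(["ACGT", "AC"], sample_size=1000)
```

If there are fewer reads than `sample_size`, all of them are used.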

ShaiberAlon added the feature request label Jun 8, 2018

ShaiberAlon assigned meren, ozcan, ekiefl and ShaiberAlon Jun 8, 2018

meren added the priority label Jun 8, 2018

meren added this to the v5 milestone Jun 8, 2018

meren unassigned ozcan, ekiefl and ShaiberAlon Jun 16, 2018

meren mentioned this issue Jun 22, 2018

Summary reporting of misc data #884

Merged

meren closed this as completed in #884 Jun 22, 2018

ShaiberAlon reopened this Jul 27, 2018

ShaiberAlon self-assigned this Jul 27, 2018

ShaiberAlon removed this from the v5 milestone Jul 27, 2018

meren removed their assignment Apr 20, 2019

meren unassigned ShaiberAlon Oct 3, 2019

meren added time-out / fade-out and removed priority labels Oct 3, 2019

meren closed this as completed Oct 3, 2019
