-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Interpretation of reported abundance table #10
Comments
The first table represents the raw abundances based on the read mapping. The second table shows the estimated abundance on a specific taxonomic rank. This works as follows (will be described in our upcoming paper about food ingredient detection): For each taxon in the dataset we count the number of reads assigned to it. Taxa on lower levels than the requested taxonomic rank are pruned and their read counts are added to their respective parents, while reads from taxa on higher levels are distributed among their children in proportion to the weights of the sub-trees rooted at each child. After the redistribution the estimated number of reads and abundance percentages are returned as outputs. I will also add more detailed explanation to the Markdown documentation of the output options. |
So the best prediction would be the second table. |
Thanks for the quick response. Very helpful. |
Hi, could I ask in this thread again about 1 detail in the way of redistribution reads. How do you define the weight of each sub-tree in the taxonomic tree? Thanks |
The weight is the number of reads mapped to taxa in the sub-tree. |
Thank you for your prompt response! |
Hi. I'm running MetaCache query with the
-abundaces profile.tsv
and-abundance-per species
flags. It appears this writes two profiling results toprofile.tsv
: the full taxon profile and a species profile. However, these profiles do not appear to agree. For example, the taxon profile reports Streptococcus dysgalactiae subsp. equisimilis at 1.15% and no other results for S. dysgalactiae. The species profile reports S. dysgalactiae at 1.93%. Why is there a discrepancy?Relevant lines from
profile.tsv
:Is the best prediction of the abundance of S. dysgalactiae by MetaCache 1.15% or 1.93%?
Thanks,
Donovan
The text was updated successfully, but these errors were encountered: