Re-write metagenome_contributions.py #1

gavinmdouglas · 2018-03-02T15:19:39Z

metagenome_contributions.py is no longer in this repository. There could be a better way to output the contributions, one that leverages the likelihoods output in R by the discrete hidden-state prediction methods.
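To illustrate the distinction raised here, the following is a minimal sketch of how per-state likelihoods differ from a single discrete prediction. The `likelihoods` values and both function names are hypothetical and are not taken from PICRUSt2 or from any R package. The sketch only shows the general idea of a most-likely state versus a probability-weighted mean.

```python
# Hypothetical likelihoods for one sequence and one gene family:
# copy number -> probability. These numbers are made up for illustration.
likelihoods = {0: 0.10, 1: 0.60, 2: 0.30}


def discrete_prediction(probs):
    """Return the single most likely copy number."""
    return max(probs, key=probs.get)


def expected_copy_number(probs):
    """Return the probability-weighted mean copy number."""
    return sum(state * p for state, p in probs.items())


print(discrete_prediction(likelihoods))
print(expected_copy_number(likelihoods))
```

A discrete prediction discards the uncertainty in the other states, whereas the weighted mean keeps it. Whether the weighted mean is the right summary for contributions is a design question this sketch does not settle.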

gavinmdouglas · 2018-05-04T14:35:37Z

metagenome_pipeline.py now outputs functions stratified by functional contributions by default. per_sample_functions.py is still an experimental script that uses probability distributions rather than discrete predictions.
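As a rough picture of what "stratified by contributions" means, here is a minimal sketch under simplifying assumptions. The input dictionaries, sample names, and function names are all hypothetical, the data structures are not PICRUSt2's file formats, and normalization by marker-gene copy number is ignored. Each contribution is taken to be a sequence's abundance in a sample multiplied by its predicted copy number for a function, and the unstratified value is the sum of those contributions over sequences.

```python
from collections import defaultdict

# Hypothetical inputs, made up for illustration.
# Sequence abundance per sample.
abundance = {
    "sample_A": {"seq_1": 10, "seq_2": 5},
    "sample_B": {"seq_1": 0, "seq_2": 8},
}
# Predicted copies of each function per sequence.
copies = {
    "seq_1": {"EC:1.1.1.1": 2, "EC:2.7.7.7": 1},
    "seq_2": {"EC:1.1.1.1": 1},
}


def stratified(abundance, copies):
    """Map (sample, function, sequence) to that sequence's contribution."""
    out = {}
    for sample, seqs in abundance.items():
        for seq, count in seqs.items():
            for func, n in copies.get(seq, {}).items():
                if count * n:  # skip zero contributions
                    out[(sample, func, seq)] = count * n
    return out


def unstratified(strat):
    """Sum contributions over sequences for each (sample, function)."""
    totals = defaultdict(int)
    for (sample, func, _seq), value in strat.items():
        totals[(sample, func)] += value
    return dict(totals)


strat = stratified(abundance, copies)
unstrat = unstratified(strat)
```

The stratified table has one row per sample, function, and contributing sequence, so it grows much faster than the unstratified one as the number of sequences increases.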

jjmmii · 2018-05-07T15:13:58Z

Hi @gavinmdouglas, I updated my local clone today and noticed that metagenome_pipeline.py is taking a long time to run (the job has been running for 8 hours and is still going). I guess that's because it's calculating the stratified output? Is there a way to turn this option off? Thank you so much.

Best,
-Jamie

gavinmdouglas · 2018-05-07T20:02:20Z

Hey Jamie,

This is definitely a problem, thanks for pointing this out. I have re-written how the stratified data is output and it is much faster now. I haven't added an option yet for non-stratified output only.

Thanks,

Gavin

jjmmii · 2018-05-08T06:26:36Z

Thanks so much Gavin! It is blazing fast now, but I noticed that pred_metagenome_unstrat.tsv has far fewer lines than the OUT_PREFIX.genefamilies.biom.tsv produced by a previous version run on the same data (853 lines vs 3333 lines, respectively). Also, strangely, when I check in pred_metagenome_strat.tsv which sequences are mapped to the ECs, only a few (9 out of 485 sequences) appear in the output. For example:

$ cut -f2 pred_metagenome_strat.tsv | sort-uniq-count-rank
595     seq_16
549     seq_13
504     seq_11
455     seq_9
454     seq_4
454     seq_6
443     seq_2
442     seq_5
160     seq_7
1       sequence

Coincidentally, these sequences are the very first ones in my data. Could this be a bug (i.e. not all output was written), or did PICRUSt2 only map a few of my sequences to genes?

Best,
Jamie
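The per-sequence tally above can also be reproduced without shell helpers. This is a minimal sketch that assumes the stratified table is tab-separated with a header row and has the sequence ID in the second column, as the shell output suggests. The function name `count_column` and the demo rows are hypothetical, and unlike the shell pipeline it skips the header line rather than counting it.

```python
import csv
import io
from collections import Counter


def count_column(handle, column=1, skip_header=True):
    """Count the values in one column of a tab-separated file, most frequent first."""
    reader = csv.reader(handle, delimiter="\t")
    if skip_header:
        next(reader, None)  # drop the header row if present
    counts = Counter(row[column] for row in reader if len(row) > column)
    return counts.most_common()


# Made-up rows standing in for a stratified table; column names are assumptions.
demo = io.StringIO(
    "function\tsequence\tsample_A\n"
    "EC:1.1.1.1\tseq_16\t3\n"
    "EC:2.7.7.7\tseq_16\t1\n"
    "EC:1.1.1.1\tseq_13\t4\n"
)
result = count_column(demo)
for seq_id, n in result:
    print(n, seq_id, sep="\t")
```

For a real file, the same function can be called on a handle opened with `open("pred_metagenome_strat.tsv", newline="")`. Comparing the number of distinct sequence IDs it reports against the number of input sequences gives a quick check for truncated output.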

jjmmii · 2018-06-05T02:11:44Z

Just to follow up: the problem is gone as of the latest clone of PICRUSt2, which I pulled yesterday.

gavinmdouglas closed this as completed May 4, 2018

dcm9123 mentioned this issue Dec 7, 2023

Customize database guidance #335

Closed
