-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lineage assignment of 'none' when processing many samples #85
Comments
Can you tell me the version of ncov-tools and ncov-parser being used?
… On Jun 11, 2021, at 5:55 PM, Madison Chapel ***@***.***> wrote:
Hello, I've been having a problem lately where the run_name_summary_qc.tsv file is failing to populate with lineage assignments. The lineage, lineage_notes, and scorpio_call columns all contain values of 'none' for all samples.
If I run fewer samples, the output seems fine. If there are ~100 or more samples being analyzed, the columns show the 'none' value. The lineage assignments in the lineage/lineage_report.csv seem to be fine regardless of how many samples I'm running ncov-tools on.
Any help with this would be appreciated! Currently I'm just doing a bit of a workaround by stealing the lineages from the lineage_report.csv and sticking them into the run_name_summary_qc.tsv file before I generate the pdf report, but I'd definitely prefer a cleaner fix.
Thanks!
Madison
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub <#85>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAV7CQLQ7IYBNO2QKWUY3TTSKA3VANCNFSM46RRK4MQ>.
|
Yep! ncov-tools is 1.7.1, and ncov-parser should be 0.6.7. We re-made the environment at 1:00pm today with mamba straight from the environment.yml file |
I just re-created my environment and made sure to match yours with the latest version. I ran it on 158 samples and all three columns (i.e. lineage, lineage_notes, scorpio_call) correctly showed the values in the lineage_report.csv file. I ran it with usher and pangolearn as the pangolin inference engine and both populated the fields in the _summary_qc.tsv file correctly.
Were there any errors in the snakemake log file by chance or did everything complete successfully?
… On Jun 11, 2021, at 6:25 PM, Madison Chapel ***@***.***> wrote:
Yep! ncov-tools is 1.7.1, and ncov-parser should be 0.6.7. We re-made the environment at 1:00pm today with mamba straight from the environment.yml file
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub <#85 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAV7CTLCUTZA767MCD7433TSKENBANCNFSM46RRK4MQ>.
|
@DarianHole just to confirm, the |
Correct! Sorry that was important info I should have included! Lineage report from pangolin is as expected, the |
@DarianHole @Mach-2 which platform (i.e. Illumina, Oxford-nanopore) were you processing when this issue occurred? |
I've seen it with both nanopore and the freebayes Illumina files as input. I could quickly check if the ivar Illumina files lead to a similar output though as I am sure I have a number I could quickly find. Exact input for the nanopore:
Command is just the all command for me:
|
@DarianHole I was able to reproduce the issue with a Nanopore run. The problem is with |
Thanks for the quick work Richard! It seems like it may not have been a sample number issue but instead a certain type of sample whose output that was messing with the final output (so more samples makes it more likely). I can test it right now and report back pretty quick as well |
Yep @rdeborja I can confirm that the changes to the |
@DarianHole great, thanks for the quick turn around. I've merged and created a new release (ncov-parser 0.6.8) on pypi. @Mach-2 can you this resolved your issue as well? |
Thanks for the quick solution Richard (and thanks for helping test things, Darian)! Darian and I were describing the same issue, so I think it's fair to assume that if the fix worked for his test data set then things are good to go now. |
Hello, I've been having a problem lately where the run_name_summary_qc.tsv file is failing to populate with lineage assignments. The
lineage
,lineage_notes
, andscorpio_call
columns all contain values of 'none' for all samples.If I run fewer samples, the output seems fine. If there are ~100 or more samples being analyzed, the columns show the 'none' value. The lineage assignments in the lineage/lineage_report.csv seem to be fine regardless of how many samples I'm running ncov-tools on.
Any help with this would be appreciated! Currently I'm just doing a bit of a workaround by stealing the lineages from the lineage_report.csv and sticking them into the run_name_summary_qc.tsv file before I generate the pdf report, but I'd definitely prefer a cleaner fix.
Thanks!
Madison
The text was updated successfully, but these errors were encountered: