Skip to content

Identify unique combinations of taxids before running NCBI topology#4

Merged
maxibor merged 3 commits intomaxibor:masterfrom
alexhbnr:uniq_ncbitopology
Aug 20, 2021
Merged

Identify unique combinations of taxids before running NCBI topology#4
maxibor merged 3 commits intomaxibor:masterfrom
alexhbnr:uniq_ncbitopology

Conversation

@alexhbnr
Copy link
Contributor

@alexhbnr alexhbnr commented Aug 2, 2021

Reduces the run time of sam2lca by identifying all unique combinations of taxids that were observed across all reads and by running the expensive NCBI topology function of ete3 on these. For samples that have a lot of aligned reads but only a few combinations of taxids, this will decrease the number of lookups strongly.

alexhbnr and others added 3 commits July 30, 2021 11:33
Reduces the run time of sam2lca by identifying all unique combinations
of taxids that were observed across all reads and by running the
expensive NCBI topology function of ete3 on these. For samples that have
a lot of aligned reads but only a few combinations of taxids, this will
decrease the number of lookups strongly.
@maxibor maxibor merged commit 898ae8d into maxibor:master Aug 20, 2021
@alexhbnr alexhbnr deleted the uniq_ncbitopology branch October 13, 2021 19:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants