Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
The 2 main changes here are changing the const sites list to a set to improve lookup time and pre-computing the positions at each variable site in a more efficient manner. For my test VCF (100k rows, 100 samples) `write_vcf` went from 95 seconds to 0.7 seconds. For the full 800k rows VCF, `write_vcf` went from ~8 hours to 5.6 seconds (5000x speedup). Times & profiling using cProfile CLoses nextstrain/augur#1378
- Loading branch information
1 parent
1aae85e
commit 8322d54
Showing
1 changed file
with
23 additions
and
28 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters