New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
compare with needletail and kseq.h #2
Comments
dear Jianshu, very interesting, thanks much for the pointer! I was under the impression that the seq_io benchmark was exhaustive in terms of rust implementations, but it's not. That said, |
Got around to doing some benchmarks: natir/in_place_fastx#1 (comment) |
it seems very promising. also the in_place one. I will also do an experiment on that one. I suggest using metagenomes,like an environmental sample,can be 100G+ and very diverse. Thanks, Jianshu |
I tried your in place fastx commit (main.rs) with the error (inside the in place fastx clone from you fork): cargo run --release -- ~/Github/test_data/T4AerOil_sbsmpl5.fa I can confirm that it is fasta file: Thanks, Jianshu |
Hello Rayan,
It would be interesting to compare with biofast: https://github.com/lh3/biofast
It seems for the test dataset: M_abscessus_HiSeq.fq, needtail is still faster than even paralleled version of seq_io. This repo takes about 3 seconds with 12 threads while the needtail takes 0.8 seconds for plain fastq( M_abscessus_HiSeq.fq) file.
Thanks,
Jianshu
The text was updated successfully, but these errors were encountered: