-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Method A doesn't give expected results #2
Comments
Hi Jubin It seems that there is a bug in the code. I will inspect the code and correct it. Best regards |
Many thanks again @NuruddinKhoiry. I'm very much looking forward to testing your next release. This will surely be a useful tool for many biologists who routinely perform metabarcode sequencing studies. |
Hi Jubin I have fixed the bug in the "vote" option and added the taxonomy ids in the output tables in the new release of blastmining (v.1.2.0). pip install blastMining==1.2.0 Best regards, Khoiri |
Many thanks @NuruddinKhoiry. I'm on vacation now but will be back at work in a couple of weeks. Will definitely test out the new version and try to get back to you with my feedback before the end of this year |
Hi Jubin I will close this issue. If you found any error in the new release, please open a new issue. Best regards, Khoiri |
Hi,
I quickly tested your tool and observed that the Method A (majority vote with percent identity cut-off) doesn't give expected results as stated in the README. All results in
lca_method.tsv
indicate the species name rather than the taxon names (reflecting kingdom,phylum,class,order,family & genus taxonomy ranks) as informed by the percent identity cutoff. I made this testing on the conda installation of your package and below are the command-lines that I had used.cat test_zotus.fasta | parallel --gnu -j 100 --recstart '>' -N 10 --pipe blastn -task blastn -query - -db /home/antonycp/tools/ncbi-blast-2.3.0+/bin/nt -outfmt \'6 qseqid sseqid pident length mismatch gapopen evalue bitscore staxid\' -qcov_hsp_perc 70 -max_target_seqs 10 > test_zotus_coverage_blastn_maxtarget10_screenresults.txt
Nucleotide-Nucleotide BLAST 2.12.0+ was used above
conda activate conda_blastmining_env/
blastMining vote -i test_zotus_coverage_blastn_maxtarget10_screenresults.txt -o vote_method -e 0.001 -txl 99,97,95,90,85,80,75 -n 10 -sm 'Sample' -j 100 -p lca_method -kp -rm
head -n10 vote_method/lca_method.tsv
I'm attaching here the input files (in .txt format so as to comply github upload rules)
test_zotus_coverage_blastn_maxtarget10_screenresults.txt
test_zotus.fasta.txt
and the output files I got
lca_method.summary.txt
lca_method.tsv.txt
Also, is there any way to get the taxonomy ids in the final output files? I'm sure many users would really require this info. Thank you very much in advance!
The text was updated successfully, but these errors were encountered: