-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Empty faa file for 1008392 produces error with diamond #14
Comments
I am following the online supplementary document and find this problem too. When I try I get
Then I remove this TaxID by
|
@urmi-21 @haojingshao you are exactly right. See issue #11 where we just now fixed this issue by removing the species 1008392, as suggested by @urmi-21, from the default set of representatives for bacteria. Apparently the proteome was removed this year from UniProt. Everything should work fine in the latest commit. @haojingshao the BLAST headers in I'll close this issue for now since the immediate problem of the missing 1008392 strain is solved in #11 and the general problem of missing sequences can be avoided by removing them as @haojingshao shows. |
I found that the function
add_recommended_prokaryotes
add a species with tax id 1008392. Theuniprot_fill_strata
function downloads an empty faa file for this. This creates an issue when using the newly addedstrata_diamond
function as diamond throws an error when using an empty file.I looked up NCBI and found this species only has a nucleotide sequence https://www.ncbi.nlm.nih.gov/taxonomy/?term=1008392
Uniprot also returns empty result: https://www.uniprot.org/uniprot/?query=taxonomy%3A1008392&sort=score
I think this species should be removed from
add_recommended_prokaryotes
My current workaround for this problem is to manually remove 1008392 from list of prokaryotes:
The text was updated successfully, but these errors were encountered: