-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No PDB acessions matched and Retrieving no protein structure files #111
Comments
Hi! Thanks for using After building the local CAZyme database with records downloaded from CAZy, did you retrieve the PDB accessions from UniProt using (semi-shameless plug of our paper in coming ;) ) To summarise a chunk of the paper (where it's explained better): When building the local CAZyme database,
Therefore, the resulting database only contains that data. - you can check this using sqlite3 -header GH.db "SELECT * FROM Pdbs"
So you first need to populate the local CAZyme database with PDB accessions from UniProt, using the
I'll add an additional note to the documentation to make this clearer - I can see how it doesn't seem obvious |
@HobnobMancer Thanks for the help with this dumb error! btw I realized that the paper explains this workflow very well. Sorry for that. However, the function
Even with this warning, I could get the PDB IDs and the PDB structures as well. Thanks again! |
Don't apologise! It should have been more obvious in the documentation :) I'm glad you got it working! ReproductionI can't reproduce this error. Using the following commands produces no errors for me: cazy_webscraper <email> --families PL20 -o cazy_db
cw_get_uniprot_data cazy_db <email> --families --pdb --sequence --ec And the data was downloaded and inserted into the local CAZyme database correctly - checked using:
BioservicesThe lines of code you are quoting are from bioservices. For reproducibility of work/research, I wouldn't recommend altering the code base of widely used packages such as bioservices. If you're having issues, I would recommend raising an issue in the respective GitHub repo. The Bioservices error 596 typically arised from issues with the new UniProt API (updated last year), as discussed in issue #100 . You need to running bioservices version >= 1.10.4. You might want to checkout the bioservices issue 224. |
I tried to run basic commands from README and Documentation. As my primary goal is to retrieve the PDB files, I started creating a local database with
And next, some command to get pdb structures
However, I got the following output:
I ran different settings with PL, GH, and GT, and I got the same result.
My system configuration:
The text was updated successfully, but these errors were encountered: