Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

missing DTD's for esearch and esummary #290

Closed
norling opened this issue Feb 14, 2014 · 2 comments
Closed

missing DTD's for esearch and esummary #290

norling opened this issue Feb 14, 2014 · 2 comments

Comments

@norling
Copy link

norling commented Feb 14, 2014

Warning about missing DTD's to be downloaded.

to replicate:

Python 2.7.5 (default, Aug 25 2013, 00:04:04) 
[GCC 4.2.1 Compatible Apple LLVM 5.0 (clang-500.0.68)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from Bio import Entrez
>>> import Bio
>>> Bio.__version__
'1.63'
>>> Entrez.email = "my-email@slu.se"
>>> data = Entrez.read(Entrez.esearch("nucleotide", "48165[BioProject]"))
/Library/Python/2.7/site-packages/Bio/Entrez/Parser.py:525: UserWarning: Unable to load 
DTD file esearch.dtd.
[...]
>>> summary = Entrez.read(Entrez.esummary(db="nucleotide", id=data['IdList'][0]))
/Library/Python/2.7/site-packages/Bio/Entrez/Parser.py:525: UserWarning: Unable to load 
DTD file esummary-v1.dtd.
[...]

Keep up the good work! =)

@peterjc
Copy link
Member

peterjc commented Feb 14, 2014

The missing esearch.dtd file was handled on issue #287 6c2e7db

Strangely I'm not seeing the warning about esummary-v1.dtd here (perhaps already cached locally?):

$ python
Python 2.7.5 (default, Aug 25 2013, 00:04:04) 
[GCC 4.2.1 Compatible Apple LLVM 5.0 (clang-500.0.68)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from Bio import Entrez
>>> Entrez.email = "p.j.a.cock@googlemail.com"
>>> Entrez.read(Entrez.esummary(db="nucleotide", id="295413968"))
[{'Status': 'live', 'Comment': '  ', 'Caption': 'NC_014083', 'Title': 'Torque teno virus 25, complete genome', 'CreateDate': '2010/04/29', 'Extra': 'gi|295413968|ref|NC_014083.1||gnl|NCBI_GENOMES|26054[295413968]', 'TaxId': 687364, 'ReplacedBy': '', u'Item': [], 'Length': 3763, 'Flags': 768, 'UpdateDate': '2010/04/29', u'Id': '295413968', 'Gi': 295413968}]
>>> quit()

The raw XML (which GitHub tries to render if I paste it in full) includes:

<!DOCTYPE eSummaryResult PUBLIC "-//NLM//DTD esummary v1 20060131//EN" "http://eutils.ncbi.nlm.nih.gov/eutils/dtd/20060131/esummary-v1.dtd">

peterjc added a commit that referenced this issue Feb 14, 2014
@peterjc
Copy link
Member

peterjc commented Feb 14, 2014

The missing DTD file will be bundled with the next release (which through prior work will also download and cache missing DTD files automatically).

Thanks for reporting this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants