eol_names_and_ranks_archive.tar.gz only downloadable via browser (not command-line) #10

Open
JRice opened this issue Sep 15, 2015 · 5 comments

JRice commented Sep 15, 2015

Currently the archive ends up in /opt/downloads/eol_names_and_ranks_archive.tar.gz, and we need it to be accessible to PHP somewhere "standard".

Update the script: /opt/eol_php_code/rake_tasks/create_eol_archive_names_only.php

@JRice JRice added bug and removed bug labels Sep 15, 2015

JRice commented Sep 15, 2015

The URL should be static, and publicly available... apparently there is one that is working, location TBD.

Note that you should add this file to the .gitignore list, though!

JRice commented Sep 15, 2015

Is this already working? i.e.: http://services.eol.org/downloads/eol_names_and_ranks_archive.tar.gz

@JRice JRice changed the title Modify PHP names script [?] Modify PHP names script Sep 15, 2015
@JRice JRice added the bug label Sep 15, 2015
@JRice JRice changed the title [?] Modify PHP names script Modify PHP names script Sep 15, 2015
@JRice JRice modified the milestone: 2015-09-15 Sep 15, 2015
@jhpoelen

@jhammock @JRice thanks for making this available.

For some reason, using wget services.eol.org/downloads/eol_archive_objects.tar.gz results in:

--2015-09-15 08:52:46--  http://services.eol.org/downloads/eol_archive_objects.tar.gz
Resolving services.eol.org... 160.111.248.28
Connecting to services.eol.org|160.111.248.28|:80... connected.
HTTP request sent, awaiting response... 503 Service Unavailable
2015-09-15 08:52:47 ERROR 503: Service Unavailable.

The same occurs for wget http://services.eol.org/downloads/eol_names_and_ranks_archive.tar.gz

--2015-09-15 08:54:43--  http://services.eol.org/downloads/eol_names_and_ranks_archive.tar.gz
Resolving services.eol.org... 160.111.248.28
Connecting to services.eol.org|160.111.248.28|:80... connected.
HTTP request sent, awaiting response... 503 Service Unavailable
2015-09-15 08:54:45 ERROR 503: Service Unavailable.

Downloading in Firefox v40.0.3 seems to work ok (download still in progress...).

Can you reproduce this?
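
One way to narrow this down is to check whether the server answers differently depending on the User-Agent header. That would be one possible explanation for wget failing while Firefox works, but it is only a guess and not confirmed anywhere in this thread. The sketch below is a minimal probe under that assumption; the two User-Agent strings are illustrative placeholders, and `build_request` and `probe` are hypothetical helper names.

```python
import urllib.error
import urllib.request

URL = "http://services.eol.org/downloads/eol_names_and_ranks_archive.tar.gz"

# Assumption: illustrative User-Agent strings, not captured from the real clients.
USER_AGENTS = {
    "wget-like": "Wget/1.16 (linux-gnu)",
    "firefox-like": "Mozilla/5.0 (X11; Linux x86_64; rv:40.0) Gecko/20100101 Firefox/40.0",
}


def build_request(url, user_agent):
    """Build a HEAD request that carries the given User-Agent header.

    HEAD avoids downloading the archive; a server may treat HEAD and GET
    differently, so a GET probe may be needed to confirm any result.
    """
    return urllib.request.Request(
        url, headers={"User-Agent": user_agent}, method="HEAD"
    )


def probe(url, user_agent, timeout=30):
    """Return the HTTP status code the server sends for this User-Agent."""
    try:
        with urllib.request.urlopen(build_request(url, user_agent), timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx responses; the status code is still useful here.
        return err.code
```

Calling `probe(URL, ua)` for each entry in `USER_AGENTS` and comparing the status codes would show whether the 503 depends on the client string. If both return the same code, the cause lies elsewhere.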

@jhpoelen

The resource at http://services.eol.org/downloads/eol_names_and_ranks_archive.tar.gz contains the following columns:

| taxonID | scientificName | nameAccordingTo | taxonRank | genus | specificEpithet |
| --- | --- | --- | --- | --- | --- |
| 34543 | Enhydra | EOL Group on Flickr; IUCN Red List ... | genus | Enhydra | |

This list helps to associate a specific name string with an EOL page id. However, it is not clear how the name fits into a taxonomic tree (if at all).

I had a similar issue with GloBI, where Sergey (see globalbioticinteractions/globalbioticinteractions#70) asked for a full download of the taxa included in GloBI. To help with this, I created a dump with the full taxonomic path, including TSNs or taxon ids of external taxonomies. This way, Sergey (and others) can easily link a specific taxon to a multitude of taxonomies without having to retrieve them one by one.

I've included an example produced by GloBI after taxonomic name matching against various taxonomic services (including EOL's). Is there any way EOL could provide a similar taxon dump periodically? This would cut the GloBI name resolution time from about a week to hours or less, and would make use of the careful taxon links established by EOL curators.

| id | name | rank | commonNames | path | pathIds | pathNames |
| --- | --- | --- | --- | --- | --- | --- |
| EOL:328583 | Enhydra lutris | Species | Seeotter @de ; sea otter @en ; Nutria marina @es ; Merisaukko @fi ; Loutre de mer @fr ; Zeeotter @nl ; Loira de mar @oc ; | Animalia ; Chordata ; Mammalia ; Carnivora ; Mustelidae ; Enhydra ; Enhydra lutris | EOL:1 ; EOL:694 ; EOL:1642 ; EOL:7662 ; EOL:7670 ; EOL:34543 ; EOL:328583 | kingdom ; phylum ; class ; order ; family ; genus ; species |
| OTT:949676 | Enhydra lutris | species | | Animalia ; Chordata ; Mammalia ; Carnivora ; Mustelidae ; Enhydra ; Enhydra lutris | IRMNG:11 ; IRMNG:148 ; IRMNG:1310 ; IRMNG:12116 ; IRMNG:104767 ; IRMNG:1297077 ; IRMNG:10198728 | kingdom ; phylum ; class ; order ; family ; genus ; species |
| GBIF:2433670 | Enhydra lutris | species | | Animalia ; Chordata ; Mammalia ; Carnivora ; Mustelidae ; Enhydra ; Enhydra lutris | GBIF:1 ; GBIF:44 ; GBIF:359 ; GBIF:732 ; GBIF:5307 ; GBIF:2433669 ; | |

...
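
To illustrate why the path, pathIds and pathNames columns are useful together, here is a minimal sketch that zips them into one lineage per taxon. It assumes the three fields are parallel lists delimited by " ; ", as they appear in the EOL row of the example above; `split_list` and `parse_lineage` are hypothetical helper names and not part of any GloBI or EOL tooling.

```python
from itertools import zip_longest


def split_list(field):
    """Split a ';'-delimited field into whitespace-stripped items."""
    return [item.strip() for item in field.split(";")]


def parse_lineage(path, path_ids, path_names):
    """Combine the three parallel lists into (rank, name, id) tuples, root first.

    zip_longest pads with "" so a row with a shorter or missing list
    still yields one tuple per path element.
    """
    return [
        (rank, name, taxon_id)
        for name, taxon_id, rank in zip_longest(
            split_list(path),
            split_list(path_ids),
            split_list(path_names),
            fillvalue="",
        )
    ]


# Values copied from the EOL:328583 row of the example above.
row = {
    "path": "Animalia ; Chordata ; Mammalia ; Carnivora ; Mustelidae ; Enhydra ; Enhydra lutris",
    "pathIds": "EOL:1 ; EOL:694 ; EOL:1642 ; EOL:7662 ; EOL:7670 ; EOL:34543 ; EOL:328583",
    "pathNames": "kingdom ; phylum ; class ; order ; family ; genus ; species",
}

lineage = parse_lineage(row["path"], row["pathIds"], row["pathNames"])
for rank, name, taxon_id in lineage:
    print(f"{rank}\t{name}\t{taxon_id}")
```

With a dump in this shape, placing a name in the tree is a local operation on one row rather than a series of per-taxon lookups.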

@JRice JRice modified the milestones: 2015-09-15, 2015.09.29 Sep 29, 2015
@JRice JRice modified the milestones: 2015.09.29, 2015.10.27 Oct 27, 2015
@JRice JRice changed the title Modify PHP names script eol_names_and_ranks_archive.tar.gz only downloadable via browser (not command-line) Nov 10, 2015
@JRice JRice modified the milestones: 2015.10.27, 2015.11.10 Nov 11, 2015
@HudaSoliman HudaSoliman self-assigned this Nov 18, 2015
@jhammock jhammock modified the milestones: 2016.03.22, 2016.04.05 Apr 5, 2016
@jhammock jhammock modified the milestones: 2016.04.05, 2016.04.19 Apr 19, 2016
@jhammock jhammock modified the milestones: 2016.04.19, 2016.05.03 May 3, 2016
@jhammock jhammock modified the milestones: 2016.05.03, 2016.05.17 May 17, 2016
@jhammock jhammock modified the milestones: 2016.05.17, 2016.05.31 May 31, 2016
@jhammock jhammock modified the milestones: 2016.05.31, 2016.06.14 Jun 14, 2016
@jhammock jhammock modified the milestone: 2016.06.14 Jul 12, 2016