Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing HT018925962 and HT018925945 #299

Closed
jschnasse opened this issue Apr 4, 2016 · 5 comments
Closed

Missing HT018925962 and HT018925945 #299

jschnasse opened this issue Apr 4, 2016 · 5 comments
Assignees
Labels

Comments

@jschnasse
Copy link

@fsteeg
Copy link
Member

fsteeg commented Apr 4, 2016

API 1.x is not up to date due to the quaoar cluster issues (see hbz/nwbib#302).

Data 2.0 is on a different machine, seems to be up to date.

@fsteeg
Copy link
Member

fsteeg commented Apr 6, 2016

I've indexed the updates that were missing due to the cluster issues:

http://lobid.org/resource?id=HT018925962&format=full
http://lobid.org/resource?id=HT018925945&format=full

Same for sources:

http://lobid.org/resource?id=HT018925962&format=source
http://lobid.org/resource?id=HT018925945&format=source

Other missing resources should be present too. Assigning to @jschnasse for review.

Some notes on what I did:

To restore these, I've indexed the updates since 2016-03-26 (when first nagios warnings came) from:
http://index.hbz-nrw.de/alephxml/export/update/

  1. in lodmill, locally checked out into /home/fsteeg/git/lodmill:

cd /home/fsteeg/git/lodmill/lodmill-rd/doc/scripts/hbz01/ and download files here.

To process a single update:

bash -x startHbz01ToLobidResources.sh master /home/fsteeg/git/lodmill/lodmill-rd/doc/scripts/hbz01/DE-605-aleph-update-marcxchange-20160329-20160330.tar.gz lobid-resources NOALIAS quaoar2.hbz-nrw.de quaoar exact

To process multiple files and redirect output to log file:

bash -x startHbz01ToLobidResources.sh master dummy_ignore lobid-resources NOALIAS quaoar2.hbz-nrw.de quaoar exact doc/scripts/hbz01/updates.txt > 20160405-140410-master.log.startHbz01ToLobidResources.sh 2>&1

The updates.txt file contains full paths to the files, as in the single file sample above.

  1. in mabxml-elasticsearch, locally checked out into /home/fsteeg/git/mabxml-elasticsearch:

cd /home/fsteeg/git/mabxml-elasticsearch/src/main/resources/input and download files here.

In Transform.java, set DIR = "/home/fsteeg/git/mabxml-elasticsearch/src/main/resources/input" (temporary: set esIndexer.setIndexname("hbz01-staging"), see hbz/mabxml-elasticsearch#21) and run Transform.java.

@fsteeg fsteeg assigned jschnasse and unassigned fsteeg Apr 6, 2016
@fsteeg fsteeg added review and removed working labels Apr 6, 2016
@acka47
Copy link
Contributor

acka47 commented Apr 6, 2016

I came across entry HT001401787 where the JSON and the source don't describe the same title:

See http://lobid.org/resource?id=HT001401787&format=full vs. http://lobid.org/resource?id=HT001401787&format=source.

I don't know whether this has anything to do with this issue. If not, we need to open a new one.

@fsteeg
Copy link
Member

fsteeg commented Apr 7, 2016

Completely different titles? They (now) are both "Westfälische Bibliographie zur Geschichte, Landeskunde und Volkskunde". Could that have been a temporary issue? Or am I missing some detail?

@acka47
Copy link
Contributor

acka47 commented Apr 7, 2016

Strange. I swear these were different titles yesterday. Obviously, this was a temporary issue then.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants