Inconsistencies / missing data in automatic updates #372
Might be related to #363. |
Hi, Reading this issue, I assume that the records weren't added to lobid-gnd due to these update issues. So I just add this feedback here in case more examples help you fix the issue. Best regards, |
I reindexed those two (underlying issue is still unresolved): https://lobid.org/gnd/1312495189 |
Couldn't find the underlying problem. Especially strange is that the automatic updates are sometimes smaller than the later manually invoked ones. As @fsteeg mentioned, this could be a temporary network issue. There might also be a problem on the provider's side. Because this problem is hard to debug, and also to cope with possible problems on the provider's side, I suggest running a daily update in addition to the hourly updates. This way we should be safer in getting all the data. If agreed, I will configure an additional daily update. Or are there better ideas? |
+1 This sounds like a good approach to me. Isn't it the case that the number of reports has risen since we switched to hourly updates in November (#350)? The question is whether hourly updates are a good idea in the first place if people cannot rely on them being carried out reliably. |
This redundancy in updating the data should make staying in sync with the upstream data more stable.
#378 works as a safety rope. |
(+1 for additional daily updates, I've approved #378)
Could this in some way be related to the fact that the OAI-PMH interface expects UTC times, while the modification times in the data and the server use local time (see mail from J.R. on 2023-12-22)? |
From German Wikipedia:
So indeed: if we query what we think is the span from the last hour to now (MEZ, i.e. CET), we in fact query from just now to the next hour (UTC). I wonder why there was any data at all. Going to fix it. |
OAI-PMH expects UTC times. We use a server running on CET, so these times must be changed to UTC. The files that are produced will still have CET-based timestamps, though.
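The conversion described here can be sketched roughly as follows. This is only an illustration in Python, not the actual lobid-gnd code: the function names `to_oai_utc` and `last_hour_window` are hypothetical, and `CET` is a fixed UTC+1 offset chosen to keep the sketch self-contained. A real setup would more likely use a proper time zone such as `zoneinfo.ZoneInfo("Europe/Berlin")` so that summer time is handled as well.

```python
from datetime import datetime, timedelta, timezone, tzinfo
from typing import Tuple

# OAI-PMH timestamps are UTC, written with a trailing "Z".
OAI_FORMAT = "%Y-%m-%dT%H:%M:%SZ"

# Assumption: fixed CET offset (UTC+1), ignoring summer time.
CET = timezone(timedelta(hours=1))


def to_oai_utc(local: datetime, tz: tzinfo) -> str:
    """Interpret a naive server-local timestamp in `tz` and format it as UTC."""
    aware = local.replace(tzinfo=tz)
    return aware.astimezone(timezone.utc).strftime(OAI_FORMAT)


def last_hour_window(now_local: datetime, tz: tzinfo) -> Tuple[str, str]:
    """Return (from, until) in UTC for the hour ending at `now_local`."""
    # Convert to UTC first, then subtract, so the window is always one real hour.
    until = now_local.replace(tzinfo=tz).astimezone(timezone.utc)
    start = until - timedelta(hours=1)
    return start.strftime(OAI_FORMAT), until.strftime(OAI_FORMAT)
```

With the timestamp reported in this issue, `to_oai_utc(datetime(2023, 12, 27, 11, 12, 51), CET)` gives `2023-12-27T10:12:51Z`. Passing the unconverted local values instead would shift the whole window one hour into the future, which matches the behaviour described above.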
A new dump should be provided soon: |
We received a mail yesterday about missing records that were created last week. Example: https://lobid.org/gnd/1319507522 Creation date (see MARC) is: 2024-02-15 |
The example (https://lobid.org/gnd/1319507522) now works and I sent out a mail response. |
E.V. who sent the mail mentioned in #372 (comment) followed up on it by providing more entries that are still missing. I went through them to see on which day they were created and found entries from the following days:
He closes the email with the note that the list is not exhaustive and more entries are missing. As the missing updates significantly degrade the service, we should not wait for a new full dump but reindex titles, probably best starting at 2023-11-10, as this is the date we rescheduled the updates (see #350). |
Reindexed updates since https://lobid.org/gnd/1317861825 |
+1 It's OK for me to close this issue now, but we should monitor closely whether updates come in reliably. |
Closing. Updates have been fine during the last weeks/months. |
Via email feedback, original message on 12/22/23 12:38 by M.H.
New entry was missing in lobid-gnd:
https://services.dnb.de/oai/repository?verb=GetRecord&metadataPrefix=RDFxml&identifier=oai:dnb.de/authorities/1312101741
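For checking further reported IDs against the upstream source, the request above can be built programmatically. This is a minimal sketch in Python; the helper name `get_record_url` is hypothetical, and the base URL, `metadataPrefix`, and identifier pattern are simply taken from the URL above.

```python
from urllib.parse import urlencode

# Base URL as used in the GetRecord request quoted in this issue.
OAI_BASE = "https://services.dnb.de/oai/repository"


def get_record_url(gnd_id: str, metadata_prefix: str = "RDFxml") -> str:
    """Build an OAI-PMH GetRecord URL for a single GND authority record."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"oai:dnb.de/authorities/{gnd_id}",
    }
    # Keep ":" and "/" unescaped so the URL has the same shape as the one above.
    return f"{OAI_BASE}?{urlencode(params, safe=':/')}"
```

Calling `get_record_url("1312101741")` reproduces the URL above. If the record is available there but not under the corresponding https://lobid.org/gnd/ URL, the update was missed.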
Latest update is now on 2023-12-27T11:12:51.000, which is 2023-12-27T10:12:51Z in OAI-PMH, as clarified by DNB via email on 12/22/23, 17:13 by J.R.
Fetching updates manually worked, the missing resource is now in lobid-gnd:
https://lobid.org/gnd/1312101741
However, the automatic update for that time span on the server is way too small:
Compared to the manual run for the same time span (sol@quaoar3:~/git/lobid-gnd$ sbt "runMain apps.ConvertUpdates 2023-12-27T09:40:26Z 2023-12-27T10:40:25Z"):
Might have been temporary network issues, but at least we need better monitoring.
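One simple form such monitoring could take is comparing the size of an automatic update with a manual re-run for the same time span. This is a rough sketch in Python under stated assumptions: the function `update_looks_incomplete` and the 90 % threshold are hypothetical, and how the record counts are obtained is left open.

```python
def update_looks_incomplete(
    automatic_count: int, manual_count: int, tolerance: float = 0.9
) -> bool:
    """Flag an automatic update that fetched far fewer records than a
    manual re-run for the same time span.

    `tolerance` is an assumed threshold: the automatic run is flagged when
    it contains less than this fraction of the manual run's records.
    """
    if manual_count == 0:
        # Nothing was available upstream, so nothing can be missing.
        return False
    return automatic_count < tolerance * manual_count
```

A check like this would only detect the symptom described here, not explain it, but it could trigger a re-run or a notification before users report missing records.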