-
Notifications
You must be signed in to change notification settings - Fork 493
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Harvesting: OAI sets are not updated when datasets are deleted. #8005
Comments
Review with Leonid
|
2023/12/19: Prioritized during meeting on 2023/12/18. Added to Needs Sizing. |
This is an old (3 y.o.) issue. And also, note the update in the opening comment suggests that the problem may have been a fluke on the client side all along. |
What I said in the last comment appears to be correct, this is no longer an issue. The main set from demo is a perfect test for this, because we continuously purge most of the datasets created there, as described in the opening comment. Per OAI-PMH specifications, the OAI records for these datasets are kept, so that we can communicate to any clients that may have harvested them that they no longer exist. So that they know that they need to remove them on their end as well. |
P.S.
Actually, scratch that - all the oai_dc records on demo are valid. |
In the OAI protocol there is the concept of updating the list of identifiers in a set when something is deleted. Currently, in 5.5 on demo, a successful harvest of the "no set" set, ie all datasets, correctly harvests the 133 published datasets but fails 2068 others that no longer exist in the db. We believe this is a result of our monthly auto cleaning via API (destroy). So, it seems like this is not updating the OAI sets, leading to phantom harvest records. Need to verify it is server side versus client side.
Update: confirmed that the oairecord for the failing globalid was marked as removed on the server in the db, so either it is not passed as deleted by oai or maybe more likely, client does not handled the removed status correctly.
The text was updated successfully, but these errors were encountered: