But when I get them with Catmandu, it iterates forever. It gets the 498 items and then starts repeating (not all the items in a loop; it's a more complicated pattern).
The script at the bottom gets all items (you have to abort it at some point):
alinari.pl > alinari.rdf
Then I find the item IDs like this:
perl -ne 'print qq{$1\n} if m{<edm:ProvidedCHO rdf:about="(.*?)"}' alinari.rdf > ids
and analyze them with something like
sort ids|uniq -c|sort -nr|less
In my case, different items appeared 13, 12, and 1 times.
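To see the repetition pattern at a glance, the per-item counts can be collapsed into a small histogram. This is only a sketch, assuming the `ids` file holds one identifier per line; `repeat_histogram` and `distinct_ids` are hypothetical helper names, not part of Catmandu.

```shell
# Hypothetical helper: for each occurrence count, print how many distinct IDs have it.
# Output lines look like "<times_seen> <number_of_ids>", sorted by times_seen.
repeat_histogram() {
  sort "$1" | uniq -c | awk '{ n[$1]++ } END { for (k in n) print k, n[k] }' | sort -n
}

# Hypothetical helper: print the number of distinct IDs in the file.
distinct_ids() {
  sort -u "$1" | wc -l | tr -d ' '
}
```

Running `repeat_histogram ids` and `distinct_ids ids` on the aborted harvest should show whether the distinct count matches the 498 records the validator reports, and how the extra copies are distributed.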
Maybe that server (MINT OAI) doesn't return unique records/headers/identifiers. But the pagination (resumption token) protocol works fine, since http://validator.oaipmh.com/#ListRecords gets 498 records.
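In OAI-PMH, each partial ListRecords response carries a `resumptionToken`, and the last page has an empty one. One way to check the token sequence outside Catmandu is to follow the tokens by hand and stop if one repeats. The sketch below assumes `curl` is available and that a simple `sed` pattern is enough to extract the token (much more fragile than a real XML parser); `resumption_token` and `list_tokens` are hypothetical helper names.

```shell
# Hypothetical helper: print the resumptionToken found in a saved ListRecords
# response file (prints nothing if the token is absent or empty).
resumption_token() {
  tr -d '\n' < "$1" |
    sed -n 's/.*<resumptionToken[^>]*>\([^<]*\)<\/resumptionToken>.*/\1/p'
}

# Hypothetical helper: follow the tokens from an initial request and print each one.
# $1 = OAI base URL, $2 = initial query string. Stops on an empty or repeated token.
list_tokens() {
  base="$1"; seen=""
  page="$(mktemp)"
  curl -s "$base?$2" > "$page"
  while token="$(resumption_token "$page")" && [ -n "$token" ]; do
    case " $seen " in
      *" $token "*) echo "token repeated: $token" >&2; break ;;
    esac
    seen="$seen $token"
    echo "$token"
    # Follow-up requests carry only the verb and the token.
    curl -sG --data-urlencode "verb=ListRecords" \
         --data-urlencode "resumptionToken=$token" "$base" > "$page"
  done
  rm -f "$page"
}
```

For the set in this report, that would be `list_tokens 'http://panic.image.ntua.gr:9876/foodanddrink/oai' 'verb=ListRecords&set=1003&metadataPrefix=rdf'`. If the tokens come out distinct and the list ends, the server's paging is probably fine and the duplication is more likely on the client side.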
http://panic.image.ntua.gr:9876/foodanddrink/oai?verb=ListRecords&set=1003&metadataPrefix=rdf is a small set with 498 items.
http://validator.oaipmh.com/#ListRecords gets that many items.