Fix rule causing "Complex regular subexpression recursion limit" #318
Comments
Somewhat loosely related to LibreCat/Catmandu-OAI#7
I'll check it tomorrow when I'm back from vacation. Can you share the baseUrl, set, and metadataPrefix you are trying to harvest?
Tried the following:
use Data::Dumper;
try {
    $fixed = $fixer->fix($item);
} catch {
    warn "caught error: $_";
    print Dumper($item);
    return;
};
which made the harvest continue past the bad record, but for some reason
Repo details:
but note that the error occurs 5101200 records into the set, so it takes about 1 day and 8 hours to reach it :) If there's something I can test, let me know.
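For anyone hitting the same problem, here is a minimal sketch of how the failing record could be pinned down without dumping whole records. It uses only core Perl (`eval` instead of `try`/`catch`). `apply_fix` is a hypothetical stand-in for `$fixer->fix($item)`, the `bad` flag only simulates a failure, and the `_id` key and the example identifiers are assumptions about what the records look like, not something taken from the actual harvest:

```perl
use strict;
use warnings;

# Hypothetical stand-in for $fixer->fix($item): dies on a record flagged "bad".
sub apply_fix {
    my ($item) = @_;
    die "simulated fix failure\n" if $item->{bad};
    my %copy = (%$item, fixed => 1);
    return \%copy;
}

# Apply the fix to every record; collect the identifiers of records that fail.
sub fix_all {
    my (@items) = @_;
    my (@fixed, @failed);
    for my $item (@items) {
        my $result = eval { apply_fix($item) };
        if (!defined $result) {
            my $err = $@ || 'unknown error';
            chomp $err;
            # Assumption: each record carries its identifier under "_id".
            warn "fix failed for record $item->{_id}: $err\n";
            push @failed, $item->{_id};
            next;    # skip the bad record and keep going
        }
        push @fixed, $result;
    }
    return (\@fixed, \@failed);
}

my ($fixed, $failed) = fix_all(
    { _id => 'oai:example:1' },
    { _id => 'oai:example:2', bad => 1 },
    { _id => 'oai:example:3' },
);
print "fixed: ", scalar(@$fixed), ", failed: @$failed\n";
```

One thing to keep in mind with `Try::Tiny`: a `return` inside the `catch` block only leaves that block, not the enclosing sub or loop, which is why the sketch uses a plain `eval` with `next` to skip the record.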
I will check it for a while to see which resumption token crashes the harvest.
Interesting... I can boil it down to this kind of regex match that fails:
We checked it and it is probably the regular expression you are using that is funky. You get the same effect in Perl when doing:
The problem is that the pattern
Wow, thanks a lot for checking! Funnily enough, this is a regex I took from https://github.com/LibreCat/Catmandu/wiki/Example-Fix-Script :)
@danmichaelo 😃 Oops. I'd better fix that script.
My OAI harvest crashed with the error:
Unfortunately I don't know which record or which fix rule caused the problem. My fix file is here: https://gist.github.com/danmichaelo/d52035c4204cbe2b1c21c717102c3161
I can share my harvest.pl also if needed, but here's the short version:
Will try to see if I can add try..catch to find out which record caused this.