-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
When using -zip experiencing errors on a file from OPF Format Corpus #72
Comments
I see the cause of this one; I introduced this bug by accident. The code here branches if the if self.zip:
self.identify_contents(filename, type=self.container_type(matches)) The |
1b5698a updated self.container_type() to recognize OLE as an additional format, but this broke the `-zip` switch. The method was being used to identify two different categories of formats: 1. Container formats which need to be matched against the PRONOM container signatures in order to get more precise matches; and 2. Container formats which can be recursed into via the `-zip` switch in order to identify the formats of their contents. FIDO supports OLE for the former but not the latter, and since OLE is usually not interesting for its contents, it doesn't make sense to support recursing into it. This commit adds a new method which differentiates whether FIDO is interested in recursing into a format, not merely whether it *is* a container format, and updates the `-zip` path to check using it. Fixes #72.
Fixed by #73. I can confirm after that PR that |
Thanks @mistydemeo! @jhsimpson is going to merge #73 soon. |
Attempting to scan the opt-format-corpus I'm seeing an error from a specific file:
Distro stats:
My mirror of the OPF Format Corpus can be found here: https://github.com/ross-spencer/opf-format-corpus
The text was updated successfully, but these errors were encountered: