Multiple identification unexpected outcomes #146
Comments
Hi @Dclipsham. You're right to use the "-multi" switch to change this behaviour, but unfortunately all the currently available modes dispense with the priorities altogether. I haven't created a multi mode that replicates DROID's behaviour of doing the scan and then using the priorities afterwards to filter the results. It is certainly possible to create such a mode, and I think it would be a nice feature to have, so I'll mark this as a feature request. Hopefully there'll be a new release in the next couple of months and I'll try to get this into it.
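To make the "scan first, filter afterwards" idea concrete, here is a minimal Python sketch. It is not Siegfried's or DROID's actual code: the `PRIORITIES` table and the `filter_by_priority` function are hypothetical names, and the direction of the fmt/896 over fmt/101 relationship is assumed from the MusicXML/XML example in this issue.

```python
# Hypothetical priority table (an assumption for illustration):
# each key has priority over every format ID in its value set.
PRIORITIES = {"fmt/896": {"fmt/101"}}  # MusicXML over generic XML


def filter_by_priority(hits, priorities=PRIORITIES):
    """Post-scan filter: drop every hit that another hit has priority over."""
    superseded = set()
    for hit in hits:
        superseded |= priorities.get(hit, set())
    # Preserve the original order of the surviving hits.
    return [hit for hit in hits if hit not in superseded]


# A MusicXML file matches both signatures; the generic XML hit is filtered out.
print(filter_by_priority(["fmt/101", "fmt/896"]))
# The JPEG/HTML hybrid has no priority relationship, so both hits survive.
print(filter_by_priority(["fmt/41", "fmt/96"]))
```

Because the filter runs only after all matches are collected, unrelated formats are all reported while related ones still collapse to the most specific match.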
Thanks Richard, this makes sense and would be a welcome addition. I'm curious as to why, in the above examples, SF favoured one format over the other where there isn't a PRONOM priority set, i.e. in the JPG/HTML hybrid it chose JPG, but in the MOV/JPG hybrid it chose MOV. Is it that the positive extension outcome weighted it further?
For the JPG/HTML file: it started scanning and got the JPG match before the HTML match. Once it had confirmed the JPG match, it would only consider further matches if they had priority over JPG (e.g. more specific flavours of JPG).
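The behaviour described here, where priorities are applied during the scan, can be modelled roughly as follows. This is a simplified Python sketch, not Siegfried's implementation: `scan_with_priorities` and the `PRIORITIES` table are hypothetical, and the fmt/896 over fmt/101 relationship is an assumption taken from the MusicXML/XML example in this issue.

```python
# Hypothetical priority table (an assumption for illustration):
# each key has priority over every format ID in its value set.
PRIORITIES = {"fmt/896": {"fmt/101"}}  # MusicXML over generic XML


def scan_with_priorities(matches_in_scan_order, priorities=PRIORITIES):
    """Keep the first confirmed match; accept a later match only if it
    has priority over what is already confirmed."""
    confirmed = None
    for candidate in matches_in_scan_order:
        if confirmed is None:
            confirmed = candidate  # first match found is confirmed
        elif confirmed in priorities.get(candidate, set()):
            confirmed = candidate  # a more specific format replaces it
        # Any other later match is ignored.
    return confirmed


# JPEG is matched before HTML and nothing outranks it, so HTML is never reported.
print(scan_with_priorities(["fmt/41", "fmt/96"]))
# Generic XML is matched first, then replaced by the more specific MusicXML.
print(scan_with_priorities(["fmt/101", "fmt/896"]))
```

In this model the result for two unrelated formats depends purely on which signature is hit first, which is consistent with the JPG/HTML outcome described above.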
v1.10.0 has a new "droid" multi mode when building with roy.
This could be my misuse/misunderstanding of params...
I need to test for multiple identification outcomes as part of a service we're building. To mimic this test, I have created a pair of invalid files (that is, they don't represent real data that can be decoded by any format renderer) that contain the identification patterns required for multiple formats.
hybrid_jpeg_html_file.jpg contains the identification patterns of both Raw JPEG Stream (fmt/41), and Hypertext Markup Language (fmt/96)
hybrid_jpeg_mov_file.mov contains the identification patterns of both Raw JPEG Stream (fmt/41), and Quicktime (x-fmt/384)
fake_hybrid_files.zip
DROID returns multiple IDs as desired, but Siegfried (out of the box, Windows 10) seems to return only the first match. Using 'roy build -multi 3' and re-running gives the expected multiple identification outcome, but has the unwanted side-effect of no longer honouring priority relationships (in the example below, Music XML (fmt/896) and XML (fmt/101)). 'roy build -multi 2' stops giving me the multiple ID I'm after.
Outputs below:
multi 3:
multi 2:
multi 3 scanning of legitimate MusicXML file (unable to share):