-
-
Notifications
You must be signed in to change notification settings - Fork 45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not recognizing Office 2007+ files (docx, xslx,...) #1
Comments
Ok. I am seeing the same behavior. I did not have a docx file type in my tests. One is added now. Looking into it now. Thanks and sorry for the delay. |
Actually my local file commands still fail on this. Can you post your magic file somewhere? Maybe pastebin.com? |
Here you go. This is the magic file that came with Cygwin for I'm guessing this is the relevant part:
|
Interesting. I don't support the search/... types but I guess I can add it. What I can do immediately is to add the [Content_Types].xml check and spit out Microsoft OOXML at least. |
So version 1.5 has much better processing of the 2007+ versions of these files. Thanks again. |
Thanks! I tested it out on .docx, .xlsx, and .pptx and they are working now. I forgot to mention that .xls and .ppt aren't recognized either (though .doc is). I can file a separate issue for those if you want. |
Hi, I found this project from your comment on the article http://www.rgagnon.com/javadetails/java-0487.html. I have used the UNIX "file" command with good accuracy, so seeing that the simplemagic library is based on the same logic appealed to me. Unfortunately this Java library doesn't have the same success rate. Particularly, it fails on most MS Office files from Office 2007+.
Here is what I get from SimpleMagic:
Here is what I expected using "file"
My system:
I thought it might be due to an older magic file, but unfortunately, using my system's magic file doesn't help much (actually makes it worse). What version of file/magic was used here? Perhaps the file format of magic changed since then?
The text was updated successfully, but these errors were encountered: