Integrate advanced file scanning techniques #3

Open
cdgriffith opened this issue Nov 25, 2013 · 4 comments

Comments

@cdgriffith
Owner

Better identify common files, for example by opening .docx/.pptx/.xlsx files and inspecting the XML inside to figure out exactly which format each one is.
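A minimal sketch of that idea, assuming the usual layout Office writes (a `[Content_Types].xml` entry plus a main part at `word/document.xml`, `ppt/presentation.xml`, or `xl/workbook.xml`). The names `classify_zip` and `make_zip` are hypothetical helpers for illustration and are not part of puremagic; a stricter check would parse `[Content_Types].xml` rather than rely on these conventional paths.

```python
import io
import zipfile

# Conventional main-part paths (an assumption of this sketch, not a guarantee).
OOXML_MARKERS = {
    "word/document.xml": "docx",
    "ppt/presentation.xml": "pptx",
    "xl/workbook.xml": "xlsx",
}


def classify_zip(data: bytes) -> str:
    """Return 'docx', 'pptx', 'xlsx', or plain 'zip' for a ZIP payload."""
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        names = set(archive.namelist())
    # Every OOXML package carries this entry; without it, treat as a plain zip.
    if "[Content_Types].xml" not in names:
        return "zip"
    for marker, kind in OOXML_MARKERS.items():
        if marker in names:
            return kind
    return "zip"


def make_zip(members):
    """Build a small in-memory zip with the given member names (demo only)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for name in members:
            archive.writestr(name, "<x/>")
    return buffer.getvalue()
```

Input that is not a zip at all raises `zipfile.BadZipFile`, so a caller would want to catch that before falling back to other signatures.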

@eight04

eight04 commented Nov 21, 2022

Is it normal that a zip file was detected as docx?

@eight04

eight04 commented Nov 21, 2022

Using from_string.

@cdgriffith
Owner Author

@eight04 Wouldn't surprise me, as docx is actually a zip file. It probably means that I have too broad a match for the docx type.

Perchance do you know what program generated that zip file?
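To illustrate why a signature-only match cannot tell the two apart, here is a small sketch that builds a plain zip and a docx-shaped zip in memory and compares their leading bytes. The helper name `first_bytes` and the member names are made up for the example; this uses only the standard library and does not show puremagic's actual matching logic.

```python
import io
import zipfile


def first_bytes(members, count=4):
    """Write the given member names into an in-memory zip; return its first bytes."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for name in members:
            archive.writestr(name, "placeholder")
    return buffer.getvalue()[:count]


plain = first_bytes(["notes.txt"])
docx_like = first_bytes(["[Content_Types].xml", "word/document.xml"])

# Both archives open with the ZIP local file header signature, so the
# leading bytes alone cannot separate a docx from any other zip.
print(plain == docx_like)
```

Distinguishing them needs a look past the header, for instance at the member names stored in the archive.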

@eight04

eight04 commented Nov 26, 2022

Nope, but I can upload the zip file:
沙花叉word-20220523T041928Z-001.zip
There are more cases; I just uploaded one of them.

Windows 10
Python 3.10.8
puremagic 1.14
