Extract embedded attachments from Microsoft Office documents #4861
Labels: f:request-analysis, improvement, x:uk
We've seen a couple of cases (example) where authorities send a Word document (.docx) with embedded attachments that seem impossible to open, presumably unless you have some specific version of Office.

It turns out you can extract them:
1. Change the file extension from .docx to .zip
2. Run unzip PATH_TO_FILE.zip
3. Look in word/embeddings
4. The embedded files have a .bin extension; change them to .pdf
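
The extraction steps above can be sketched in Python as a rough illustration, not a tested implementation for the project. The function name `extract_embeddings` and the output directory are hypothetical. The sketch also assumes, going beyond what the issue states, that an embedded file should only be renamed to .pdf when the `%PDF-` marker appears somewhere in its bytes; other files keep their original name. Python's `zipfile` reads the .docx directly, so the rename to .zip is not needed here.

```python
import zipfile
from pathlib import Path

# Folder inside the .docx archive where Word stores embedded objects.
EMBED_PREFIX = "word/embeddings/"


def extract_embeddings(docx_path, out_dir):
    """Copy every file under word/embeddings/ out of a .docx archive.

    Returns the list of paths written. Files ending in .bin are saved
    with a .pdf extension only if they contain a PDF header marker
    (an assumption of this sketch, not something the issue verifies).
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    with zipfile.ZipFile(docx_path) as archive:
        for name in archive.namelist():
            # Skip everything outside the embeddings folder, and directory entries.
            if not name.startswith(EMBED_PREFIX) or name.endswith("/"):
                continue
            data = archive.read(name)
            # Use only the base name so archive paths cannot escape out_dir.
            target = out / Path(name).name
            if target.suffix.lower() == ".bin" and b"%PDF-" in data:
                target = target.with_suffix(".pdf")
            target.write_bytes(data)
            written.append(target)
    return written
```

For example, `extract_embeddings("PATH_TO_FILE.docx", "embedded")` would write any embedded files into a new `embedded` directory. Whether a renamed file actually opens depends on the PDF viewer, as with the manual rename.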
You can do this without the command line, but I ran into the problem of the .zip extracting to a .cpgz. To extract this you can try downloading with a different browser (didn't work for me) or download The Unarchiver. (Via http://osxdaily.com/2013/02/13/open-zip-cpgz-file/)