Tika Image Detection misses images in PowerPoint and Word documents. #1372

Closed
hhuangMITRE opened this issue Jul 19, 2021 · 1 comment

Comments

@hhuangMITRE
Contributor

hhuangMITRE commented Jul 19, 2021

Right now TikaImageDetection appears to miss images embedded in *.ppt, *.pptx, *.doc, and *.docx documents. Identify whether the issue is specific to those formats and determine whether images can be properly extracted from such documents.

@hhuangMITRE hhuangMITRE self-assigned this Jul 19, 2021
@jrobble jrobble added the hotfix label Jul 19, 2021
@jrobble jrobble added this to To do in OpenMPF: Development via automation Jul 19, 2021
@jrobble jrobble added this to the Milestone 2 milestone Jul 19, 2021
@jrobble jrobble moved this from To do to In Progress in OpenMPF: Development Jul 19, 2021
@hhuangMITRE hhuangMITRE changed the title from "Tika Image Detection misses images in PowerPoint documents." to "Tika Image Detection misses images in PowerPoint and Word documents." Aug 4, 2021
@hhuangMITRE
Contributor Author

hhuangMITRE commented Aug 4, 2021

The issue has been identified and resolved in the TikaImageDetection branch. Creating a PR now for review.


2 participants