extracting images from a document

Hi,

my task is to parse some docx files, extracting plain text and images from them.

am i right in thinking (after reading the docs and playing around with some code) that there is no way so far to extract images from a document? I.e. I know that the document has an `inline_shapes` property, but:
- the run doesn’t have one, so I can’t establish the link between an image and a run
- `InlineShape` doesn’t have a method for saving an image to disk?

I conjecture that the best way I have is to unzip the docx and parse xml?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

extracting images from a document #108

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

extracting images from a document #108

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions