Can't extract some images from PDF #2496

@ericosmic

I am trying to extract text from some PDF files, but some of the images, mainly title blocks, cannot be extracted correctly.
One of the affected files:
zf39.pdf
I used the code below to extract the images from the PDF:
[image: code screenshot]
but the resulting images look like scattered colored dots.
The extraction results:
t_1
t_0

My PyMuPDF version: ('1.20.2', '1.20.3', '20220813000001')

Labels: not a bug (not a bug / user error / unable to reproduce), wontfix (no intention to resolve)
