Encoding issue #1290

Open
@nnurmano

I am testing PDF-to-Markdown conversion and I get (cid:588)(cid:607)(cid:623) in the output file. I looked into it and found out that you use pdfminer. I raised a similar issue with them, but never received a solution. Is there any way to replace pdfminer with a better tool, for example OCR tooling?
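
For reference, here is a minimal sketch of how these placeholders could be detected in the converted output. It assumes the `(cid:N)` tokens are what pdfminer writes when it cannot map a glyph's character ID to Unicode. The function names are hypothetical and not part of pdfminer or this project.

```python
import re

# Matches placeholders such as "(cid:588)" left in extracted text.
CID_PATTERN = re.compile(r"\(cid:(\d+)\)")


def find_cid_placeholders(text: str) -> list[int]:
    """Return the numeric IDs of all (cid:N) placeholders, in order."""
    return [int(cid) for cid in CID_PATTERN.findall(text)]


def cid_ratio(text: str) -> float:
    """Fraction of the text's characters that belong to (cid:N) placeholders."""
    if not text:
        return 0.0
    placeholder_chars = sum(len(m.group(0)) for m in CID_PATTERN.finditer(text))
    return placeholder_chars / len(text)


sample = "(cid:588)(cid:607)(cid:623)"
print(find_cid_placeholders(sample))  # [588, 607, 623]
print(cid_ratio(sample))              # 1.0
```

A high ratio on a page might be used as a signal to try a different extraction path for that page, such as OCR, though whether that fits this project is an open question.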
