You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The issue has been traced back and found to be inherent to pdfminer's current implementation of the pdf text extraction method, and somewhat on the PDF format itself.
See Issue royjohal/pdfminer.six#1
Pointing to glyphs inside the font spec while the fontspec does not have a glyph->character mapping makes sure that a resultant text is not always found.
Summary
pdfminer sometimes omits characters in the textual output.
There are some characters missing.
Environment
The text was updated successfully, but these errors were encountered: