Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Text is Not Rendered in Certain PDFs #348

Open
TechD123 opened this issue May 30, 2024 · 9 comments
Open

Text is Not Rendered in Certain PDFs #348

TechD123 opened this issue May 30, 2024 · 9 comments
Assignees

Comments

@TechD123
Copy link

Since updating from 3.19 to the recently published 3.26 (via F-Droid), certain PDFs don't render any text. As my main use case for this app is its PDF reading feature, I have reverted to the previous version.

I'll share a link to a sample PDF once I find one that's affected. So far I've only seen this on PDFs that would leak PII if I shared them here... :P

@andiwand
Copy link
Member

Thank you for reporting this @TechD123 ! If you don't want to share the files publicly you can send them to us via mail if that is an option. Otherwise we can wait for another file which has this issue

@TomTasche
Copy link
Member

I've seen the same behavior, but can't share those PDFs publicly either. @andiwand @ViliusSutkus89 let me know if you are interested in taking a look.

@andiwand
Copy link
Member

andiwand commented Jun 2, 2024

can you put it into the private testing repo? otherwise you can mail it to me @TomTasche

@ViliusSutkus89
Copy link
Contributor

If I could reproduce it, I could tell if the problem is in upstream pdf2htmlEX or if it's on our side

@ViliusSutkus89
Copy link
Contributor

@TomTasche tried those pdf's on pdf2htmlEX docker image available on docker hub and it gives the same result. Although that official pdf2htmlEX release docker image is 4 years old and using outdated Poppler and FontForge.

I've inspected the HTML DOM, all the text is there, but hidden by CSS. I assume it's a font issue, will try to debug

@ViliusSutkus89 ViliusSutkus89 self-assigned this Jun 21, 2024
@TomTasche
Copy link
Member

Since PDF is quite important for our users, I'd actually propose to set a bounty for this issue. @andiwand @ViliusSutkus89

I'd love to have a fix for this soon.

@ViliusSutkus89
Copy link
Contributor

pdf2htmlEX-Android conan is nearly there. Then I can focus on debugging upstream pdf2htmlEX

@ViliusSutkus89
Copy link
Contributor

Did some debugging, usually this error happens when fontforge errors out while trying to save a "malformed" font. Don't know how much of that malform is actually bad font and how much of that is bad parsing. Will see what I can do to workaround

TomTasche added a commit that referenced this issue Jul 15, 2024
…ion-bump

Upgrade pdf2htmlEX-Android to 0.18.25 to workaround issue #348
@TomTasche
Copy link
Member

A fix for that is rolling out now, thanks a lot @ViliusSutkus89 ! 🎉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants