Thumbnailing Large/Complex pdfs can blow heap #23384
Comments
What this does:
Related cloud support ticket: https://dotcms.zendesk.com/agent/tickets/109178
Passed Internal QA. Docker Image: Note for QA: it may take a while; it took me 30 minutes until I could see the thumbnail.
Fixed. Tested locally: it takes about 10 to 15 minutes to show the thumbnail, which is not great, but it is not killing the server, so we are OK for now. Passed QA. Tested on release-23.01 // Docker // FF
We generate PDF thumbnails with an Apache library called PDFBox. The issue is that thumbnailing large or complex PDFs can overload the server and exhaust its heap space. This is a known issue with the library, and something they have addressed in later versions.
It is easy to reproduce: when I try to thumbnail this PDF locally, I get an OOM exception. At a minimum, we should be able to thumbnail PDFs without blowing the heap, regardless of how long it takes.
Here is a fat pdf that blows up:
neuroscience2009.pdf
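One piece of heap pressure when thumbnailing is the size of the rendered raster itself. Below is a minimal sketch, not the dotCMS fix and not PDFBox code, of deriving the render scale from the target thumbnail size instead of using a fixed high DPI. It assumes page dimensions in PDF points (1/72 inch) and a 4-bytes-per-pixel buffer. The class and method names (`ThumbnailScale`, `scaleFor`, `rasterBytes`) are hypothetical. It only estimates the output raster, and says nothing about the memory the library needs while parsing a complex PDF.

```java
public final class ThumbnailScale {
    private ThumbnailScale() {}

    /**
     * Scale factor (pixels per PDF point) that maps the longer page side
     * to maxSidePx pixels. Hypothetical helper for illustration.
     */
    public static double scaleFor(double pageWidthPt, double pageHeightPt, int maxSidePx) {
        if (pageWidthPt <= 0 || pageHeightPt <= 0 || maxSidePx <= 0) {
            throw new IllegalArgumentException("dimensions must be positive");
        }
        return maxSidePx / Math.max(pageWidthPt, pageHeightPt);
    }

    /**
     * Approximate bytes needed for the rendered raster at the given scale,
     * assuming 4 bytes per pixel (an assumption, e.g. an ARGB image).
     */
    public static long rasterBytes(double pageWidthPt, double pageHeightPt, double scale) {
        long widthPx = (long) Math.ceil(pageWidthPt * scale);
        long heightPx = (long) Math.ceil(pageHeightPt * scale);
        return widthPx * heightPx * 4L;
    }

    public static void main(String[] args) {
        double w = 612, h = 792;              // US Letter page, in points
        double at300Dpi = 300.0 / 72.0;       // fixed high-DPI render
        double forThumb = scaleFor(w, h, 512); // longer side capped at 512 px

        System.out.println("300 DPI raster bytes:   " + rasterBytes(w, h, at300Dpi));
        System.out.println("thumbnail raster bytes: " + rasterBytes(w, h, forThumb));
    }
}
```

Rendering directly at the thumbnail scale avoids allocating a full-resolution image only to shrink it afterwards. Whether that is enough for a file like the one attached above would need to be measured.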