[CHROMIUM] Route to allow source as PDF file to be converted (currently only HTML/Markdown/URL) #882

michelrisucci · 2024-05-17T21:54:37Z

Currently we have Chromium routes for sources in HTML, Markdown and URLs:

Is there a way to have a PDF file as source?
It's useful for when you want to change it's PDF-A standard or simply by regenerating it to normalize/fix broken PDF.

Currently, PDF Engines endpoints have a route able to convert from source PDF file:

But it's simply not reliable as Chromium is. Tried some PDF files with EXIF pictures and it is breaking while trying to convert these attached pictures.
I would love to have the same behavior for Chromium.

gulien · 2024-05-22T07:35:25Z

Hello @michelrisucci

Chromium only handles HTML based inputs. When you ask to convert to a specific PDF/A format, it's actually the LibreOffice PDF engine that does the job. In other words, HTML into PDF via Chromium, and then PDF/A via LibreOffice. The PDF engines endpoint just skip the HTML part and calls LibreOffice directly.

michelrisucci · 2024-05-22T20:40:24Z

@gulien got it, but Chrome browser is able to read PDFs and regenerate them by printing to PDF.
Aren't we able to have an API to print PDF from PDF file?
I tried LibreOffice PDF engine, but, as I said, I faced some errors logging something related to "EXIF" pictures converting (PDF attachments?).

gulien · 2024-05-23T15:21:39Z

Does Chrome really re-render a PDF? 🤔

It's useful for when you want to change it's PDF-A standard or simply by regenerating it to normalize/fix broken PDF.

So it won't work for PDF/A, but it might work to normalize a broken PDF?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CHROMIUM] Route to allow source as PDF file to be converted (currently only HTML/Markdown/URL) #882

[CHROMIUM] Route to allow source as PDF file to be converted (currently only HTML/Markdown/URL) #882

michelrisucci commented May 17, 2024

gulien commented May 22, 2024

michelrisucci commented May 22, 2024

gulien commented May 23, 2024

[CHROMIUM] Route to allow source as PDF file to be converted (currently only HTML/Markdown/URL) #882

[CHROMIUM] Route to allow source as PDF file to be converted (currently only HTML/Markdown/URL) #882

Comments

michelrisucci commented May 17, 2024

gulien commented May 22, 2024

michelrisucci commented May 22, 2024

gulien commented May 23, 2024