You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Some of the PDF files that my application processes were created by scanners, so they're basically a PDF containing nothing but one image per page.
I would like to extract the images so that I can deal with them as images rather than PDFs. I don't want to use GetImage to convert the whole page to an image because this will include the margins around the image.
Looking at the source code, it looks like Docnet includes the PDFium calls required to extract images from PDF files:
Some of the PDF files that my application processes were created by scanners, so they're basically a PDF containing nothing but one image per page.
I would like to extract the images so that I can deal with them as images rather than PDFs. I don't want to use GetImage to convert the whole page to an image because this will include the margins around the image.
Looking at the source code, it looks like Docnet includes the PDFium calls required to extract images from PDF files:
docnet/src/Docnet.Core/Bindings/PdfiumWrapper.cs
Lines 3211 to 3214 in 728e6c9
docnet/src/Docnet.Core/Bindings/PdfiumWrapper.cs
Lines 1869 to 1887 in 728e6c9
It would be great if this was exposed so it was available to be used through Docnet.
The text was updated successfully, but these errors were encountered: