Document layout analysis resources and repositories for development with PdfPig.
Updated Oct 1, 2023 - C#
Proof of concept of training a simple region classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block on a PDF document page as one of title, text, list, table, or image.
Uses a Mask R-CNN model trained on the PubLayNet dataset with ML.NET in C# / .NET for document layout analysis and page segmentation tasks.
Proof of concept of a simple SVM region classifier using PdfPig and Accord.NET. The objective is to classify each text block on a PDF document page as one of title, text, list, table, or image.