
Mid-level package that uses Rtesseract and ReadPDF to get the intermediate-level elements from a document, e.g., table, title, sections, text.


dsidavis/GetDocElements


GetDocElements

This package takes the bounding boxes extracted from a PDF, by either ReadPDF or Rtesseract, and reconstructs the document's elements, e.g., columns, titles, and sections.
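To make the idea of reconstructing elements from bounding boxes concrete, here is a minimal sketch of one such step, grouping text boxes into columns. It is written in Python purely for illustration and is not this package's API: the `Box` type, the `group_into_columns` function, and the `min_gap` threshold are all hypothetical names, and the coordinates are assumed to be PDF-style (origin at the bottom-left, so a larger `top` is higher on the page).

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    """A hypothetical text bounding box in PDF-style coordinates."""
    left: float
    bottom: float
    right: float
    top: float
    text: str


def group_into_columns(boxes: List[Box], min_gap: float = 20.0) -> List[List[Box]]:
    """Split boxes into columns wherever a horizontal gap >= min_gap appears.

    Assumption: columns are separated by empty vertical strips, so sweeping
    left to right and tracking the rightmost edge seen so far finds them.
    """
    columns: List[List[Box]] = []
    current: List[Box] = []
    current_right = 0.0
    for box in sorted(boxes, key=lambda b: b.left):
        # Start a new column when this box begins well past everything seen so far.
        if current and box.left - current_right >= min_gap:
            columns.append(current)
            current = []
        current_right = box.right if not current else max(current_right, box.right)
        current.append(box)
    if current:
        columns.append(current)
    # Within each column, order boxes from the top of the page downward.
    return [sorted(col, key=lambda b: -b.top) for col in columns]


# Illustrative input: two lines in a left column, two in a right column.
boxes = [
    Box(320, 680, 500, 692, "Right column, line 2"),
    Box(50, 700, 250, 712, "Left column, line 1"),
    Box(320, 700, 520, 712, "Right column, line 1"),
    Box(50, 680, 240, 692, "Left column, line 2"),
]
columns = group_into_columns(boxes)
for i, col in enumerate(columns, start=1):
    print(f"Column {i}: {[b.text for b in col]}")
```

A gap-based sweep like this is only one possible heuristic. Real pages with full-width titles or tables spanning several columns would need extra handling, which this sketch does not attempt.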
