DocuScan Package

DocuScan is a lightweight document scanner.

DocuScan allows users to open up document types docx,doc,pdf and return the information inside as strings.

DocuScan also allows for manipulation of this information via regular expressions.

Requirements:

Installation:

Usage:

###It is worth noting that the fileName must be in the directory.

########### example :DocuScan("C:\Users\Person\Desktop\folder1\test.pdf")

Functionality:

returnFileText() - Returns the text of a file.
executeRegex(regexExpression) - creates a list of all matching cases of regexExpression
executeHeaderRegex(regularExpression) - creates a list of all matching cases of regexExpression in the header XML.
executeFooterRegex(regularExpression) - creates a list of all matching cases of regexExpression in the Footer XML.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py

Provide feedback