pyqt-pdf2text

Convert PDFs or images into text files from a PyQt GUI, using Tesseract and PyPDF2.

Requirements

PyPDF2
pytesseract
pdf2image
PyQt5>=5.14

Poppler is already bundled with this repository. (The bundled copy was the latest version as of September 14, 2020.)

Note

The current GUI uses Tesseract only for image-to-text conversion, not for PDF-to-text conversion. That functionality does exist in script.py, so feel free to use it if you'd like.
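To make the distinction in this note concrete, here is a minimal sketch of the three conversion paths the listed libraries allow. It is not the code from script.py: the function names (`image_to_text`, `pdf_to_text`, `pdf_to_text_ocr`, `choose_converter`) and the suffix list are hypothetical, and the PyPDF2 calls assume PyPDF2 2.0 or later, where `PdfReader` and `extract_text` are available.

```python
from pathlib import Path

# Hypothetical list of suffixes treated as images in this sketch.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".bmp", ".tif", ".tiff"}


def image_to_text(image_path):
    """OCR a single image with Tesseract through pytesseract."""
    import pytesseract
    from PIL import Image

    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img)


def pdf_to_text(pdf_path):
    """Read the text layer embedded in a PDF with PyPDF2 (no OCR involved)."""
    from PyPDF2 import PdfReader  # assumption: PyPDF2 >= 2.0

    reader = PdfReader(pdf_path)
    # extract_text() can return an empty value for pages without a text layer.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def pdf_to_text_ocr(pdf_path, poppler_path=None):
    """OCR a scanned PDF: render pages with pdf2image, then run Tesseract."""
    import pytesseract
    from pdf2image import convert_from_path

    # poppler_path may point at a local Poppler "bin" folder; None uses PATH.
    pages = convert_from_path(pdf_path, poppler_path=poppler_path)
    return "\n".join(pytesseract.image_to_string(page) for page in pages)


def choose_converter(path):
    """Pick a conversion function from the file suffix."""
    suffix = Path(path).suffix.lower()
    if suffix == ".pdf":
        return pdf_to_text
    if suffix in IMAGE_SUFFIXES:
        return image_to_text
    raise ValueError(f"Unsupported file type: {suffix!r}")
```

The third-party imports sit inside the functions so that the module loads even before the requirements are installed. `pdf_to_text_ocr` is the path to try when a PDF is a scan and `pdf_to_text` comes back empty.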

How to install

Install Tesseract from Google.
Add the Tesseract installation directory to your PATH environment variable.
git clone this repository
pip install -r requirements.txt
python main.py
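If the application fails to start after these steps, a quick check of the environment can help. The following is a minimal sketch using only the standard library. The helper names are hypothetical, the module names are assumed to match the packages under Requirements, and the executable is assumed to be called `tesseract`.

```python
import importlib.util
import shutil

# Import names assumed to correspond to the packages listed under Requirements.
REQUIRED_MODULES = ("PyPDF2", "pytesseract", "pdf2image", "PyQt5")


def missing_modules(names=REQUIRED_MODULES):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def tesseract_on_path():
    """Return True if a 'tesseract' executable is reachable through PATH."""
    return shutil.which("tesseract") is not None


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing Python packages:", ", ".join(missing))
    if not tesseract_on_path():
        print("Tesseract was not found on PATH.")
    if not missing and tesseract_on_path():
        print("Environment looks ready.")
```

Run it from the same terminal and Python environment you use for `python main.py`, since both PATH and the installed packages can differ between shells and virtual environments.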

yjg30737/pyqt-pdf2text
