PDF TO TEXT CONVERTER

Convert PDF files to plain text (TXT).

Instructions

pip install aradf

Install Tesseract from https://github.com/UB-Mannheim/tesseract/wiki and include the Arabic training data during installation.
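
To confirm that Tesseract and the Arabic training data were installed, a check along the following lines may help. This is a minimal sketch and not part of aradf: the helper names `parse_langs` and `tesseract_has_arabic` are hypothetical, and it assumes the `tesseract` executable is on your PATH (on Windows you may need to add the install directory to PATH yourself). It relies on Tesseract's `--list-langs` option and on `ara` being the language code for Arabic.

```python
import shutil
import subprocess


def parse_langs(output: str) -> set:
    """Turn the output of `tesseract --list-langs` into a set of language codes."""
    lines = [line.strip() for line in output.splitlines()]
    # Skip empty lines and the "List of available languages ..." header line.
    return {line for line in lines if line and not line.startswith("List of")}


def tesseract_has_arabic() -> bool:
    """Return True if tesseract is on PATH and reports the 'ara' language."""
    exe = shutil.which("tesseract")
    if exe is None:
        return False  # Tesseract not found on PATH
    result = subprocess.run(
        [exe, "--list-langs"], capture_output=True, text=True, check=False
    )
    # Older Tesseract releases print the language list to stderr.
    return "ara" in parse_langs(result.stdout or result.stderr)


if __name__ == "__main__":
    print("Arabic data available:", tesseract_has_arabic())
```

If this prints `False`, re-run the Tesseract installer and select the Arabic language data, or check that the executable is reachable from your shell.
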
Convert PDF to TXT:

from aradf import convertor

# Get the extracted text; a .txt file is also saved in the same directory as the PDF
txt = convertor.pdf_to_txt('path/to/pdf_file')
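
To convert several PDFs at once, the call above can be wrapped in a small loop. The sketch below is an illustration, not part of aradf: `convert_folder` is a hypothetical helper, and it assumes `convertor.pdf_to_txt` accepts a file path string and returns the extracted text, as the example above suggests.

```python
from pathlib import Path
from typing import Callable, Dict


def convert_folder(folder: str, convert: Callable[[str], str]) -> Dict[str, str]:
    """Run `convert` on every .pdf file in `folder`, keyed by file name."""
    results = {}
    for pdf in sorted(Path(folder).glob("*.pdf")):
        # `convert` is expected to take a path string and return the text.
        results[pdf.name] = convert(str(pdf))
    return results


# Usage with aradf (assumed to behave as in the example above):
#   from aradf import convertor
#   texts = convert_folder('path/to/folder', convertor.pdf_to_txt)
```

Passing the converter in as an argument keeps the loop independent of aradf, so it can be tried with a stand-in function before running OCR on real files.
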
