Skip to content

Mutlaq0/aradf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF TO TEXT CONVERTER

convert PDF to TXT.

Instructions

  1. Install:
pip install aradf
  1. Install Tesseract, include arabic training data in the installation from: https://github.com/UB-Mannheim/tesseract/wiki

  2. convert PDF to TXT:

from aradf import convertor

# get the text, it also saves txt file to the same directory of the pdf
txt = convertor.pdf_to_txt('path/to/pdf_file')

About

arabic pdf to text convertor

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages