EditorX is an image-to-text web app that lets users edit the extracted text and save it as a PDF.
.
├── app.py
├── image_processor.py   # Extracts text from images
├── static               # Includes CSS, JS and image folders
│   ├── css
│   ├── js
│   └── image
│       ├── uploads
│       └── processed
├── templates            # HTML files
├── requirements.txt
├── LICENCE
└── README.md
conda create -n [name of environment] python=3.7
conda activate [name of environment]
pip install -r requirements.txt
Linux:
[sudo] apt-get install tesseract-ocr
macOS:
brew install tesseract
Windows: Find the instructions here.
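Once Tesseract is installed, it can help to confirm that the binary is visible on your PATH before starting the app. The snippet below is a minimal sketch and not part of EditorX: the function names find_tesseract and tesseract_version are hypothetical, and it assumes the executable is named tesseract, which is the usual name on Linux and macOS.

```python
import shutil
import subprocess


def find_tesseract():
    """Return the full path of the tesseract executable, or None if it is not on PATH."""
    # Assumption: the binary is called "tesseract" (hypothetical helper, not in the repo).
    return shutil.which("tesseract")


def tesseract_version(path):
    """Return the first line printed by `<path> --version`, or "" if nothing is printed."""
    # stderr is merged into stdout because some Tesseract releases print the version there.
    result = subprocess.run(
        [path, "--version"],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,
    )
    lines = result.stdout.splitlines()
    return lines[0] if lines else ""


if __name__ == "__main__":
    path = find_tesseract()
    if path is None:
        print("tesseract not found on PATH; install it before running app.py")
    else:
        print(path, tesseract_version(path))
```

If the script reports that tesseract was not found, text extraction will most likely fail, so revisit the install step for your platform first.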
python app.py