hyaovi/django-ocr

Django-ocr

Django-based optical character recognition (OCR)

Extract text from PDFs and images (JPG, BMP, TIFF, GIF) and convert it into editable Word and text output formats.

It was created using Django.
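As a rough illustration of the extraction step, the sketch below drives the Poppler and Tesseract command-line tools listed under Prerequisites directly from Python. It is not taken from this project's code: the helper names `pdftoppm_command`, `tesseract_command`, and `ocr_image` are hypothetical, and the 300 dpi default is an assumption.

```python
import subprocess


def pdftoppm_command(pdf_path, out_prefix, dpi=300):
    """Build the Poppler command that renders each PDF page to a PNG file."""
    # 300 dpi is an assumed default, not a value taken from this project.
    return ["pdftoppm", "-png", "-r", str(dpi), pdf_path, out_prefix]


def tesseract_command(image_path, lang="eng"):
    """Build the Tesseract command that writes recognized text to stdout."""
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path, lang="eng"):
    """Run Tesseract on one image and return the recognized text."""
    result = subprocess.run(
        tesseract_command(image_path, lang),
        stdout=subprocess.PIPE,
        universal_newlines=True,  # decode output as text (works on Python 3.6)
        check=True,               # raise if Tesseract exits with an error
    )
    return result.stdout
```

For a PDF, one would first run the `pdftoppm` command to produce page images and then call `ocr_image` on each of them.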

Prerequisites

  • Python 3.6 or later
  • Poppler-utils
  • Tesseract OCR
  • Tessdata languages
    • Languages used: Afrikaans (afr), English (eng), French (fra), Russian (rus), Italian (ita), Chinese (chi_sim and chi_tra), Portuguese (por)
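A quick way to confirm the prerequisites are in place is a small check script. The sketch below is not part of the project. It assumes the executables are named `tesseract` and `pdftoppm` (the latter ships with Poppler), and it uses the standard tessdata codes `chi_sim`, `chi_tra`, and `por` for Chinese and Portuguese. The helper names are hypothetical.

```python
import shutil
import sys

# Assumed executable names: Tesseract installs `tesseract`, Poppler installs `pdftoppm`.
REQUIRED_TOOLS = ("tesseract", "pdftoppm")
# Standard tessdata codes for the languages listed above.
REQUIRED_LANGS = ("afr", "eng", "fra", "rus", "ita", "chi_sim", "chi_tra", "por")


def missing_tools(which=shutil.which):
    """Return the required executables that cannot be found on PATH."""
    return [tool for tool in REQUIRED_TOOLS if which(tool) is None]


def missing_languages(list_langs_output, required=REQUIRED_LANGS):
    """Compare the output of `tesseract --list-langs` with the required codes."""
    # The first line is a header; each following line holds one language code.
    installed = {line.strip() for line in list_langs_output.splitlines()[1:]}
    return [lang for lang in required if lang not in installed]


if __name__ == "__main__":
    if sys.version_info < (3, 6):
        print("Python 3.6 or later is required")
    print("Missing tools:", missing_tools() or "none")
```

To check the language packs, pass the text printed by `tesseract --list-langs` to `missing_languages`.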

Getting Started

  1. Clone the repository over HTTPS from your local terminal into a preferred location, then cd into the project.
git clone https://github.com/hyaovi/django-ocr.git

Cloning into 'django-ocr'...
remote: Enumerating objects...

cd django-ocr/
  2. Create a virtual environment and activate it.
python -m venv env

source env/bin/activate  # Linux/Mac
env\Scripts\activate  # Windows
  3. Install the dependencies.
pip install -r requirements.txt
  4. Run the local development server.
python manage.py runserver

Watching for file changes with StatReloader
Performing system checks...

System check identified no issues (0 silenced).
June 19, 2020 - 11:50:14
Django version 3.0.7, using settings 'core.settings'
Starting development server at http://127.0.0.1:8000/
Quit the server with CONTROL-C.
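Once the server reports that it has started, a small script can confirm it is reachable. This is a minimal sketch, not part of the project. The function name `server_is_up` is hypothetical, and the default URL is the address printed in the log above.

```python
import urllib.error
import urllib.request


def server_is_up(url="http://127.0.0.1:8000/", timeout=2.0):
    """Return True if an HTTP server answers at `url`, False otherwise."""
    # Bypass any configured proxy, since the target is a local address.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, even if with an error status such as 404.
        return True
    except OSError:
        # Covers refused connections and timeouts (URLError subclasses OSError).
        return False


if __name__ == "__main__":
    print("Server reachable:", server_is_up())
```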

Built With

  • Django: a high-level Python web framework that encourages rapid development and clean, pragmatic design.
  • Tesseract OCR: an open-source optical character recognition engine.
