- A small app that will allow the users to login and upload PDF and image documents. The server will generate text from the image/document by OCR or Optical Character Recognition using Tesseract.
https://drive.google.com/file/d/1h9u3k_ubcC5y34Go23hr7nAqr0tTCpcV/view?usp=sharing
This Django web app uses pytesseract(https://pypi.org/project/pytesseract/) for OCR.
Note: This app requires you to install https://github.com/tesseract-ocr/tesseract in your Operating System to be able to use the OCR.
- User authentication and login
- Handles both PDF and image files
- Copy and generate text from images or PDF docs using OCR
- Ability to download the original upload files
This app uses PostgreSQL for DBMS. Download it from - https://www.postgresql.org/download/
Make sure you download and install tesseract(https://github.com/tesseract-ocr/tesseract) for OCR.
Clone the project
git clone: https://github.com/Sundeep220/Django_OCR
-
Before running this project:
-> Create a virtual environment, in Windows command is as follows:
pythom -m venv env
-> To activate:
env/Scripts/activate
-
To run this project do the following:
Pre-requisites: Python, pip and django should be installed in your system. All the coding is done using VScode.
- To install the dependencies:
pip install -r requirements.txt
- Before Applying the below steps make sure you have integerated postgres in you application.
Make the migrations:
python manage.py makemigrations
Migrate the tables:
python manage.py migrate
- Create a superuser for your project:
python manage.py createsuperuser
This will prompt you to enter username, email and password for the superuser. - Run the server using:
python manage.py runserver
- To install the dependencies:
To run this project, you will need to add the following environment variables to your .env file in the root folder
SECRET_KEY
DB_NAME
DB_USER
DB_PASSWORD
DB_HOST
DB_PORT
##App view:
- Home Page:
- Uploaded doc page:
- Extracted text page:
- Login Page:
- Register Page: