OCR based text summarizer for mini project in AI module

Charith47/ocr-summarizer

User Interface Preview

  • User interface of initial page.

page-ui

  • User interface while in use.

page-ui-use

Server Installation

Required packages in Python (3.6+) environment

  • networkx
  • nltk
  • numpy
  • opencv-python
  • pytesseract
  • python-dotenv
  • quart
  • Quart-CORS
  • scipy
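Once the packages above are installed, a quick import check can confirm the environment is complete. The following is a minimal sketch, not part of the project: the helper name `find_missing` is hypothetical, and the import names (`cv2` for opencv-python, `dotenv` for python-dotenv, `quart_cors` for Quart-CORS) are the modules those distributions are normally imported as.

```python
import importlib.util

# Import names for the pip packages listed above.
# Note that some differ from the pip distribution name.
REQUIRED_MODULES = [
    "networkx",
    "nltk",
    "numpy",
    "cv2",         # opencv-python
    "pytesseract",
    "dotenv",      # python-dotenv
    "quart",
    "quart_cors",  # Quart-CORS
    "scipy",
]


def find_missing(modules):
    """Return the module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

The check only looks the modules up without importing them, so it stays fast and does not require the Tesseract OCR engine to be installed.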

Required third-party software

  • Tesseract OCR engine

Installation

  • Python packages using pip

      python -m pip install networkx nltk numpy opencv-python pytesseract python-dotenv quart Quart-CORS scipy
    
  • Installing Tesseract OCR engine

    • Download Link

    • Install to the default location (Recommended)

        C:\\Program Files\\Tesseract-OCR\\tesseract.exe
      
    • Otherwise, set the value of TESSERACT in the .env file to the actual install path.
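How the server actually consumes the TESSERACT value is not shown here, so the following is only a sketch of one plausible way to wire it up. The function name `resolve_tesseract_path` and the fallback order (the TESSERACT variable, then the system PATH, then the default Windows location) are assumptions for illustration. `load_dotenv` and `pytesseract.pytesseract.tesseract_cmd` are the real python-dotenv and pytesseract entry points.

```python
import os
import shutil

# Default install location mentioned above (Windows).
DEFAULT_TESSERACT = r"C:\Program Files\Tesseract-OCR\tesseract.exe"


def resolve_tesseract_path(env=None):
    """Pick the Tesseract executable path (hypothetical helper).

    Order: TESSERACT variable, then an executable found on PATH,
    then the default Windows install location.
    """
    env = os.environ if env is None else env
    configured = env.get("TESSERACT")
    if configured:
        return configured
    return shutil.which("tesseract") or DEFAULT_TESSERACT


if __name__ == "__main__":
    try:
        # python-dotenv: copies values from a .env file into os.environ.
        from dotenv import load_dotenv
        load_dotenv()
    except ImportError:
        pass  # fall back to the plain process environment

    path = resolve_tesseract_path()
    try:
        import pytesseract
        # Tell pytesseract which executable to invoke.
        pytesseract.pytesseract.tesseract_cmd = path
    except ImportError:
        pass
    print("Using Tesseract at:", path)
```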

Running the server

Client installation

  • Use any simple web server to serve the dist folder in the client directory.

  • For example, run python -m http.server inside dist to serve its files, then visit the printed link (e.g. http://localhost:8000/) to access the web client.
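The same static serving can also be started from a script, which avoids having to change into dist first. This is a minimal sketch using only the standard library. The helper name `make_server` and the relative path `client/dist` are assumptions, and the `directory` argument of `SimpleHTTPRequestHandler` requires Python 3.7 or newer.

```python
import functools
from http.server import HTTPServer, SimpleHTTPRequestHandler


def make_server(directory, port=8000, host="localhost"):
    """Create an HTTP server that serves static files from `directory`."""
    handler = functools.partial(SimpleHTTPRequestHandler, directory=directory)
    return HTTPServer((host, port), handler)


if __name__ == "__main__":
    # "client/dist" is an assumed path relative to the repository root.
    server = make_server("client/dist", port=8000)
    print("Serving on http://localhost:8000/ (Ctrl+C to stop)")
    try:
        server.serve_forever()
    except KeyboardInterrupt:
        pass
    finally:
        server.server_close()
```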

Future enhancements

  • Re-implement API in JavaScript and Express.js while keeping OCR/Summarization functionality in Python.