Skip to content

Transform images with text into a concise summary using Tesseract OCR and Google's Pegasus model

Notifications You must be signed in to change notification settings

skald1311/Summarize-Snap

Repository files navigation

LinkedIn


Logo

Summarize Snap

Transform images with text into a concise summary using Tesseract OCR and Google's Pegasus model
VIDEO DEMO

Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Contributing
  4. License
  5. Contact

About The Project

Summarize Snap is a cutting-edge project that seamlessly bridges the gap between visual content and concise textual summaries. This innovative solution is designed to streamline the process of extracting meaningful insights from images containing textual information. Whether it's a snapshot of a magazine article, a wiki page, or any other image containing text, Summarize Snap empowers users to swiftly and accurately obtain summaries.

ssd2

Key Features:

  1. Image-to-Text Conversion: Leveraging the power of Tesseract OCR (Optical Character Recognition), Summarize Snap efficiently converts images containing text into editable textual data. This foundational step ensures that the textual content is accurately extracted from the image, setting the stage for robust summarization.

  2. Advanced Text Summarization: With the integration of Google's Pegasus model, Summarize Snap takes text summarization to the next level. This state-of-the-art model, trained on massive amounts of data, excels at capturing the essence of lengthy passages and distilling them into concise and coherent summaries. The model I used was specifically trained on the cnn_dailymail dataset.

  3. User-Friendly Interface: Summarize Snap boasts an intuitive and user-friendly interface, making it accessible to both tech-savvy users and newcomers. Simply upload an image with text, and the tool takes care of the rest, ensuring a seamless user experience from start to finish.

  4. Versatility and Application: From students seeking to grasp the main ideas of dense academic texts to professionals needing quick insights from business documents, Summarize Snap finds application across various domains and sectors.

Experience the future of text summarization with Summarize Snap. Whether you're a researcher, a student, a professional, or simply someone looking to extract valuable information from images, this project offers a revolutionary solution at your fingertips. Embrace the synergy of Tesseract OCR and Google's Pegasus model for an unparalleled summarization experience.

Unlock the potential of images as a source of succinct knowledge with Summarize Snap today. Transform visual content into actionable insights effortlessly and elevate your information processing game.

(back to top)

Built With

  • Python
  • Pytorch
  • HTML5
  • CSS

(back to top)

Installation

LIVE DEMO ISN'T AVAILABLE BECAUSE UNFORTUNATELY I COULDN'T GET TESSERACT TO BE INSTALLED PROPERLY ON RENDER.COM. IT DOESN'T WORK BUT THE LINK IS HERE REGARDLESS: NOT WORKING

VIDEO DEMO HERE

LOCAL VERSION WORKS FINE, BELOW IS THE INSTRUCTIONS

To get a local copy up and running follow these simple example steps.

  1. Click the green button

image

  1. Download ZIP

    image

  2. Extract the file

    Make sure all of the files are in the same folder!!!
  3. Install Tesseract manually

    Latest installer for window: https://github.com/UB-Mannheim/tesseract/wiki

    For other OS: https://tesseract-ocr.github.io/tessdoc/Installation.html

    Search Edit the system environment variables -> Environment Variables -> PATH -> NEW -> add the path to tesseract-ocr (usually C:\Program Files\Tesseract-OCR) -> OK

    In Environment Variables -> New -> Variable name: TESSDATA_PREFIX | Variable value: C:\Program Files\Tesseract-OCR\tessdata -> OK

  4. Open cmd -> change directory to "src" folder -> Create a virtual environment (below is for Windows)

    py -3 -m venv .venv
    .venv\Scripts\activate
  5. Install all the dependencies

pip install -r requirements.txt

if this doesn't work, try this instead:

pip install transformers torch sentencepiece pytesseract Flask Flask-Reuploaded Flask-WTF
  1. Run the below command in terminal
    flask --app app run

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

(back to top)

Contact

Duong Hoang - LinkedIn

Project Link: github.com/skald1311/Summerize-Snap

(back to top)

About

Transform images with text into a concise summary using Tesseract OCR and Google's Pegasus model

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published