Skip to content

SPAR10 allows various businesses and organizations to deal with textual data in a hassle-free manner and addresses several problems when dealing with textual data. All the features implemented are specific to their use cases and have a limited scope in terms of their performance. The core features are: Text Redaction, Text Summarization, Text Ex…

Notifications You must be signed in to change notification settings

riya-joshi-401/SPAR10-AI-Enabled-Text-Processing-Framework

 
 

Repository files navigation

SPAR10: AI Enabled Text Processing Framework

Frameworks and language used

Libraries used

Database

Editors and workspace

Front end tech stack

Introduction

Large volumes and a variety of data are generated by today's businesses in the form of e-documents, scanned images, emails, text messages and responses, user reviews, and so on. All of this is going to grow tremendously when they go digital. Almost 80% of the data created is semi-structured or unstructured, and the wide variety of document kinds and formats makes automation and analysis difficult. For enterprises to analyze and derive insights from thousands of documents, it would require huge manual effort and would be time-consuming.

SPAR10 is an intelligent AI-driven product which helps businesses automate the process of capturing and analyzing structured and unstructured text data through AI/ML. It is developed as a web application, using a variety of Linguistic and Machine Learning models along with Natural Language Processing to solve problems associated with text data. It applies different algorithms for each business problem.

Technical Manual

Tech stack used:

  • Programming Languages: Python, JavaScript
  • Libraries: Numpy, Pandas, OpenCV, PIL, Tensorflow, Tabula, Camelot, Nltk, Scikit-Learn, Seaborn, Matplotlib, Pickle, Keras, h5py
  • Front End Tools: HTML5, CSS, JQuery, Bootstrap
  • Back End Tools: Flask
  • Database: MySQL,SQLAlchemy
  • Editors/IDE: VScode, Jupyter Notebook, Git bash, Google Colab

Software required:

  • Web browser: To use the application.
  • Install Python 3.9.7 from https://www.python.org/downloads/
  • Install all the dependencies using the requirements.txt file using the following command:
    pip install -r requirements.txt
    

Setting up the database:

https://www.sqlite.org/2021/sqlite-dll-win64-x64-3370000.zip download this and extract it to anaconda3/dlls

Running the application:

Execute the following command on Command Prompt:

    python main.py

Developed with ♥️ by Purvi Harniya, Riya Joshi, Akhil Namboodiri and Sagar Kamat

About

SPAR10 allows various businesses and organizations to deal with textual data in a hassle-free manner and addresses several problems when dealing with textual data. All the features implemented are specific to their use cases and have a limited scope in terms of their performance. The core features are: Text Redaction, Text Summarization, Text Ex…

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 86.7%
  • HTML 9.1%
  • Python 4.2%