Skip to content
A small tool to normalize and extract values from unstructured text messages.
Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
db
docs
LICENSE
README.md
acronym_resolution.py
auto_correct.py
key_phrases.py
polarity_textblob.py

README.md

Emogram (Text Analysis for unstructured text)

Introduction

A small set of tools that'll normalize and extract values from unstructured text messages using concepts of NLP. Other Applications can use these modules to extract information and public opinions from Surveys, Social Networking sites, etc.

Getting Started

Prerequisites

What things you need to run the program:

  • Python Compiler (3.7 Recommended)
  • A clone of this repository :P
  • Install all the necessary packages form pypi by using the following command:
pip install textblob
pip install spellchecker

Functions

Acronym Resolution

Expands acronyms that are present in the text as the first step of text normalization.

Key Phrases Extraction

Rapid Automatic Keyword Extraction (RAKE) algorithm to determine key phrases in a body of text by analyzing the frequency of word appearance and its co-occurance with other words in the text.

Polarity Detection

Using TextBlob to detect Polarity of normalized text that ranges from -1 (Strongly Negative) to 1 (Strongly Positive).

Auto Correct

Autocorrects misspelt words/typos present in the text as a part of text normalization.

Authors



Made with ❤️ by Axemhammer

wave

You can’t perform that action at this time.