The DisKoveror Text Analytics Engine (DTAE) is a software product developed by Serendio for extracting information such as sentiment, text categories, and named entities from unstructured text. DTAE is useful for extracting information from Twitter streams, product reviews, emails, web pages, and any other source of unstructured text.
The workspace contains the core text analytics engine, which in turn leverages multiple open source packages and presents their results in a clean, structured, user-friendly format.
- Sentiment Extraction
- Topic detection
- Named Entity Recognition
- Java-based API
- Coreference Resolution
- Sentiment Polarity
The intended architecture of the system is given below.
The diskoveror-ta-engine leverages modules under the following independent categories:
- Stanford CoreNLP
- Apache OpenNLP
- DUKE
- Topic Modeling Algorithm
- Sentiment Analysis Algorithm
- Life Science Ontologies
- Legal Ontologies
New modules can be added under these categories without disturbing the existing system.
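One way to picture this kind of extensibility is a small registry keyed by category, so that adding a module never touches the modules already registered. The sketch below is illustrative only: `ModuleCategory`, `AnalysisModule`, `KeywordSentimentModule`, and `ModuleRegistry` are hypothetical names invented for this example and are not taken from the DTAE code base, and the keyword check is a toy stand-in for a real sentiment algorithm. It is written to compile on Java 7.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.EnumMap;
import java.util.List;
import java.util.Map;

// Hypothetical names throughout: none of these types come from DTAE itself.
// Only two of the categories listed above are modeled, to keep the sketch short.
enum ModuleCategory { TOPIC_MODELING, SENTIMENT_ANALYSIS }

// Minimal contract a pluggable module could expose (an assumption).
interface AnalysisModule {
    ModuleCategory category();
    String analyze(String text);
}

// Toy module: returns "positive" when the text contains the substring "good".
final class KeywordSentimentModule implements AnalysisModule {
    public ModuleCategory category() {
        return ModuleCategory.SENTIMENT_ANALYSIS;
    }

    public String analyze(String text) {
        return text.toLowerCase().contains("good") ? "positive" : "neutral";
    }
}

// Keeps modules grouped by category; registering one leaves the others untouched.
final class ModuleRegistry {
    private final Map<ModuleCategory, List<AnalysisModule>> byCategory =
            new EnumMap<ModuleCategory, List<AnalysisModule>>(ModuleCategory.class);

    void register(AnalysisModule module) {
        List<AnalysisModule> modules = byCategory.get(module.category());
        if (modules == null) {
            modules = new ArrayList<AnalysisModule>();
            byCategory.put(module.category(), modules);
        }
        modules.add(module);
    }

    // Returns a read-only view, or an empty list when nothing is registered.
    List<AnalysisModule> modulesFor(ModuleCategory category) {
        List<AnalysisModule> modules = byCategory.get(category);
        if (modules == null) {
            return Collections.<AnalysisModule>emptyList();
        }
        return Collections.unmodifiableList(modules);
    }
}
```

With this shape, a caller would create a `ModuleRegistry`, call `register(new KeywordSentimentModule())`, and then look up modules with `modulesFor(ModuleCategory.SENTIMENT_ANALYSIS)`. Supporting a new module means writing one more `AnalysisModule` implementation and registering it.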
To package the engine as a single executable .jar file for distribution, run the following command from the command line:
mvn clean compile assembly:single
Java version 7 build 79 or above
Apache Maven 3.0.5 or above