End to end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python 3 and developed by the World Well-Being Project at the University of Pennsylvania.
Python Shell
Switch branches/tags
Nothing to show
Latest commit 1a27efb Jul 1, 2017 @sjgiorgi sjgiorgi version 1.1.0
Permalink
Failed to load latest commit information.
dlatk version 1.1.0 Jun 30, 2017
doc version 1.1.0 Jun 30, 2017
install Initial commit Oct 21, 2016
python27 version 1.0.1, see changelog Nov 21, 2016
.gitignore Initial commit Oct 21, 2016
LICENSE Initial commit Oct 21, 2016
MANIFEST.in Initial commit Oct 21, 2016
README.md version 1.1.0 Jun 30, 2017
dlatkInterface.py version 1.1.0 Jun 30, 2017
setup.py version 1.1.0 Jun 30, 2017

README.md

Differential Language Analysis ToolKit

DLATK is an end to end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python 3 and developed by the World Well-Being Project at the University of Pennsylvania and Stony Brook University.

It contains:

  • feature extraction
  • part-of-speech tagging
  • correlation
  • prediction and classification
  • mediation
  • dimensionality reduction and clustering
  • wordcloud visualization

DLATK can utilize:

Installation

DLATK is available via conda, pip or github.

conda install -c wwbp dlatk
pip install dlatk
python setup.py install

Dependencies

See the full installation instructions for recommended and optional dependencies.

Documentation

The documentation for the latest release is at dlatk.wwbp.org.

License

Licensed under a GNU General Public License v3 (GPLv3)

Background

Developed by the World Well-Being Project based out of the University of Pennsylvania.