End to end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python 3 and developed by the World Well-Being Project at the University of Pennsylvania.
Python Shell
Switch branches/tags
Nothing to show
Clone or download
Latest commit d752743 Jul 13, 2018
Permalink
Failed to load latest commit information.
dlatk version 1.1.4 Jan 11, 2018
doc version 1.1.4 Jan 11, 2018
install Initial commit Oct 21, 2016
python27 version 1.0.1, see changelog Nov 21, 2016
.gitignore Initial commit Oct 21, 2016
LICENSE Initial commit Oct 21, 2016
MANIFEST.in Initial commit Oct 21, 2016
README.md docker tweeks Jul 13, 2018
dlatkInterface.py version 1.1.3 Jan 9, 2018
setup.py version 1.1.4 Jan 11, 2018

README.md

Differential Language Analysis ToolKit

DLATK is an end to end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python 3 and developed by the World Well-Being Project at the University of Pennsylvania and Stony Brook University.

It contains:

  • feature extraction
  • part-of-speech tagging
  • correlation
  • prediction and classification
  • mediation
  • dimensionality reduction and clustering
  • wordcloud visualization

DLATK can utilize:

Installation

DLATK is available via any of four popular installation platforms: conda, pip, github, or Docker:

New to installing Python packages?

It is recommended that you see the full installation instructions.

1. conda

conda install -c wwbp dlatk

2. pip

pip install dlatk

3. GitHub

git clone SEE_URL_ABOVE
cd dlatk
python setup.py install

4. Docker

Detailed Docker install instructions here.

docker run --name mysql_v5  --env MYSQL_ROOT_PASSWORD=my-secret-pw --detach mysql:5.5
docker run -it --rm --name dlatk_docker --link mysql_v5:mysql dlatk/dlatk bash

Dependencies

See the full installation instructions for recommended and optional dependencies.

Documentation

The documentation for the latest release is at dlatk.wwbp.org.

Citation

If you use DLATK in your work please cite the following paper:

@InProceedings{DLATKemnlp2017,
  author = 	"Schwartz, H. Andrew
		and Giorgi, Salvatore
		and Sap, Maarten
		and Crutchley, Patrick
		and Eichstaedt, Johannes
		and Ungar, Lyle",
  title = 	"DLATK: Differential Language Analysis ToolKit",
  booktitle = 	"Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations",
  year = 	"2017",
  publisher = 	"Association for Computational Linguistics",
  pages = 	"55--60",
  location = 	"Copenhagen, Denmark",
  url = 	"http://aclweb.org/anthology/D17-2010"
}

License

Licensed under a GNU General Public License v3 (GPLv3)

Background

Developed by the World Well-Being Project based out of the University of Pennsylvania and Stony Brook University.