Skip to content

quanteda/dictionarytools

Repository files navigation

dictionarytools

CRAN_Status_Badge Travis-CI Build Status AppVeyor Build status Coverage Status

An R package consisting of dictionaries for text analysis and associated utilities. Designed to be used with quanteda but can be used more generally with any text analytic package (e.g. tidytext, tm, etc.).

Installing

# the devtools package needs to be installed for this to work
devtools::install_github("quanteda/dictionarytools") 

Plans

To do list includes adding functions that: - allow us to convert a wild-card or regex dictionary into a fixed match dictionary, for the supported languages (English, initially); - expand a core word list through synonyms using the wordnet package; - expand a core word list through (e.g.) cosine similarities to other words from a corpus; - expand a core word list through word2vec vector proximities to other words from a corpus; - allow easy editing of dictionaries via a round-trip to the editor.

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.

About

Tools for text analysis dictionaries

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages