Skip to content
Tools for Statistical Content Analysis
R HTML
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
R
data
man
tests
vignettes
.Rbuildignore
.gitignore
.travis.yml
DESCRIPTION
NAMESPACE
README.md

README.md

tosca

Tools for Statistical Content Analysis
created at TU Dortmund University.

About

tosca is a framework for statistical methods in content analysis. We offer a pipeline for preprocessing, model text corpora using a link to the implemantation of Latent Dirichlet Allocation from the lda package. Useful plot routines for both - pre- and post-modeled corpora - are given for the descriptive analysis of text corpora and topic models. Moreover, an implementation of Chang's intruder words and intruder topics is provided; as well as reasoned sampling of text ids to get effective sets of texts for human labeling/coding regarding accuracy of estimating Precision and Recall.

Installation

See examples how to use tosca at the Vignette.

Citation

For a BibTeX entry please use citation(package = "tosca").

Contribution

This R package is licensed under the GPLv3. For wishes, issues, and bugs please use the issue tracker.

Build Status Coverage Status CRAN Status Badge CRAN Downloads Total Downloads

You can’t perform that action at this time.