Commit

Update README.md
JonasRieger committed Aug 26, 2019
1 parent 5f9d830 commit c13d6b1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -2,7 +2,7 @@
Tools for Statistical Content Analysis

## About
-'tosca' is a framework for statistical methods in content analysis. We offer a pipeline for preprocessing, model text corpora using a link to the implemantation of Latent Dirichlet Allocation from the 'lda' package. Useful plot routines for both - pre- and post-modeled corpora - are given for the descriptive analysis of text corpora and topic models. Moreover, an implementation of Chang's intruder words and intruder topics is provided - as well as reasoned sampling of text ids to get effective sets of texts for human labeling/coding.
+'tosca' is a framework for statistical methods in content analysis. We offer a pipeline for preprocessing, model text corpora using a link to the implemantation of Latent Dirichlet Allocation from the 'lda' package. Useful plot routines for both - pre- and post-modeled corpora - are given for the descriptive analysis of text corpora and topic models. Moreover, an implementation of Chang's intruder words and intruder topics is provided; as well as reasoned sampling of text ids to get effective sets of texts for human labeling/coding regarding accuracy of estimating Precision and Recall.
URL: https://github.com/Docma-TU/tosca
created at TU Dortmund University: http://docma.tu-dortmund.de/cms/de/home/R-Paket-_tosca_/index.html
