
quanteda: quantitative analysis of textual data



An R package for managing and analyzing text, created by Kenneth Benoit. Supported by the European Research Council grant ERC-2011-StG 283794-QUANTESS.

For more details, see

quanteda version 3

The quanteda 3.0 major release improves functionality, completes the modularisation of the package begun in v2.0, further improves function consistency by removing previously deprecated functions, and enhances workflow stability and consistency by deprecating some shortcut steps built into some functions.

See the package's NEWS file for a full list of the changes.

The quanteda family of packages

As of v3.0, we have continued our trend of splitting quanteda into modular packages. These are now the following:

  • quanteda: contains all of the core natural language processing and textual data management functions
  • quanteda.textmodels: contains all of the text models and supporting functions, namely the textmodel_*() functions. This was split from the main package with the v2 release
  • quanteda.textstats: statistics for textual data, namely the textstat_*() functions, split with the v3 release
  • quanteda.textplots: plots for textual data, namely the textplot_*() functions, split with the v3 release
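As a rough illustration of how the split packages fit together, the sketch below builds a document-feature matrix with the core package and then summarises it with a textstat_*() function from quanteda.textstats. It assumes both packages are installed; the two toy documents are hypothetical and not taken from the package.

```r
library(quanteda)            # core: tokens(), dfm()
library(quanteda.textstats)  # textstat_*() functions, split out in v3

# Hypothetical toy documents, used only for illustration
txt <- c(d1 = "a b b c", d2 = "b c c d")

# Tokenise and build a document-feature matrix with the core package
dfmat <- dfm(tokens(txt))

# Feature frequencies come from the separate textstats package
freq <- textstat_frequency(dfmat)
print(freq)
```

The same pattern should apply to the textmodel_*() and textplot_*() functions: load the corresponding package alongside quanteda before calling them.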

We are working on additional package releases, available in the meantime from our GitHub pages:

  • quanteda.sentiment: Functions and lexicons for sentiment analysis using dictionaries
  • quanteda.tidy: Extensions for manipulating document variables in core quanteda objects using your favourite tidyverse functions

and more to come.

How To…

How to Install

The normal way is from CRAN, using your R GUI or the install.packages() function.
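From the R console, the CRAN install would typically be a single call to base R's install.packages(), shown here as a minimal sketch:

```r
# Install the released version of quanteda from CRAN
install.packages("quanteda")
```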


Or for the latest development version:

# devtools package required to install quanteda from GitHub

Because this compiles some C++ and Fortran source code, you will need to have installed the appropriate compilers to build the development version.
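The development install described above might look like the following. This is a sketch that assumes the devtools package is used and that the source lives in the quanteda/quanteda repository on GitHub; adjust the repository path if it differs.

```r
# Install devtools first if it is not already available
if (!requireNamespace("devtools", quietly = TRUE)) {
  install.packages("devtools")
}

# Build and install the development version from GitHub
# (assumed repository path: quanteda/quanteda)
devtools::install_github("quanteda/quanteda")
```

If the build fails at the compilation step, the usual cause is a missing C++ or Fortran toolchain.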

How to Use

See the quick start guide to learn how to use quanteda.
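For orientation before reading the guide, here is a small sketch of a common workflow: build a corpus, tokenise it, and construct a document-feature matrix. The example texts are hypothetical, and the guide remains the authoritative reference for recommended usage.

```r
library(quanteda)

# Hypothetical example texts, not shipped with the package
txt <- c(doc1 = "The quick brown fox jumps over the lazy dog.",
         doc2 = "A quick analysis of textual data with quanteda.")

corp  <- corpus(txt)                          # create a corpus object
toks  <- tokens(corp, remove_punct = TRUE)    # tokenise, dropping punctuation
toks  <- tokens_remove(toks, stopwords("en")) # remove English stopwords
dfmat <- dfm(toks)                            # document-feature matrix

# Most frequent features across both documents
topfeatures(dfmat, 5)
```

Each step takes the output of the previous one, so intermediate objects such as toks can be inspected or modified before the matrix is built.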

How to Get Help

How to Cite

Benoit, Kenneth, Kohei Watanabe, Haiyan Wang, Paul Nulty, Adam Obeng, Stefan Müller, and Akitaka Matsuo. (2018) “quanteda: An R package for the quantitative analysis of textual data”. Journal of Open Source Software. 3(30), 774.

For a BibTeX entry, use the output from citation(package = "quanteda").
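As a small sketch, the citation object can also be converted explicitly with toBibtex() from base R's utils package, assuming quanteda is installed:

```r
# Retrieve the citation information bundled with the installed package
cit <- citation(package = "quanteda")

# Print the entry in BibTeX format
toBibtex(cit)
```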

How to Leave Feedback

If you like quanteda, please consider leaving feedback or a testimonial here.

How to Contribute

Contributions in the form of feedback, comments, code, and bug reports are most welcome. How to contribute: