diff --git a/CRAN-RELEASE b/CRAN-RELEASE new file mode 100644 index 0000000000..d864ce22b7 --- /dev/null +++ b/CRAN-RELEASE @@ -0,0 +1,2 @@ +This package was submitted to CRAN on 2021-08-17. +Once it is accepted, delete this file and tag the release (commit a425cb3c7). diff --git a/docs/404.html b/docs/404.html index ebf0e7a296..2ba83ca6ea 100644 --- a/docs/404.html +++ b/docs/404.html @@ -52,8 +52,6 @@ - - @@ -75,19 +73,12 @@ - - - - - -
- - - - +
+ + + @@ -239,11 +228,11 @@

Contents

@@ -269,8 +258,6 @@

Contents

- - diff --git a/docs/CONDUCT.html b/docs/CONDUCT.html index 07b225a32f..075641106c 100644 --- a/docs/CONDUCT.html +++ b/docs/CONDUCT.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
@@ -194,7 +186,7 @@
@@ -244,11 +236,11 @@

Contents

@@ -274,8 +266,6 @@

Contents

- - diff --git a/docs/LICENSE-text.html b/docs/LICENSE-text.html index f2019f8d2a..11cad8dfd2 100644 --- a/docs/LICENSE-text.html +++ b/docs/LICENSE-text.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
@@ -194,7 +186,7 @@
-

This article compares quanteda to alternative R packages for quantitative text analysis (tm, tidytext, corpus, and koRpus) and the Natural Language Toolkit for Python. If a function is available in another package, we provide the respective command.

+

This article compares quanteda to alternative R packages for quantitative text analysis (tm, tidytext, corpus, and koRpus) and the Natural Language Toolkit for Python. If a function is available in another package, we provide the respective command.

Note that we have used the package manuals for the comparison. If we have overlooked certain functions, please let us know - either by editing the table and issuing a Pull Request or by contacting the maintainer.

- +
@@ -469,13 +467,11 @@

Stefan Müller, Kenneth Benoit, and Kohei Watan
-

-

Site built with pkgdown 1.6.1.9000.

+

Site built with pkgdown 1.6.1.

diff --git a/docs/articles/pkgdown/comparison_files/header-attrs-2.10/header-attrs.js b/docs/articles/pkgdown/comparison_files/header-attrs-2.10/header-attrs.js new file mode 100644 index 0000000000..dd57d92e02 --- /dev/null +++ b/docs/articles/pkgdown/comparison_files/header-attrs-2.10/header-attrs.js @@ -0,0 +1,12 @@ +// Pandoc 2.9 adds attributes on both header and div. We remove the former (to +// be compatible with the behavior of Pandoc < 2.8). +document.addEventListener('DOMContentLoaded', function(e) { + var hs = document.querySelectorAll("div.section[class*='level'] > :first-child"); + var i, h, a; + for (i = 0; i < hs.length; i++) { + h = hs[i]; + if (!/^h[1-6]$/i.test(h.tagName)) continue; // it should be a header h1-h6 + a = h.attributes; + while (a.length > 0) h.removeAttribute(a[0].name); + } +}); diff --git a/docs/articles/pkgdown/design.html b/docs/articles/pkgdown/design.html index 309528330f..df2f314d1b 100644 --- a/docs/articles/pkgdown/design.html +++ b/docs/articles/pkgdown/design.html @@ -28,8 +28,6 @@ - -
+
@@ -190,8 +188,8 @@

  • object_verb: a function that inputs an object of class object, and returns a a modified object class object. There are no exceptions to this naming rule, so that even functions that operate on character objects following this convention, such as char_tolower(). (Ok, so there is a slight exception: we abbreviated character to char!)

  • data_class_descriptor: data objects are named this way to clearly distinguish them and to make them easy to identify in the index. The first part identifies them as data, the second names their object class, and the third component is a descriptor. Example: data_corpus_inaugural is the quanteda corpus() class object consisting of the US presidents’ inaugural addresses.

  • textgeneral_specific: functions that input a quanteda object and return the result of an analysis, as a new type of object. Only the underscored functions that begin with text break the previous rule about the first part of the name identifying the object class that is input and output. Examples: textstat_readability() takes a character or corpus as input, and returns a data.frame; textplot_xray() takes a kwic object as input, and generates a dispersion plot (named “x-ray” because of its similarity to the plot produced by Kindle).

  • -
  • Extensions of R functions: These are commonly used R functions, such as head(), that are also defined for quanteda objects. Examples: head.dfm(), coercion functions such as as.list.tokens, and Boolean class type checking functions such as is.dfm(). Many post-estimation methods defined for lm objects, for instance predict(), are also defined for most textmodel objects

  • -
  • R-like functions. These are functions for quanteda objects that follow naming conventions and functionality that should be very familiar to users of R. Example: ndoc() returns the number of documents in a corpus, tokens, or dfm object, similar to base::nrow(). Note that like nrow(), ndoc() is not plural. Other examples include docnames() and featnames() – similar to rownames() and colnames().

  • +
  • Extensions of R functions: These are commonly used R functions, such as head(), that are also defined for quanteda objects. Examples: head.dfm(), coercion functions such as as.list.tokens, and Boolean class type checking functions such as is.dfm(). Many post-estimation methods defined for lm objects, for instance predict(), are also defined for most textmodel objects

  • +
  • R-like functions. These are functions for quanteda objects that follow naming conventions and functionality that should be very familiar to users of R. Example: ndoc() returns the number of documents in a corpus, tokens, or dfm object, similar to base::nrow(). Note that like nrow(), ndoc() is not plural. Other examples include docnames() and featnames() – similar to rownames() and colnames().

  • Grammatical exceptions: Every language has these, usually due to path dependency from historical development, and quanteda is no exception. The list, however, is short:

      @@ -210,7 +208,7 @@

      Constructors for core data types

      -

      The quanteda package consists of a few core data types, created by calling constructors with identical names. These are all “nouns” in the sense of declaring what they construct. This follows very similar R behaviour in many of the core R objects, such as data.frame(), list(), etc.

      +

      The quanteda package consists of a few core data types, created by calling constructors with identical names. These are all “nouns” in the sense of declaring what they construct. This follows very similar R behaviour in many of the core R objects, such as data.frame(), list(), etc.

      Core object types and their constructor functions:

      • @@ -238,7 +236,7 @@

        Advantages

        -

        In our view, the advantages of this clarity outweigh whatever advantages might be found from overloading a generic function. The functions corpus_sample(), tokens_sample(), and dfm_sample(), for instance, are clearer to understand and read from a package’s function index, than the previously overloaded version of sample() that could be dispatched on a corpus, tokenized text, or dfm object. Additionally, in the case of sample(), we avoid the namespace “conflict” caused by redefining the function as a generic, so that it could be overloaded. Our new, more specific naming conventions therefore reduce the likelihood of namespace conflicts with other packages.

        +

        In our view, the advantages of this clarity outweigh whatever advantages might be found from overloading a generic function. The functions corpus_sample(), tokens_sample(), and dfm_sample(), for instance, are clearer to understand and read from a package’s function index, than the previously overloaded version of sample() that could be dispatched on a corpus, tokenized text, or dfm object. Additionally, in the case of sample(), we avoid the namespace “conflict” caused by redefining the function as a generic, so that it could be overloaded. Our new, more specific naming conventions therefore reduce the likelihood of namespace conflicts with other packages.

        @@ -250,12 +248,12 @@

        Extensions of core R functions

        -

        Many simple base R functions – simpler at least than the example of sample() cited above – are still extended to quanteda objects through overloading. The logic of allowing is that these functions, e.g. cbind() for a dfm, are very simple and very common, and therefore are well-known to users. Furthermore, they can operate in only one fashion on the object for which they are defined, such as cbind() combining two dfm objects by joining columns. Similar functions extended in this way include print(), head(), tail(), and t(). Most of these functions are so natural that their documentation is not included in the package index.

        +

        Many simple base R functions – simpler at least than the example of sample() cited above – are still extended to quanteda objects through overloading. The logic of allowing is that these functions, e.g. cbind() for a dfm, are very simple and very common, and therefore are well-known to users. Furthermore, they can operate in only one fashion on the object for which they are defined, such as cbind() combining two dfm objects by joining columns. Similar functions extended in this way include print(), head(), tail(), and t(). Most of these functions are so natural that their documentation is not included in the package index.

        Additions to core R(-like) functions

        -

        Additional functions have been defined for quanteda objects that are very similar to simple base R functions, but are not named using the class_action format because they do not return a modified object of the same class. These follow as closely as possible the naming conventions found in the base R functions that are similar. For instance, docnames() and featnames() return the document names of various quanteda objects, in the same way that rownames() does for matrix-like objects (a matrix, data.frame, data.table, etc.). The abbreviation of featnames() is intentionally modeled on colnames(). Likewise, ndoc() returns the number of documents, using the singular form similar to nrow() and ncol().

        +

        Additional functions have been defined for quanteda objects that are very similar to simple base R functions, but are not named using the class_action format because they do not return a modified object of the same class. These follow as closely as possible the naming conventions found in the base R functions that are similar. For instance, docnames() and featnames() return the document names of various quanteda objects, in the same way that rownames() does for matrix-like objects (a matrix, data.frame, data.table, etc.). The abbreviation of featnames() is intentionally modeled on colnames(). Likewise, ndoc() returns the number of documents, using the singular form similar to nrow() and ncol().

        @@ -339,7 +337,7 @@

      • From a dfm – after dfm() on the processed document and features.

      • @@ -357,7 +355,7 @@

        char_tolower() %>% tokens() %>% tokens_wordstem() %>% - tokens_remove(stopwords("english")) %>% + tokens_remove(stopwords("english")) %>% dfm()

      We recognize however that not all sequences will make sense, for instance tokens_wordstem() will only work after tokenization, and will try to catch these errors and make the proper sequence clear to users.

  • @@ -375,13 +373,11 @@

    -

    -

    Site built with pkgdown 1.6.1.9000.

    +

    Site built with pkgdown 1.6.1.

    diff --git a/docs/articles/pkgdown/design_files/header-attrs-2.10/header-attrs.js b/docs/articles/pkgdown/design_files/header-attrs-2.10/header-attrs.js new file mode 100644 index 0000000000..dd57d92e02 --- /dev/null +++ b/docs/articles/pkgdown/design_files/header-attrs-2.10/header-attrs.js @@ -0,0 +1,12 @@ +// Pandoc 2.9 adds attributes on both header and div. We remove the former (to +// be compatible with the behavior of Pandoc < 2.8). +document.addEventListener('DOMContentLoaded', function(e) { + var hs = document.querySelectorAll("div.section[class*='level'] > :first-child"); + var i, h, a; + for (i = 0; i < hs.length; i++) { + h = hs[i]; + if (!/^h[1-6]$/i.test(h.tagName)) continue; // it should be a header h1-h6 + a = h.attributes; + while (a.length > 0) h.removeAttribute(a[0].name); + } +}); diff --git a/docs/authors.html b/docs/authors.html index 2648f48a0d..85dc04192d 100644 --- a/docs/authors.html +++ b/docs/authors.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
    @@ -194,7 +186,7 @@
    -
    • Kenneth Benoit. Maintainer, author, copyright holder. @@ -285,7 +276,7 @@

      Authors


      ERC-2011-StG 283794-QUANTESS

    - +

    @@ -294,11 +285,11 @@

    Authors

    -

    Site built with pkgdown 1.6.1.9000.

    +

    Site built with pkgdown 1.6.1.

    @@ -324,8 +315,6 @@

    Authors

    - - diff --git a/docs/extra.css b/docs/extra.css index 6838592738..4365fb9cfc 100644 --- a/docs/extra.css +++ b/docs/extra.css @@ -69,17 +69,17 @@ div.contents h1 { div.contents h2 { font-size: 150%; - padding-top: 70px; + padding-top: 60px; } div.contents h3 { font-size: 120%; - padding-top: 60px; + padding-top: 50px; } div.contents h4 { font-size: 100%; - padding-top: 60px; + padding-top: 50px; } pre { @@ -87,6 +87,3 @@ pre { font-size: 80%; line-height: 150%; } - - - diff --git a/docs/index.html b/docs/index.html index 9fc9154f6f..75bbfdacf0 100644 --- a/docs/index.html +++ b/docs/index.html @@ -34,8 +34,6 @@ - -

      -
    • Fixed a problem when applying purrr::map() to dfm() (#928).
    • +
    • Fixed a problem when applying purrr::map() to dfm() (#928).
    • Added documentation for regex2fixed() and associated functions.
    • Fixed a bug in textstat_collocations.tokens() caused by “documents” containing only "" as tokens. (#940)
    • Fixed a bug caused by cbind.dfm() when features shared a name starting with quanteda_options("base_featname") (#946)
    • @@ -258,7 +250,7 @@

    • summary.corpus() now generates a special data.frame, which has its own print method, rather than requiring verbose = FALSE to suppress output (#926).
    • -textstat_collocations() is now multi-threaded.
    • +textstat_collocations() is now multi-threaded.
    • head.dfm(), tail.dfm() now behave consistently with base R methods for matrix, with the added argument nfeature. Previously, these methods printed the subset and invisibly returned it. Now, they simply return the subset. (#952)
    • Dictionary keys are now unique, and if multiple, identical keys are defined for a dictionary when constructed, the values will be merged into the consolidated key. (#959)
    • @@ -274,12 +266,12 @@

      New Features

      • Added methods for changing the docnames of tokens and dfm objects (#987).
      • -
      • Added new function textmodel_lsa() for Latent Semantic Analysis.
      • +
      • Added new function textmodel_lsa() for Latent Semantic Analysis.
      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • The computation of tfidf has been more thoroughly described in the documentation for this function (#997).
      • Fixed a bug discovered in #1011 for unused keys in tokens_lookup(..., exclusive = FALSE).
      • @@ -299,15 +291,15 @@

      • tokens_replace() now allows token types to be substituted directly and quickly.
      • -textmodel_affinity() now adds functionality to fit the Perry and Benoit (2017) class affinity model.
      • +textmodel_affinity() now adds functionality to fit the Perry and Benoit (2017) class affinity model.
      • Added a spacy_parse method for corpus objects. Also restored quanteda methods for spacyr spacy_parsed objects.
      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
        -
      • Improved documentation for textmodel_nb() (#1010), and made output quantities from the fitted NB model regular matrix objects instead of Matrix classes.
      • +
      • Improved documentation for textmodel_nb() (#1010), and made output quantities from the fitted NB model regular matrix objects instead of Matrix classes.
      @@ -320,7 +312,7 @@

    • The deprecated “list of characters” tokenize() function and all methods associated with the tokenizedTexts object types have been removed.
    • Added convenience functions for keeping tokens or features: tokens_keep(), dfm_keep(), and fcm_keep(). (#1037)
    • -textmodel_NB() has been replaced by textmodel_nb().
    • +textmodel_NB() has been replaced by textmodel_nb().

    @@ -337,11 +329,11 @@

    Contents

    -

    Site built with pkgdown 1.6.1.9000.

    +

    Site built with pkgdown 1.6.1.

    @@ -367,8 +359,6 @@

    Contents

    - - diff --git a/docs/news/news-1.0.html b/docs/news/news-1.0.html index d00c1a8d18..88db92c379 100644 --- a/docs/news/news-1.0.html +++ b/docs/news/news-1.0.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
    @@ -194,7 +186,7 @@
    -
    +

    -Bug fixes and stability enhancements

    +Bug fixes and stability enhancements
    @@ -263,7 +255,7 @@

    textmodel_wordshoal() has been removed, and relocated to a new package (wordshoal).
  • The generic wrapper function textmodel(), which used to be a gateway to specific textmodel_*() functions, has been removed.
  • (Most of) the textmodel_*() have been reimplemented to make their behaviour consistent with the lm/glm() families of models, including especially how the predict, summary, and coef methods work (#1007, #108).
  • -
  • The GitHub home for the repository has been moved to https://github.com/quanteda/quanteda.
  • +
  • The GitHub home for the repository has been moved to https://github.com/quanteda/quanteda.
  • @@ -280,11 +272,11 @@

    Contents

    -

    Site built with pkgdown 1.6.1.9000.

    +

    Site built with pkgdown 1.6.1.

    @@ -310,8 +302,6 @@

    Contents

    - - diff --git a/docs/news/news-1.1.html b/docs/news/news-1.1.html index a1e755f45e..a7576f75f6 100644 --- a/docs/news/news-1.1.html +++ b/docs/news/news-1.1.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
    @@ -194,7 +186,7 @@
    -
    +

    -Bug fixes and stability enhancements

    +Bug fixes and stability enhancements
      -
    • Fixed a problem in the examples for textplot_scale1d() by adjusting the refscores for data_corpus_irishbudget2010.
    • +
    • Fixed a problem in the examples for textplot_scale1d() by adjusting the refscores for data_corpus_irishbudget2010.
    • Eliminated unnecessary dependency on the digest package.
    • Updated the vignette title to be less generic.
    • Improved the robustness of dfm_trim() and dfm_weight() for previously weighted dfm objects and when supplied thresholds are proportions instead of counts. (#1237)
    • Fixed a problem in summary.corpus(x, n = 101) when ndoc(x) > 100 (#1242).
    • -
    • Fixed a problem in predict.textmodel_wordscores(x, rescaling = "mv") that always reset the reference values for rescaling to the first and second documents (#1251).
    • -
    • Issues in the color generation and labels for textplot_keyness() are now resolved (#1233, #1233).
    • +
    • Fixed a problem in predict.textmodel_wordscores(x, rescaling = "mv") that always reset the reference values for rescaling to the first and second documents (#1251).
    • +
    • Issues in the color generation and labels for textplot_keyness() are now resolved (#1233, #1233).
    @@ -260,7 +252,7 @@

    Behaviour changes

      -
    • Changed the default in textmodel_wordfish() to sparse = FALSE, in response to #1216.
    • +
    • Changed the default in textmodel_wordfish() to sparse = FALSE, in response to #1216.
    • dfm_group() now preserves docvars that are constant for the group aggregation (#1228).
    • The default threads is now 2, to comply with CRAN policies. (The user can increase this via quanteda_options(threads = ...).
    • @@ -280,11 +272,11 @@

      Contents

      -

      Site built with pkgdown 1.6.1.9000.

      +

      Site built with pkgdown 1.6.1.

      @@ -310,8 +302,6 @@

      Contents

      - - diff --git a/docs/news/news-1.2.html b/docs/news/news-1.2.html index 444e52392f..3cfb4b80db 100644 --- a/docs/news/news-1.2.html +++ b/docs/news/news-1.2.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
      @@ -194,7 +186,7 @@
      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • Fix bug in nsyllable() that incorrectly handled cased words, and returned wrong names with use.names = TRUE. (#1282)
      • Fix the overwriting of summary.character() caused by previous import of the network package namespace. (#1285)
      • @@ -251,7 +243,7 @@

      • dfm_trim() now takes more options, and these are implemented more consistently. min_termfreq and max_termfreq have replaced min_count and max_count, and these can be modified using a termfreq_type argument. (Similar options are implemented for docfreq_type.) Solves #1253, #1254.
      • -textstat_simil() and textstat_dist() now take valid dfm indexes for the relevant margin for the selection argument. Previously, this could also be a direct vector or matrix for comparison, but this is no longer allowed. Solves #1266.
      • +textstat_simil() and textstat_dist() now take valid dfm indexes for the relevant margin for the selection argument. Previously, this could also be a direct vector or matrix for comparison, but this is no longer allowed. Solves #1266.
      • Improved performance for dfm_group() (#1295).
      @@ -269,11 +261,11 @@

      Contents

      -

      Site built with pkgdown 1.6.1.9000.

      +

      Site built with pkgdown 1.6.1.

      @@ -299,8 +291,6 @@

      Contents

      - - diff --git a/docs/news/news-1.3.html b/docs/news/news-1.3.html index 3cf0ad2f5d..d91f1d2777 100644 --- a/docs/news/news-1.3.html +++ b/docs/news/news-1.3.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
      @@ -194,7 +186,7 @@

      Bug fixes

        -
      • Fixed a bug in textmodel_affinity() that caused failure when the input dfm had been compiled with tolower = FALSE. (#1338)
      • +
      • Fixed a bug in textmodel_affinity() that caused failure when the input dfm had been compiled with tolower = FALSE. (#1338)
      • Fixed a bug affecting tokens_lookup() and dfm_lookup() when nomatch is used. (#1347)
      • Fixed a problem whereby NA texts created a “document” (or tokens) containing "NA" (#1372)
      @@ -257,9 +249,9 @@

      quanteda 1.3.4 2018-07-15

      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • Keep encodings of types when a tokens object is recompiled. (#1387)
      • More robust handling in predict.textmodel_worscores() when training and test feature sets are difference (#1380).
        @@ -278,7 +270,7 @@

        New Features

          -
        • Added as.igraph.fcm() method for converting an fcm object into an igraph graph object.
        • +
        • Added as.igraph.fcm() method for converting an fcm object into an igraph graph object.
        • Added a case_insensitive argument to char_segment() and corpus_segment().
      @@ -287,27 +279,27 @@

      quanteda 1.3.13 2018-11-01

      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • Fixed a bug causing incorrect counting in fcm(x, ordered = TRUE). (#1413) Also set the condition that window can be of size 1 (formerly the limit was 2 or greater).
      • Fixed deprecation warnings from adding a dfm as docvars, and this now imports the feature names as docvar names automatically. (related to #1417)
      • Fixed behaviour from tokens(x, what = "fasterword", remove_separators = TRUE) so that it correctly splits words separated by \n and \t characters. (#1420)
      • Add error checking for functions taking dfm inputs in case a dfm has empty features (#1419).
      • -
      • For textstat_readability(), fixed a bug in Dale-Chall-based measures and in the Spache word list measure. These were caused by an incorrect lookup mechanism but also by limited implementation of the wordlists. The new wordlists include all of the variations called for in the original measures, but using fast fixed matching. (#1410)
      • -
      • Fixed problems with basic dfm operations (rowMeans(), rowSums(), colMeans(), colSums()) caused by not having access to the Matrix package methods. (#1428)
      • -
      • Fixed problem in textplot_scale1d() when input a predicted wordscores object with se.fit = TRUE (#1440).
      • -
      • Improved the stability of textplot_network(). (#1460)
      • +
      • For textstat_readability(), fixed a bug in Dale-Chall-based measures and in the Spache word list measure. These were caused by an incorrect lookup mechanism but also by limited implementation of the wordlists. The new wordlists include all of the variations called for in the original measures, but using fast fixed matching. (#1410)
      • +
      • Fixed problems with basic dfm operations (rowMeans(), rowSums(), colMeans(), colSums()) caused by not having access to the Matrix package methods. (#1428)
      • +
      • Fixed problem in textplot_scale1d() when input a predicted wordscores object with se.fit = TRUE (#1440).
      • +
      • Improved the stability of textplot_network(). (#1460)

      New Features

        -
      • Added new argument intermediate to textstat_readability(x, measure, intermediate = FALSE), which if TRUE returns intermediate quantities used in the computation of readability statistics. Useful for verification or direct use of the intermediate quantities.
      • +
      • Added new argument intermediate to textstat_readability(x, measure, intermediate = FALSE), which if TRUE returns intermediate quantities used in the computation of readability statistics. Useful for verification or direct use of the intermediate quantities.
      • Added a new separator argument to kwic() to allow a user to define which characters will be added between tokens returned from a keywords in context search. (#1449)
      • -
      • Reimplemented textstat_dist() and textstat_simil() in C++ for enhanced performance. (#1210)
      • +
      • Reimplemented textstat_dist() and textstat_simil() in C++ for enhanced performance. (#1210)
      • Added a tokens_sample() function (#1478).
      @@ -315,8 +307,8 @@

      Behaviour changes

        -
      • Removed the Hamming distance method from textstat_dist() (#1443), based on the reasoning in #1442.
      • -
      • Removed the “chisquared” and “chisquared2” distance measures from textstat_simil(). (#1442)
      • +
      • Removed the Hamming distance method from textstat_dist() (#1443), based on the reasoning in #1442.
      • +
      • Removed the “chisquared” and “chisquared2” distance measures from textstat_simil(). (#1442)
      @@ -324,20 +316,20 @@

      quanteda 1.3.14 2018-11-19

      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
        -
      • Improved the robustness of textstat_keyness() (#1482).
      • +
      • Improved the robustness of textstat_keyness() (#1482).
      • Improved the accuracy of sparsity reporting for the print method of a dfm (#1473).
      • -
      • Diagonals on a textstat_simil() return object coerced to matrix now default to 1.0, rather than 0.0 (#1494).
      • +
      • Diagonals on a textstat_simil() return object coerced to matrix now default to 1.0, rather than 0.0 (#1494).

      New Features

        -
      • Added the following measures to textstat_lexdiv(): Yule’s K, Simpson’s D, and Herdan’s Vm.
      • +
      • Added the following measures to textstat_lexdiv(): Yule’s K, Simpson’s D, and Herdan’s Vm.
      @@ -354,11 +346,11 @@

      Contents

      -

      Site built with pkgdown 1.6.1.9000.

      +

      Site built with pkgdown 1.6.1.

      @@ -384,8 +376,6 @@

      Contents

      - - diff --git a/docs/news/news-1.4.html b/docs/news/news-1.4.html index 8926eb0232..8ebeb0fa6e 100644 --- a/docs/news/news-1.4.html +++ b/docs/news/news-1.4.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
      @@ -194,7 +186,7 @@
      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      @@ -288,7 +280,7 @@

      New features

        -
      • Added Yule’s I to textstat_lexdiv().
      • +
      • Added Yule’s I to textstat_lexdiv().
      • Added forward compatibility for newer (v2) corpus class objects.
      • Added a new function featfreq() to compute the overall feature frequencies from a dfm.
      @@ -315,11 +307,11 @@

      Contents

      -

      Site built with pkgdown 1.6.1.9000.

      +

      Site built with pkgdown 1.6.1.

      @@ -345,8 +337,6 @@

      Contents

      - - diff --git a/docs/news/news-2.0.html b/docs/news/news-2.0.html index dbaccd4627..f8ffb78d7e 100644 --- a/docs/news/news-2.0.html +++ b/docs/news/news-2.0.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
      @@ -194,7 +186,7 @@

      Behaviour changes

        -
      • Added a force = TRUE option and error checking for the situations of applying dfm_weight() or dfm_group() to a dfm that has already been weighted. (#1545) The function textstat_frequency() now allows passing this argument to dfm_group() via .... (#1646)
      • +
      • Added a force = TRUE option and error checking for the situations of applying dfm_weight() or dfm_group() to a dfm that has already been weighted. (#1545) The function textstat_frequency() now allows passing this argument to dfm_group() via .... (#1646)
      • -textstat_frequency() now has a new argument for resolving ties when ranking term frequencies, defaulting to the “min” method. (#1634)
      • -
      • New docvars accessor and replacement functions are available for corpus, tokens, and dfm objects via $. (See Index Operators for Core Objects above.)
      • +textstat_frequency() now has a new argument for resolving ties when ranking term frequencies, defaulting to the “min” method. (#1634) +
      • New docvars accessor and replacement functions are available for corpus, tokens, and dfm objects via $. (See Index Operators for Core Objects above.)
      • -textstat_entropy() now produces a data.frame that is more consistent with other textstat methods. (#1690)
      • +textstat_entropy() now produces a data.frame that is more consistent with other textstat methods. (#1690)
      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • docnames now enforced to be character (formerly, could be numeric for some objects).
      • docnames are now enforced to be strictly unique for all object classes.
      • @@ -320,13 +312,13 @@

        Changes

        • Moved data_corpus_irishbudget2010 and data_corpus_dailnoconf1991 to the quanteda.textmodels package.
        • -
        • Em dashes and double dashes between words, whether surrounded by a space or not, are now converted to " - " to distinguish them from infix hyphens. (#1889)
        • +
        • Em dashes and double dashes between words, whether surrounded by a space or not, are now converted to ” - ” to distinguish them from infix hyphens. (#1889)
        • Verbose output for dfm and tokens creation is now corrected and more consistent. (#1894)
      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • Number removal is now both improved and fixed (#1909).
      • Fixed an issue causing CRAN errors in pre-v4, related to the new default of stringsAsFactors = FALSE for data.frame objects.
      • @@ -349,11 +341,11 @@

        Contents

        -

        Site built with pkgdown 1.6.1.9000.

        +

        Site built with pkgdown 1.6.1.

        @@ -379,8 +371,6 @@

        Contents

        - - diff --git a/docs/news/news-2.1.html b/docs/news/news-2.1.html index e0f1dbd865..c238c1131b 100644 --- a/docs/news/news-2.1.html +++ b/docs/news/news-2.1.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
        @@ -194,7 +186,7 @@
        -
        +

        -Bug fixes and stability enhancements

        +Bug fixes and stability enhancements
          -
        • Stopped returning NA for non-existent features when n > nfeat(x) in textstat_frequency(x, n). (#1929)
        • +
        • Stopped returning NA for non-existent features when n > nfeat(x) in textstat_frequency(x, n). (#1929)
        • Fixed a problem in dfm_lookup() and tokens_lookup() in which an error was caused when no dictionary key returned a single match (#1946).
        • Fixed a bug that caused a textstat_simil/dist object converted to a data.frame to drop its document2 labels (#1939).
        • Fixed a bug causing dfm_match() to fail on a dfm that included “pads” (""). (#1960)
        • Updated the data_dfm_lbgexample object using more modern dfm internals.
        • -
        • Updates textstat_readability(), textstat_lexdiv(), and nscrabble() so that empty texts are not dropped in the result. (#1976)
        • +
        • Updates textstat_readability(), textstat_lexdiv(), and nscrabble() so that empty texts are not dropped in the result. (#1976)
        @@ -265,13 +257,13 @@

        Changes

      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • corpus_reshape() now allows reshaping back to documents even when segmented texts were of zero length. (#1978)
      • -
      • Special handling applied for Solaris to some issues breaking on that build, relating to the cacheing in summary.corpus()/textstat_summary().
      • +
      • Special handling applied for Solaris to some issues breaking on that build, relating to the cacheing in summary.corpus()/textstat_summary().
      @@ -284,12 +276,12 @@

      Changes

      • -textstat_keyness() performance is now improved through implementation in (multi-threaded) C++.
      • +textstat_keyness() performance is now improved through implementation in (multi-threaded) C++.
      -
      +

      -Bug fixes and stability enhancements

      +Bug fixes and stability enhancements
      • Fixes breaking tests and examples on Solaris platform as well as other changes introduced by changes to the stringi package.
      @@ -308,11 +300,11 @@

      Contents

      -

      Site built with pkgdown 1.6.1.9000.

      +

      Site built with pkgdown 1.6.1.

      @@ -338,8 +330,6 @@

      Contents

      - - diff --git a/docs/news/news-3.0.html b/docs/news/news-3.0.html index f76331f33c..11ee437860 100644 --- a/docs/news/news-3.0.html +++ b/docs/news/news-3.0.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
      @@ -194,7 +186,7 @@ +
    • +

      texts() and texts<- are deprecated.

      + +
    @@ -289,7 +291,7 @@

    • all methods for defunct corpuszip objects.
    • -View() functions
    • +View() functions
    • as.wfm() and as.DocumentTermMatrix() (the same functionality is available via convert())
    • @@ -310,16 +312,20 @@

  • dictionary_edit(), char_edit(), and list_edit() are removed.

  • +
  • dfm_weight() - formerly deprecated "scheme" options are now removed.

  • +
  • tokens() - formerly deprecated options remove_hyphens and remove_twitter are now removed. (Use split_hyphens instead, and the default tokenizer always now preserves Twitter and other social media tags.)

  • +
  • Special versions of head() and tail() for corpus, dfm, and fcm objects are now removed, since the base methods work fine for these objects. The main consequence was the removal of the nf option from the methods for dfm and fcm objects, which limited the number of features. This can be accomplished using the index operator [ instead, or for printing, by specifying print(x, max_nfeat = 6L) (for instance).

  • -
    +

    -Bug fixes and stability enhancements

    +Bug fixes and stability enhancements
    • Fixed a bug causing topfeatures(x, group = something) to fail with weighted dfms (#2032).

    • kwic() is more stable and does not crash when a vector is supplied as the window argument (#2008).

    • Allow use of multi-threading with more than two threads by fixing quanteda_options().

    • Mentions of the now-removed ngrams option in dfm(x, ...) has now been removed from the dfm documentation. (#1990)

    • +
    • Handling for some early-cycle v2 dfm object is improved, to ensure that they are updated to the latest object format. (#2097)

    @@ -336,11 +342,11 @@

    Contents

    -

    Site built with pkgdown 1.6.1.9000.

    +

    Site built with pkgdown 1.6.1.

    @@ -366,8 +372,6 @@

    Contents

    - - diff --git a/docs/news/news-3.1.html b/docs/news/news-3.1.html new file mode 100644 index 0000000000..b76537b3aa --- /dev/null +++ b/docs/news/news-3.1.html @@ -0,0 +1,294 @@ + + + + + + + + +Version 3.1 • quanteda + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    +
    + + + + +
    + +
    +
    + + +
    +

    +quanteda 3.1 Unreleased +

    +
    +

    +Bug fixes and stability enhancements

    +
      +
    • Improved and more consistent handling of empty corpus, tokens and dfm objects, to address #2110.
    • +
    • +rbind.dfm() now preserves docvars (#2109).
    • +
    • Document name for Biden’s 2021 Inaugural Address in data_corpus_inaugural is now consistent with all other documents.
    • +
    • Fix #2127 that caused subsetting to change document names.
    • +
    +
    +
    +

    +Changes and additions

    + +
    +
    +

    +Deprecations

    + +
    +
    +
    + + + +
    + + +
    + + +
    +

    Site built with pkgdown 1.6.1.

    +
    + +
    +
    + + + + + + + + + + + diff --git a/docs/news/news-dev.html b/docs/news/news-dev.html index dc3f3b2ab6..b28198804d 100644 --- a/docs/news/news-dev.html +++ b/docs/news/news-dev.html @@ -52,8 +52,6 @@ - - @@ -75,15 +73,9 @@ - - - - - -
    @@ -194,7 +186,7 @@
    - @@ -676,7 +668,7 @@

    - + @@ -685,12 +677,12 @@

    - - @@ -728,7 +720,7 @@

  • Added Greek stopwords. (See #282).
  • -
  • Added index handling [, [[, and $ for (hashed) tokens objects.
    +
  • Added index handling [, [[, and $ for (hashed) tokens objects.
  • Now using ggplot2.
  • Added tokens methods for collocations() and kwic().
  • @@ -762,14 +754,14 @@

    docvars() on eligible objects that contain no docvars now returns an empty 0 x 0 data.frame (in the spirit of #242).
  • Redesigned textmodel_scale1d now produces sorted and grouped document positions for fitted wordfish models, and produces a ggplot2 plot object.
  • -textmodel_wordfish() now preserves sparsity while processing the dfm, and uses a fast approximation to an SVD to get starting values. This also dramatically improves performance in computing this model. (#482, #124)
  • +textmodel_wordfish() now preserves sparsity while processing the dfm, and uses a fast approximation to an SVD to get starting values. This also dramatically improves performance in computing this model. (#482, #124)
  • The speed of kwic() is now dramatically improved, and also returns an indexed set of tokens that makes subsequent commands on a kwic class object much faster. (#603)
  • Package options (for verbose, threads) can now be set or queried using quanteda_options().
  • Improved performance and better documentation for corpus_segment(). (#634)
  • Added functions corpus_trimsentences() and char_trimsentences() to remove sentences from a corpus or character object, based on token length or pattern matching.
  • -
  • Added options to textstat_readability(): min_sentence_length and max_sentence_length. (#632)
  • -
  • Indexing now works for dictionaries, for slicing out keys and values ([), or accessing values directly ([[). (#651)
  • -
  • Began the consolidation of collocation detection and scoring into a new function textstat_collocations(), which combines the existing collocations() and sequences() functions. (#434) Collocations now behave as sequences for other functions (such as tokens_compound()) and have a greatly improved performance for such uses.
  • +
  • Added options to textstat_readability(): min_sentence_length and max_sentence_length. (#632)
  • +
  • Indexing now works for dictionaries, for slicing out keys and values ([), or accessing values directly ([[). (#651)
  • +
  • Began the consolidation of collocation detection and scoring into a new function textstat_collocations(), which combines the existing collocations() and sequences() functions. (#434) Collocations now behave as sequences for other functions (such as tokens_compound()) and have a greatly improved performance for such uses.
  • @@ -781,11 +773,11 @@

  • metadoc() now returns a vector instead of a data.frame for a single variable, similar to docvars()
  • -
  • Most verbose options now take the default from getOption("verbose") rather than fixing the value in the function signatures. (#577)
  • +
  • Most verbose options now take the default from getOption("verbose") rather than fixing the value in the function signatures. (#577)
  • -textstat_dist() and textstat_simil() now return a matrix if a selection argument is supplied, and coercion to a list produces a list of distances or similarities only for that selection.
  • +textstat_dist() and textstat_simil() now return a matrix if a selection argument is supplied, and coercion to a list produces a list of distances or similarities only for that selection.
  • All remaining camelCase arguments are gone. For commonly used ones, such as those in tokens(), the old arguments (e.g. removePunct) still produce the same behaviour but with a deprecation warning.
  • -
  • Added n_target and n_reference columns to textstat_keyness() to return counts for each category being compared for keyness.
  • +
  • Added n_target and n_reference columns to textstat_keyness() to return counts for each category being compared for keyness.
  • @@ -794,7 +786,7 @@

    • Fixed an problem in tokens generation for some irregular characters (#554).
    • Fixed a problem in setting the parallel thread size on single-core machines (#556).
    • -
    • Fixed problems for str() on a corpus with no docvars (#571).
    • +
    • Fixed problems for str() on a corpus with no docvars (#571).
    • removeURL in tokens() now removes URLs where the first part of the URL is a single letter (#587).
    • @@ -814,7 +806,7 @@

      New features

        -
      • Corpus construction using corpus() now works for a tm::SimpleCorpus object. (#680)
      • +
      • Corpus construction using corpus() now works for a tm::SimpleCorpus object. (#680)
      • Added corpus_trim() and char_trim() functions for selecting documents or subsets of documents based on sentence, paragraph, or document lengths.
      • Conversion of a dfm to an stm object now passes docvars through in the $meta of the return object.
      • New dfm_group(x, groups = ) command, a convenience wrapper around dfm.dfm(x, groups = ) (#725).
      • @@ -822,17 +814,17 @@

      • Corpus constructor methods for data.frame objects now conform to the “text interchange format” for corpus data.frames, automatically recognizing doc_id and text fields, which also provides interoperability with the readtext package. corpus construction methods are now more explicitly tailored to input object classes.

    -
    +

    -Bug fixes and stability enhancements

    +Bug fixes and stability enhancements

    • dfm_lookup() behaves more robustly on different platforms, especially for keys whose values match no features (#704).
    • -textstat_simil() and textstat_dist() no longer take the n argument, as this was not sorting features in correct order.
    • -
    • Fixed failure of tokens(x, what = "character") when x included Twitter characters @ and # (#637).
    • +textstat_simil() and textstat_dist() no longer take the n argument, as this was not sorting features in correct order. +
    • Fixed failure of tokens(x, what = "character") when x included Twitter characters @ and # (#637).
    • Fixed bug #707 where ntype.dfm() produced an incorrect result.
    • -
    • Fixed bug #706 where textstat_readability() and textstat_lexdiv() for single-document returns when drop = TRUE.
    • +
    • Fixed bug #706 where textstat_readability() and textstat_lexdiv() for single-document returns when drop = TRUE.
    • Improved the robustness of corpus_reshape().
    • print, and head, and tail methods for dfm are more robust (#684).
    • @@ -843,12 +835,12 @@

    • fcm_compress() now retains the fcm class, and generates and error when an asymmetric compression is attempted (#728).
    • -textstat_collocations() now returns the collocations as character, not as a factor (#736)
    • +textstat_collocations() now returns the collocations as character, not as a factor (#736)
    • Fixed a bug in dfm_lookup(x, exclusive = FALSE) wherein an empty dfm ws returned with there was no no match (#116).
    • Argument passing through dfm() to tokens() is now robust, and preserves variables defined in the calling environment (#721).
    • -
    • Fixed issues related to dictionaries failing when applying str(), names(), or other indexing operations, which started happening on Linux and Windows platforms following the CRAN move to 3.4.0. (#744)
    • +
    • Fixed issues related to dictionaries failing when applying str(), names(), or other indexing operations, which started happening on Linux and Windows platforms following the CRAN move to 3.4.0. (#744)
    • Dictionary import using the LIWC format is more robust to improperly formatted input files (#685).
    • -
    • Weights applied using dfm_weight() now print friendlier error messages when the weight vector contains features not found in the dfm. See this Stack Overflow question for the use case that sparked this improvement.
    • +
    • Weights applied using dfm_weight() now print friendlier error messages when the weight vector contains features not found in the dfm. See this Stack Overflow question for the use case that sparked this improvement.
    @@ -860,7 +852,7 @@

    New features

      -
    • Improvements and consolidation of methods for detecting multi-word expressions, now active only through textstat_collocations(), which computes only the lambda method for now, but does so accurately and efficiently. (#753, #803). This function is still under development and likely to change further.
    • +
    • Improvements and consolidation of methods for detecting multi-word expressions, now active only through textstat_collocations(), which computes only the lambda method for now, but does so accurately and efficiently. (#753, #803). This function is still under development and likely to change further.
    • Added new quanteda_options that affect the maximum documents and features displayed by the dfm print method (#756).
    • ngram formation is now significantly faster, including with skips (skipgrams).
    • @@ -873,12 +865,12 @@

    • New wrapper phrase() converts whitespace-separated multi-word patterns into a list of patterns. This affects the feature/pattern matching in tokens/dfm_select/remove, tokens_compound, tokens/dfm_lookup, and kwic. phrase() and the associated changes also make the behaviour of using character vectors, lists of characters, dictionaries, and collocation objects for pattern matches far more consistent. (See #820, #787, #740, #837, #836, #838)
    • corpus.Corpus() for creating a corpus from a tm Corpus now works with more complex objects that include document-level variables, such as data from the manifestoR package (#849).
    • -
    • New plot function textplot_keyness() plots term “keyness”, the association of words with contrasting classes as measured by textstat_keyness().
    • +
    • New plot function textplot_keyness() plots term “keyness”, the association of words with contrasting classes as measured by textstat_keyness().
    • Added corpus constructor for corpus objects (#690).
    • Added dictionary constructor for dictionary objects (#690).
    • Added a tokens constructor for tokens objects (#690), including updates to tokens() that improve the consistency and efficiency of the tokenization.
    • -
    • Added new quanteda_options(): language_stemmer and language_stopwords, now used for default in *_wordstem functions and stopwords() for defaults, respectively. Also uses this option in dfm() when stem = TRUE, rather than hard-wiring in the “english” stemmer (#386).
    • -
    • Added a new function textstat_frequency() to compile feature frequencies, possibly by groups. (#825)
    • +
    • Added new quanteda_options(): language_stemmer and language_stopwords, now used for default in *_wordstem functions and stopwords() for defaults, respectively. Also uses this option in dfm() when stem = TRUE, rather than hard-wiring in the “english” stemmer (#386).
    • +
    • Added a new function textstat_frequency() to compile feature frequencies, possibly by groups. (#825)
    • Added nomatch option to tokens_lookup() and dfm_lookup(), to provide tokens or feature counts for categories not matched to any dictionary key. (#496)
    @@ -886,7 +878,7 @@

    Behaviour changes

      -
    • The functions sequences() and collocations() have been removed and replaced by textstat_collocations().
    • +
    • The functions sequences() and collocations() have been removed and replaced by textstat_collocations().
    • (Finally) we added “will” to the list of English stopwords (#818).
    • dfm objects with one or both dimensions having zero length, and empty kwic objects now display more appropriately in their print methods (per #811).
    • @@ -899,11 +891,11 @@

    • Improved cbind.dfm() function allows cbinding vectors, matrixes, and (recyclable) scalars to dfm objects.
    -
    +

    -Bug fixes and stability enhancements

    +Bug fixes and stability enhancements
      -
    • For the underlying methods behind textstat_collocations(), we corrected the word matching, and lambda and z calculation methods, which were slightly incorrect before. We also removed the chi2, G2, and pmi statistics, because these were incorrectly calculated for size > 2.
      +
    • For the underlying methods behind textstat_collocations(), we corrected the word matching, and lambda and z calculation methods, which were slightly incorrect before. We also removed the chi2, G2, and pmi statistics, because these were incorrectly calculated for size > 2.
    • LIWC-formatted dictionary import now robust to assignment to term assignment to missing categories.
    • @@ -911,7 +903,7 @@

    • Separators including rare spacing characters are now handled more robustly by the remove_separators argument in tokens(). See #796.
    • Improved memory usage when computing ntoken() and ntype(). (#795)
    • Improvements to quanteda_options() now does not throw an error when quanteda functions are called directly without attaching the package. In addition, quanteda options can be set now in .Rprofile and will not be overwritten when the options initialization takes place when attaching the package.
    • -
    • Fixed a bug in textstat_readability() that wrongly computed the number of words with fewer than 3 syllables in a text; this affected the FOG.NRI and the Linsear.Write measures only.
    • +
    • Fixed a bug in textstat_readability() that wrongly computed the number of words with fewer than 3 syllables in a text; this affected the FOG.NRI and the Linsear.Write measures only.
    • Fixed mistakes in the computation of two docfreq schemes: "logave" and "inverseprob".
    • Fixed a bug in the handling of multi-thread options where the settings using quanteda_options() did not actually set the number of threads. In addition, we fixed a bug causing threading to be turned off on macOS (due to a check for a gcc version that is not used for compiling the macOS binaries) prevented multi-threading from being used at all on that platform.
    • Fixed a bug causing failure when functions that use quanteda_options() are called without the namespace or package being attached or loaded (#864).
    • @@ -932,11 +924,11 @@

      Contents

      -

      Site built with pkgdown 1.6.1.9000.

      +

      Site built with pkgdown 1.6.1.

      @@ -962,8 +954,6 @@

      Contents

      - - diff --git a/docs/pkgdown.yml b/docs/pkgdown.yml index 4142a23f79..202235ff9b 100644 --- a/docs/pkgdown.yml +++ b/docs/pkgdown.yml @@ -1,6 +1,6 @@ -pandoc: 2.11.2 -pkgdown: 1.6.1.9000 -pkgdown_sha: ae4332d48aedc46be2b532b26661a9bc6b7244fc +pandoc: 2.14.0.1 +pkgdown: 1.6.1 +pkgdown_sha: ~ articles: pkgdown/comparison: comparison.html pkgdown/design: design.html @@ -19,7 +19,7 @@ articles: pkgdown/replication/qss: qss.html pkgdown/replication/text2vec: text2vec.html quickstart: quickstart.html -last_built: 2021-04-06T10:01Z +last_built: 2021-08-17T16:37Z urls: reference: https://quanteda.io/reference article: https://quanteda.io/articles diff --git a/docs/reference/as.character.corpus.html b/docs/reference/as.character.corpus.html index ec51e390d9..d992f098dd 100644 --- a/docs/reference/as.character.corpus.html +++ b/docs/reference/as.character.corpus.html @@ -54,8 +54,6 @@ - - @@ -77,15 +75,9 @@ - - - - - -
      @@ -196,7 +188,7 @@
    - +
    data_char_stopwords .stopwords(used by stopwords() +(used by stopwords()
    encodedTextFiles.zipmoved to the readtext packagemoved to the readtext package
    describeTexts
    textfilemoved to package readtext +moved to package readtext
    encodedTextsmoved to package readtext, as data_char_encodedtexts +moved to package readtext, as data_char_encodedtexts
    check.names

    logical. If TRUE then the names of the - variables in the data frame are checked to ensure that they are - syntactically valid variable names and are not duplicated. - If necessary they are adjusted (by make.names) - so that they are.

    logical; passed to the data.frame() call.

    @@ -290,11 +278,11 @@

    Contents

    -

    Site built with pkgdown 1.6.1.9000.

    +

    Site built with pkgdown 1.6.1.

    @@ -320,8 +308,6 @@

    Contents

    - - diff --git a/docs/reference/as.dfm.html b/docs/reference/as.dfm.html index 5c5ecdab70..ba546572eb 100644 --- a/docs/reference/as.dfm.html +++ b/docs/reference/as.dfm.html @@ -57,8 +57,6 @@ - - @@ -80,15 +78,9 @@ - - - - - -
    @@ -199,7 +191,7 @@