Clarification on arg order

juliasilge · Sep 5, 2023 · add0b80 · add0b80
1 parent cb434c8
commit add0b80
Show file tree

Hide file tree

Showing 5 changed files with 11 additions and 10 deletions.
diff --git a/DESCRIPTION b/DESCRIPTION
@@ -74,4 +74,4 @@ Config/testthat/edition: 3
 Encoding: UTF-8
 LazyData: TRUE
 Roxygen: list(markdown = TRUE)
-RoxygenNote: 7.2.2
+RoxygenNote: 7.2.3
diff --git a/R/unnest_tokens.R b/R/unnest_tokens.R
@@ -70,13 +70,13 @@
 #' d
 #'
 #' d %>%
-#'   unnest_tokens(word, txt)
+#'   unnest_tokens(output = word, input = txt)
 #'
 #' d %>%
-#'   unnest_tokens(sentence, txt, token = "sentences")
+#'   unnest_tokens(output = sentence, input = txt, token = "sentences")
 #'
 #' d %>%
-#'   unnest_tokens(ngram, txt, token = "ngrams", n = 2)
+#'   unnest_tokens(output = ngram, input = txt, token = "ngrams", n = 2)
 #'
 #' d %>%
 #'   unnest_tokens(chapter, txt, token = "regex", pattern = "Chapter [\\\\d]")

diff --git a/man/tidytext-package.Rd b/man/tidytext-package.Rd
diff --git a/man/unnest_tokens.Rd b/man/unnest_tokens.Rd
diff --git a/vignettes/tidytext.Rmd b/vignettes/tidytext.Rmd
@@ -49,12 +49,12 @@ original_books <- austen_books() %>%
 original_books
 ```
 
-To work with this as a tidy dataset, we need to restructure it as **one-token-per-row** format. The `unnest_tokens` function is a way to convert a dataframe with a text column to be one-token-per-row:
+To work with this as a tidy dataset, we need to restructure it as **one-token-per-row** format. The `unnest_tokens` function is a way to convert a dataframe with a text column to be one-token-per-row. Here let's tokenize to a new `word` column from the existing `text` column:
 
 ```{r}
 library(tidytext)
 tidy_books <- original_books %>%
-  unnest_tokens(word, text)
+  unnest_tokens(output = word, input = text)
 
 tidy_books
 ```
@@ -188,7 +188,7 @@ is a sad sentence, not a happy one, because of negation. The [Stanford CoreNLP](
 
 ```{r}
 PandP_sentences <- tibble(text = prideprejudice) %>% 
-  unnest_tokens(sentence, text, token = "sentences")
+  unnest_tokens(output = sentence, input = text, token = "sentences")
 ```
 
 Let's look at just one.