r-lib · hadley · Aug 1, 2025 · Jul 30, 2025 · Jul 31, 2025 · Jul 31, 2025
diff --git a/NEWS.md b/NEWS.md
@@ -1,5 +1,6 @@
 # testthat (development version)
 
+* `vignette("custom-expectations)` has been overhauled to make it much clearer how to create high-quality expectations (#2113, #2132, #2072).
 * `expect_snapshot()` and friends will now fail when creating a new snapshot on CI. This is usually a signal that you've forgotten to run it locally before committing (#1461).
 * `expect_snapshot_value()` can now handle expressions that generate `-` (#1678) or zero length atomic vectors (#2042).
 * `expect_matches()` failures should be a little easier to read (#2135).

diff --git a/R/expect-that.R b/R/expect-that.R
@@ -2,25 +2,37 @@
 #'
 #' @description
 #' These are the primitives that you can use to implement your own expectations.
-#' Every branch of code inside an expectation must call either `pass()` or
-#' `fail()`; learn more in `vignette("custom-expectation")`.
+#' Regardless of how it's called an expectation should either return `pass()`,
+#' `fail()`, or throw an error (if for example, the arguments are invalid).
 #'
-#' @param message a string to display.
+#' Learn more about creating your own expectations in
+#' `vignette("custom-expectation")`.
+#'
+#' @param message Failure message to send to the user. It's best practice to
+#'   describe both what is expected and what was actually received.
 #' @param info Character vector continuing additional information. Included
 #'   for backward compatibility only and new expectations should not use it.
 #' @param srcref Location of the failure. Should only needed to be explicitly
 #'   supplied when you need to forward a srcref captured elsewhere.
+#' @param trace_env If `trace` is not specified, this is used to generate an
+#'   informative traceack for failures. You should only need to set this if
+#'   you're calling `fail()` from a helper function; see
+#'   `vignette("custom-expectation")` for details.
 #' @param trace An optional backtrace created by [rlang::trace_back()].
 #'   When supplied, the expectation is displayed with the backtrace.
-#' @param trace_env If `is.null(trace)`, this is used to automatically
-#'   generate a traceback running from `test_code()`/`test_file()` to
-#'   `trace_env`. You'll generally only need to set this if you're wrapping
-#'   an expectation inside another function.
+#'   Expert use only.
 #' @export
 #' @examples
-#' \dontrun{
-#' test_that("this test fails", fail())
-#' test_that("this test succeeds", succeed())
+#' expect_length <- function(object, n) {
+#'   act <- quasi_label(rlang::enquo(object), arg = "object")
+#'
+#'   act_n <- length(act$val)
+#'   if (act_n != n) {
+#'     msg <- sprintf("%s has length %i, not length %i.", act$lab, act_n, n)
+#'     return(fail(msg))
+#'   }
+#'
+#'   pass(act$val)
 #' }
 fail <- function(
   message = "Failure has been forced",
@@ -53,7 +65,7 @@ pass <- function(value) {
 #' Mark a test as successful
 #'
 #' This is an older version of [pass()] that exists for backwards compatibility.
-#' You should now use `pass()` instead`
+#' You should now use `pass()` instead.
 #'
 #' @export
 #' @inheritParams fail

diff --git a/man/expect.Rd b/man/expect.Rd
diff --git a/man/expectation.Rd b/man/expectation.Rd
diff --git a/man/fail.Rd b/man/fail.Rd
diff --git a/man/succeed.Rd b/man/succeed.Rd
diff --git a/vignettes/custom-expectation.Rmd b/vignettes/custom-expectation.Rmd
@@ -11,59 +11,99 @@ vignette: >
 library(testthat)
 knitr::opts_chunk$set(collapse = TRUE, comment = "#>")
 
-# Pretend we're snapsotting
+# Pretend we're snapshotting
 snapper <- local_snapshotter(fail_on_new = FALSE)
 snapper$start_file("snapshotting.Rmd", "test")
 ```
 
-This vignette shows you how to write your expectations that work identically to the built-in `expect_` functions. 
-
-You can use these either locally by putting them in a helper file, or export them from your package.
+This vignette shows you how to write your own expectations. You can use them within your package by putting them in a helper file, or share them with others by exporting them from your package.
 
 ## Expectation basics
 
-There are three main parts to writing an expectation, as illustrated by `expect_length()`:
+An expectation has three main parts, as illustrated by `expect_length()`:
 
 ```{r}
 expect_length <- function(object, n) {  
   # 1. Capture object and label
   act <- quasi_label(rlang::enquo(object))
 
-  # 2. Verify the expectations
+  # 2. Check if expectations are violated
   act_n <- length(act$val)
   if (act_n != n) {
     msg <- sprintf("%s has length %i, not length %i.", act$lab, act_n, n)
     return(fail(msg))
   }
 
-  # 3. Pass
+  # 3. Pass when expectations are met
   pass(act$val)
 }
 ```
 
-### Capture value and label
+The first step in any expectation is to use `quasi_label()` to capture a "labelled value", i.e. a list that contains both the value (`$val`) for testing and a label (`$lab`) for messaging. This is a pattern that exists for fairly esoteric reasons; you don't need to understand it, just copy and paste it 🙂.
 
-The first step in any expectation is to capture the actual object, and generate a label for it to use if a failure occur. All testthat expectations support quasiquotation so that you can unquote variables. This makes it easier to generate good labels when the expectation is called from a function or within a for loop.
+Next you need to check each way that `object` could violate the expectation. In this case, there's only one check, but in more complicated cases there can be multiple checks. In most cases, it's easier to check for violations one by one, using early returns to `fail()`. This makes it easier to write informative failure messages that state both what the object is and what you expected.
 
-By convention, the first argument to every `expect_` function is called `object`, and you capture its value (`val`) and label (`lab`) with `act <- quasi_label(enquo(object))`, where `act` is short for actual (in constrast to expected).
+Also note that you need to use `return(fail())` here. You won't see the problem when interactively testing your function because when run outside of `test_that()`, `fail()` throws an error, causing the function to terminate early. When running inside of `test_that()`, however, `fail()` does not stop execution because we want to collect all failures in a given test.  
 
-### Verify the expectation
+Finally, if the object is as expected, call `pass()` with `act$val`. Returning the input value is good practice since expectation functions are called primarily for their side-effects (triggering a failure). This allows expectations to be chained:
 
-Now we can check if our expectation is met and return `fail()` if not. The most challenging job here is typically generating the error message because you want it to be as self-contained as possible. This means it should typically give both the expected and actual value, along with the name of the object passed to the expectation. testthat expectations use `sprintf()`, but if you're familiar with {glue}, you might want to use that instead.
+```{r}
+mtcars |>
+  expect_type("list") |>
+  expect_s3_class("data.frame") |> 
+  expect_length(11)
+```
+
+### Testing your expectations
 
-More complicated expectations will have more `if` statements. For example, we might want to make our `expect_length()` function include an assertion that `object` is a vector:
+Once you've written your expectation, you need to test it, and luckily testthat comes with three expectations designed specifically to test expectations:
+
+* `expect_success()` checks that your expectation emits exactly one success and zero failures.
+* `expect_failure()` checks that your expectation emits exactly one failure and zero successes. 
+* `expect_failure_snapshot()` captures the failure message in a snapshot, making it easier to review if it's useful or not.
+
+The first two expectations are particularly important because they ensure that your expectation reports the correct number of successes and failures to the user.
+
+```{r}
+test_that("expect_length works as expected", {
+  x <- 1:10
+  expect_success(expect_length(x, 10))
+  expect_failure(expect_length(x, 11))
+})
+
+test_that("expect_length gives useful feedback", {
+  x <- 1:10
+  expect_snapshot_failure(expect_length(x, 11))
+})
+```
+
+## Examples
+
+The following sections show you a few more variations, loosely based on existing testthat expectations.
+
+### `expect_vector_length()`
+
+Let's make `expect_length()` a bit more strict by also checking that the input is a vector. R is a bit weird in that it gives a length to pretty much every object, and you can imagine not wanting this code to succeed:
+
+```{r}
+expect_length(mean, 1)
+```
+
+To do this we'll add an extra check that the input is either an atomic vector or a list:
 
 ```{r}
 expect_vector_length <- function(object, n) {  
-  act <- quasi_label(rlang::enquo(object), arg = "object")
+  act <- quasi_label(rlang::enquo(object))
 
-  if (!is.atomic(act$val) || !is.list(act$val)) {
+  # It's non-trivial to check if an object is a vector in base R so we
+  # use an rlang helper
+  if (!rlang::is_vector(act$val)) {
     msg <- sprintf("%s is a %s, not a vector", act$lab, typeof(act$val))
     return(fail(msg))
   }
 
   act_n <- length(act$val)
-  if (act$n != n) {
+  if (act_n != n) {
     msg <- sprintf("%s has length %i, not length %i.", act$lab, act_n, n)
     return(fail(msg))
   }
@@ -72,33 +112,91 @@ expect_vector_length <- function(object, n) {
 }
 ```
 
-Note that it's really important to `return(fail())` here. You wont see the problem when interactively testing your function because when run outside of `test_that()`, `fail()` throws an error, causing the function to terminate early. When running inside of `test_that()` however, `fail()` does not stop execution because we want to collect all failures in a given test.
+```{r}
+#| error: true
+expect_vector_length(mean, 1)
+expect_vector_length(mtcars, 15)
+```
 
-### Pass the test
+### `expect_s3_class()`
 
-If no assertions fail, call `pass()` with the input value (usually `act$val`). Returning the input value is good practice since expectation functions are called primarily for their side-effects (triggering a failure). This allows expectations to be chained:
+Or imagine if you're checking to see if an object inherits from an S3 class. In R, there's no direct way to tell if an object is an S3 object: you can confirm that it's an object, then that it's not an S4 object. So you might organize your expectation this way:
 
 ```{r}
-mtcars |>
-  expect_type("list") |>
-  expect_s3_class("data.frame") |> 
-  expect_length(11)
+expect_s3_class <- function(object, class) {
+  if (!rlang::is_string(class)) {
+    rlang::abort("`class` must be a string.")
+  }
+
+  act <- quasi_label(rlang::enquo(object))
+
+  if (!is.object(act$val)) {
+    return(fail(sprintf("%s is not an object.", act$lab)))
+  }
+
+  if (isS4(act$val)) {
+    return(fail(sprintf("%s is an S4 object, not an S3 object.", act$lab)))
+  }
+
+  if (!inherits(act$val, class)) {
+    msg <- sprintf(
+      "%s inherits from %s not %s.",
+      act$lab,
+      paste0(class(object), collapse = "/"),
+      paste0(class, collapse = "/")
+    )
+    return(fail(msg))
+  }
+
+  pass(act$val)
+}
 ```
 
-## Testing your expectations
+```{r}
+#| error: true
+x1 <- 1:10
+TestClass <- methods::setClass("Test", contains = "integer")
+x2 <- TestClass()
+x3 <- factor()
+
+expect_s3_class(x1, "integer")
+expect_s3_class(x2, "integer")
+expect_s3_class(x3, "integer")
+expect_s3_class(x3, "factor")
+```
 
-testthat comes with three expectations designed specifically to test expectations: `expect_success()` and `expect_failure()`:
+Note that I also check that the `class` argument must be a string. This is an error, not a failure, because it suggests you're using the function incorrectly.
 
-* `expect_success()` checks that your expectation emits exactly one success and zero failures.
-* `expect_failure()` checks that your expectation emits exactly one failure and zero successes. 
-* `expect_failure_snapshot()` captures the failure message in a snapshot, making it easier to review if it's useful or not.
+```{r}
+#| error: true
+expect_s3_class(x1, 1)
+```
+
+## Repeated code
+
+As you write more expectations, you might discover repeated code that you want to extract out into a helper. Unfortunately, creating helper functions is not straightforward in testthat because every `fail()` captures the calling environment in order to give maximally useful tracebacks. Because getting this right is not critical (you'll just get a slightly suboptimal traceback in the case of failure), we don't recommend bothering. However, we document it here because it's important to get it right in testthat itself.
+
+The key challenge is that `fail()` captures a `trace_env` which should be the execution environment of the expectation. This usually works, because the default value of `trace_env` is `caller_env()`. But when you introduce a helper, you'll need to explicitly pass it along:
 
 ```{r}
-test_that("expect_length works as expected", {
-  x <- 1:10
-  expect_success(expect_length(x, 10))
-  expect_failure(expect_length(x, 11))
+expect_length_ <- function(act, n, trace_env = caller_env()) {
+  act_n <- length(act$val)
+  if (act_n != n) {
+    msg <- sprintf("%s has length %i, not length %i.", act$lab, act_n, n)
+    return(fail(msg, trace_env = trace_env))
+  }
 
-  expect_snapshot_failure(expect_length(x, 11))
-})
+  pass(act$val)
+}
+
+expect_length <- function(object, n) {  
+  act <- quasi_label(rlang::enquo(object))
+  expect_length_(act, n)
+}
 ```
+
+A few recommendations:
+
+* The helper shouldn't be user facing, so we give it a `_` suffix to make that clear. 
+* It's typically easiest for a helper to take the labelled value produced by `quasi_label()`.
+* Your helper should usually call both `fail()` and `pass()` and be returned from the wrapping expectation.