Duplicates

Cghlewis edited this page Oct 2, 2023 · 12 revisions

This section covers working with duplicate cases in your data. In education research, duplicates can occur for reasons such as a participant completing a survey twice. When duplicates exist, there are often two things we want to do:

  1. Find and remove duplicates.
  2. Count duplicates in your data.

Review and remove duplicates


Main functions used in examples

Package Functions
janitor get_dupes()
dplyr distinct()
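
As a minimal sketch of how these two functions might be used together, the example below first reviews duplicates with get_dupes() and then removes them with distinct(). The data frame `survey` and its columns (`stu_id`, `svy_date`, `score`) are hypothetical and made up for illustration only.

```r
library(dplyr)
library(janitor)

# Hypothetical survey data: participant 102 submitted twice
survey <- tibble(
  stu_id = c(101, 102, 102, 103),
  svy_date = as.Date(c("2023-09-01", "2023-09-01", "2023-09-05", "2023-09-02")),
  score = c(10, 12, 15, 9)
)

# Review: return only the rows whose stu_id appears more than once.
# get_dupes() adds a dupe_count column showing how many times each id occurs.
dupes <- survey %>%
  get_dupes(stu_id)

# Remove: keep the first row found for each stu_id.
# .keep_all = TRUE retains the other columns instead of returning stu_id alone.
survey_dedup <- survey %>%
  distinct(stu_id, .keep_all = TRUE)
```

Note that distinct() keeps whichever duplicate row appears first, so the row order of the data determines which record survives. Reviewing the output of get_dupes() before removing anything helps confirm that the right record is being kept.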

Other functions used in examples

Package Functions
dplyr arrange(); pick()
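
Because distinct() keeps the first row it encounters, sorting beforehand lets you control which duplicate is retained. The sketch below assumes a rule of keeping each participant's earliest submission; that rule, the `survey` data frame, and its column names are all hypothetical. It also assumes dplyr 1.1.0 or later, where pick() is available.

```r
library(dplyr)

# Hypothetical survey data, deliberately out of order:
# participant 102's later submission appears first
survey <- tibble(
  stu_id = c(102, 101, 102, 103),
  svy_date = as.Date(c("2023-09-05", "2023-09-01", "2023-09-01", "2023-09-02")),
  score = c(15, 10, 12, 9)
)

survey_first <- survey %>%
  # Sort by id, then by date, so the earliest submission comes first within each id.
  # pick() selects the sorting columns using tidyselect syntax.
  arrange(pick(stu_id, svy_date)) %>%
  # Keep the first (earliest) row per stu_id
  distinct(stu_id, .keep_all = TRUE)
```

Which record to keep (earliest, latest, most complete) is a project decision; the sort step is where that decision would be encoded.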

Resources