By my assessment, it meets rules 1-3 of the first definition, so I would call it tidy. Under the second definition, however, it meets rules 1-2 but violates rule 3, because it contains variables for three different observational units:
Individual involved in a collision event:
PERSONNMB -- Unique numeric sequence value for each person associated with a collision and a vehicle.
GENDERCDE -- Code indicating person's gender.
Vehicle involved in a collision event:
UNIT_MR_NUMBER -- Unique numeric sequence value for each vehicle in a collision.
VEHMAKETXT -- Description indicating the name of the manufacturer of the vehicle.
Collision event:
INDIVIDUAL_MR_RECORD -- Unique identifier for each collision.
INJUREDNMB -- Total number of people injured in the collision.
Does that mean the dataset isn't tidy?
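To make the question concrete, here is a minimal sketch of what satisfying rule 3 of the second definition might look like: the flat table is split into one table per observational unit. It uses plain Python rather than tidyr. The column names come from the dataset, but the row values, the `distinct` helper, and the choice of key columns for each table are assumptions made for illustration.

```python
# Hypothetical flat rows: one row per person, with vehicle-level and
# collision-level values repeated. The values are invented for illustration.
flat_rows = [
    {"INDIVIDUAL_MR_RECORD": 1, "INJUREDNMB": 2, "UNIT_MR_NUMBER": 1,
     "VEHMAKETXT": "FORD", "PERSONNMB": 1, "GENDERCDE": "F"},
    {"INDIVIDUAL_MR_RECORD": 1, "INJUREDNMB": 2, "UNIT_MR_NUMBER": 1,
     "VEHMAKETXT": "FORD", "PERSONNMB": 2, "GENDERCDE": "M"},
    {"INDIVIDUAL_MR_RECORD": 1, "INJUREDNMB": 2, "UNIT_MR_NUMBER": 2,
     "VEHMAKETXT": "HONDA", "PERSONNMB": 1, "GENDERCDE": "M"},
    {"INDIVIDUAL_MR_RECORD": 2, "INJUREDNMB": 0, "UNIT_MR_NUMBER": 1,
     "VEHMAKETXT": "TOYOTA", "PERSONNMB": 1, "GENDERCDE": "F"},
]


def distinct(rows, columns):
    """Project rows onto the given columns and drop duplicate rows,
    keeping first-seen order."""
    seen = {}
    for row in rows:
        key = tuple(row[c] for c in columns)
        seen.setdefault(key, dict(zip(columns, key)))
    return list(seen.values())


# One table per observational unit. The key columns are an assumption
# based on the variable descriptions above.
collisions = distinct(flat_rows, ["INDIVIDUAL_MR_RECORD", "INJUREDNMB"])
vehicles = distinct(
    flat_rows, ["INDIVIDUAL_MR_RECORD", "UNIT_MR_NUMBER", "VEHMAKETXT"]
)
people = distinct(
    flat_rows,
    ["INDIVIDUAL_MR_RECORD", "UNIT_MR_NUMBER", "PERSONNMB", "GENDERCDE"],
)

print(len(collisions), len(vehicles), len(people))
```

In the flat form, `INJUREDNMB` and `VEHMAKETXT` are repeated on every person row. In the split form, each value is stored once, at the level of the unit it describes.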
On the tidyr website index, the GitHub README, and R4DS, tidy data is defined as:
Then in the Tidy data article derived from vignettes/tidy-data.Rmd, it's defined as:
Using these definitions changes how I label this motor vehicle collision dataset.