---
title: A descriptive dataset title
author: Your Name
date: Today's Date (e.g. February 28, 2023)
description: A one- or two-sentence description of the data and the question to be answered using it. If possible, give a brief summary of the statistical situation, such as indicating if this is a randomized experiment, observational study, large classification task, etc.
categories:
  - list the relevant
  - statistical methods
  - that can be used
  - with this dataset
  - one per line
  - with two spaces and a hyphen in front
data:
  year: Year dataset was collected/produced (YYYY)
  files: name-of-data-file.csv
---

## Motivation

The categories above determine how this dataset is listed in the [table of all
datasets](https://cmustatistics.github.io/data-repository/by-method.html). Consult
that page for a list of statistical categories already used by other datasets.

In this first section, describe the source of the dataset and what it's about.
Give any necessary background about it and the research question. See other
datasets on the website for examples.

This file is Markdown, so you *can* use formatting; [here is a guide to the
basics](https://quarto.org/docs/authoring/markdown-basics.html).

## Data

Describe the data. What does each row represent? How many rows are there? If
there is missingness, say how it is coded and why it is present.
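
The row count and the extent of missingness can be checked quickly in R before you write this section. The following is only a minimal sketch, not part of this repository's tooling: the column names, the values, and the `-99` missing-value code are all hypothetical, and a small temporary CSV stands in for your real data file.

```r
# Hypothetical stand-in for your data file; replace with the path to the real CSV.
csv_path <- tempfile(fileext = ".csv")
writeLines(c(
  "site,temperature_c,count",
  "A,21.5,12",
  "B,-99,7",
  "C,19.0,NA"
), csv_path)

# Treat both "NA" and the (assumed) sentinel value "-99" as missing when reading.
dat <- read.csv(csv_path, na.strings = c("NA", "-99"))

n_rows <- nrow(dat)               # number of rows (observations)
n_missing <- colSums(is.na(dat))  # number of missing values in each column

print(n_rows)
print(n_missing)
```

If your data uses a different missing-value code, list it in `na.strings` instead, and state that code in the description above.
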

### Data preview

```{r, echo=FALSE, results="asis"}
source("../preview_dataset.R")
preview_datasets()
```

### Variable descriptions

| Variable | Description |
|----|-------------|
| Column name | A description of this variable, including units when possible |
| Column name 2 | A description of this variable, including units when possible |

If there are multiple data files, repeat the table for each data file.
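
For datasets with many columns, you may prefer to generate the skeleton of this table rather than type it by hand. The sketch below is one possible approach, using hypothetical column names; in practice you would take them from `names()` of your own data frame and then fill in each description manually.

```r
# Hypothetical column names; in practice, use names() on your loaded data frame.
vars <- c("site", "temperature_c", "count")

# Build a Markdown pipe table with one row per variable and an empty
# description cell to fill in by hand.
table_lines <- c(
  "| Variable | Description |",
  "|----|-------------|",
  paste0("| ", vars, " | |")
)

# Print the table so it can be pasted into this file.
cat(table_lines, sep = "\n")
```
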

## Questions

If possible, list some analyses or research questions that could be answered
with this dataset, or the types of questions you'd ask students. This is mainly
meant to give other instructors ideas for how they could use the data.

## References

Give references to the original source here, such as by pasting the journal
citation:

Longcore, Aldern, Eggers, Flores, Franco, Hirshfield-Yamanishi, Petrinec, Yan,
and Barroso. “Tuning the white light spectrum of light emitting diode lamps to
reduce attraction of nocturnal arthropods”. *Philosophical Transactions of the
Royal Society B* 370: 20140125. <https://doi.org/10.1098/rstb.2014.0125>

If the dataset has a particular license, mention the license here. For example,
"Data available under the Creative Commons Attribution license."