## Definition of data

Data are values of qualitative or quantitative variables, belonging to a set of items.

[http://en.wikipedia.org/wiki/Data](http://en.wikipedia.org/wiki/Data)

---

## Definition of data

Data are values of qualitative or quantitative variables, belonging to a set of items.

[http://en.wikipedia.org/wiki/Data](http://en.wikipedia.org/wiki/Data)

__Set of items__: Sometimes called the population; the set of objects you are interested in.

---

## Definition of data

Data are values of qualitative or quantitative variables, belonging to a set of items.

[http://en.wikipedia.org/wiki/Data](http://en.wikipedia.org/wiki/Data)

__Variables__: A measurement or characteristic of an item.

---

## Definition of data

Data are values of qualitative or quantitative variables, belonging to a set of items.

[http://en.wikipedia.org/wiki/Data](http://en.wikipedia.org/wiki/Data)

__Qualitative__: Country of origin, sex, treatment

__Quantitative__: Height, weight, blood pressure
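As a rough illustration of the definition, here is a minimal Python sketch of a set of items where each item carries both kinds of variables. The field names, units, and values are made up for illustration, and the `variable_kind` helper is a hypothetical rule of thumb (numeric means quantitative), not a general classification method.

```python
from dataclasses import dataclass


@dataclass
class Item:
    # Qualitative variables: labels or categories
    country_of_origin: str
    sex: str
    treatment: str
    # Quantitative variables: numeric measurements (units are assumed)
    height_cm: float
    weight_kg: float
    systolic_bp_mmhg: float


# The "set of items" (population); every value here is invented.
population = [
    Item("Canada", "F", "placebo", 165.0, 60.5, 118.0),
    Item("Japan", "M", "drug", 172.5, 70.0, 125.0),
]


def variable_kind(value):
    """Hypothetical helper: treat numbers as quantitative, everything else as qualitative."""
    is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
    return "quantitative" if is_number else "qualitative"


# Classify each variable using the values recorded for the first item.
kinds = {name: variable_kind(value) for name, value in vars(population[0]).items()}

for name, kind in kinds.items():
    print(f"{name}: {kind}")
```

Each `Item` is one member of the set of items, each field is a variable, and the stored values are the data.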

---

## What do data look like?

[http://brianknaus.com/software/srtoolbox/s_4_1_sequence80.txt](http://brianknaus.com/software/srtoolbox/s_4_1_sequence80.txt)

---

## What do data look like?

[https://dev.twitter.com/docs/api/1/get/blocks/blocking](https://dev.twitter.com/docs/api/1/get/blocks/blocking)

---

## What do data look like?

[http://blue-button.github.com/challenge/](http://blue-button.github.com/challenge/)

---

## What do data look like?

[http://www.nytimes.com/2012/06/26/technology/in-a-big-network-of-computers-evidence-of-machine-learning.html?pagewanted=all&_r=0](http://www.nytimes.com/2012/06/26/technology/in-a-big-network-of-computers-evidence-of-machine-learning.html?pagewanted=all&_r=0)

---

## What do data look like?

[http://www.pnas.org/content/109/30/12081.full](http://www.pnas.org/content/109/30/12081.full)

[https://soundcloud.com/uncoolbob/sets/darwintunes](https://soundcloud.com/uncoolbob/sets/darwintunes)

---

## What do data look like?

[http://www.data.gov/](http://www.data.gov/)

---

## What do data look like?

Rarely

---

## The data are the second most important thing

* The most important thing in data science is the question
* The second most important is the data
* Often the data will limit or enable the questions
* But having data can't save you if you don't have a question