# Chapter 13 - Experiments and Observational Studies

## Observational Studies

* **observational studies**: researchers don't assign choices; they simply observe them
  * valuable for discovering trends and possible relationships
* **retrospective study**: first identify subjects, then collect data on their past conditions or behaviors
  * relying on historical data (records or recollections) can introduce errors
* **prospective study**: identify subjects first, then collect data as events unfold

## Randomized, Comparative Experiments

* **experiment**: requires **random assignment** of subjects to treatments
* studies the relationship between two or more variables
* must identify at least one explanatory variable, called a **factor**, to manipulate, and at least one response variable to measure
* experimenter actively and deliberately manipulates the factors to control the details of the possible treatments, and assigns subjects to treatments at random
* **subjects** or **participants**: terms for humans on whom experiments are performed
* **experimental unit**: more generic term for any individual (human or not) on which an experiment is performed
* **levels**: the specific values chosen in an experiment for a factor
* **treatment**: the combination of specific levels for all factors that an experimental unit receives
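
A minimal sketch of random assignment, assuming a single factor with three levels. The function name `assign_treatments`, the plant labels, and the dose levels are hypothetical, chosen only for illustration.

```python
import random

def assign_treatments(units, treatments, seed=None):
    """Randomly assign experimental units to treatments in near-equal groups."""
    rng = random.Random(seed)      # a seed makes the assignment reproducible
    shuffled = list(units)
    rng.shuffle(shuffled)          # random order removes any selection pattern
    groups = {t: [] for t in treatments}
    for i, unit in enumerate(shuffled):
        # deal the shuffled units out to the treatments in turn
        groups[treatments[i % len(treatments)]].append(unit)
    return groups

# Hypothetical experiment: 12 plants, one factor (fertilizer) with 3 levels
plants = [f"plant_{i}" for i in range(1, 13)]
groups = assign_treatments(plants, ["none", "half dose", "full dose"], seed=1)
for treatment, members in groups.items():
    print(treatment, members)
```

Shuffling first and then dealing units out keeps the group sizes balanced, which also gives replication within each treatment group.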

## The Four Principles of Experimental Design

1. **Control**: control sources of variation other than the factors we are testing by making conditions as similar as possible for all treatment groups.
2. **Randomize**: random assignment equalizes, on average, the effects of unknown or uncontrollable sources of variation.  It is also what allows us to use statistical methods to draw conclusions from an experiment.
3. **Replicate**: 2 kinds
   * apply each treatment to a number of subjects
   * perform the experiment multiple times, under different circumstances
4. **Block**: Group individuals that are similar, with respect to some attribute that _may_ affect the outcome, into **blocks**, then randomize within each block.  This can help remove variability due to differences among blocks.
   * note: blocking, unlike the previous 3 principles, is not _required_

## Step-by-Step Example

* Think
  * plan: state what you want to know
  * response: specify the response variable
  * treatments: specify the factor levels and the treatments
  * experimental units: specify the experimental units
  * experimental design: observe the principles of design
    * control: any sources of variability you know of and can control
    * replicate: results by placing more than one experimental unit in each treatment group
    * randomly assign: experimental units to treatments, to equalize the effects of unknown or uncontrollable sources of variation; describe how the randomization will be accomplished
    * make a picture: a diagram of your design can help you think clearly about it
  * specify any other experimental details, particularly those someone else would need in order to replicate the experiment
  * specify how to measure the response
* Show
  * display data and compare results for different treatment groups
* Tell
  * answer initial question: is the difference across treatment groups meaningful?
  * use statistical inference to establish statistical significance of differences

## Does the Difference Make a Difference?

* Are the differences we observe between treatment groups as big as we might get just from randomization alone?  If the answer is "No", then we say the results are **statistically significant**
* Use statistical tests to quantify this.
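
One way to put a number on "as big as we might get just from randomization alone" is a randomization (permutation) test. The sketch below is one possible approach, not necessarily the test the chapter has in mind; the function name `randomization_p_value` and the response values are made up for illustration.

```python
import random
from statistics import mean

def randomization_p_value(group_a, group_b, n_shuffles=5000, seed=0):
    """Estimate how often re-randomizing the group labels alone produces a
    difference in means at least as large as the one actually observed."""
    rng = random.Random(seed)
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    at_least_as_big = 0
    for _ in range(n_shuffles):
        rng.shuffle(pooled)  # pretend the treatment labels were assigned anew
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            at_least_as_big += 1
    return at_least_as_big / n_shuffles

# Made-up response measurements for two treatment groups
treatment = [24, 27, 25, 29, 26, 28]
control = [20, 22, 19, 23, 21, 22]
p = randomization_p_value(treatment, control)
print(f"estimated p-value: {p:.4f}")
```

A small proportion means differences this large rarely arise from the random assignment alone, which is the sense in which a result is called statistically significant.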

## Experiments and Samples

* sample surveys try to estimate population parameters
* experiments try to assess the effects of treatments

## Control Treatments

* **control treatment**: a baseline treatment (for example a default treatment or no treatment) that provides a basis for comparison
* **control group**: the experimental units to which the control treatment is applied

## Blinding

* **blinding**: disguise the treatments so that the people involved don't know which treatment each experimental unit receives
* **single-blind**: everyone in _one_ of two classes is blinded: either those who _could influence the results_ or those who _evaluate the results_
* **double-blind**: everyone in _both_ of the above classes is blinded

## Placebos

* **placebo**: a 'fake' treatment that looks just like the treatments being tested
* **placebo effect**: subjects treated with a placebo sometimes improve; highlights the importance of effective blinding and comparing treatments with a control

The best experiments are:

* randomized
* comparative
* double-blind
* placebo-controlled

## Blocking

* when groups of experimental units are similar, it's often a good idea to gather them together into blocks
* blocking isolates the variability attributable to the differences between the blocks
* allows seeing the difference caused by the treatments more clearly
* randomization occurs _within_ blocks => **randomized block design**
* **matching**: pairing subjects because they are similar in ways _not_ under study; can reduce variation in much the same way as blocking
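
A randomized block design can be sketched as follows: group the units by the blocking attribute, then randomize separately inside each block. The helper `randomized_block_assignment`, the age-group blocks, and the drug/placebo treatments are hypothetical choices for illustration.

```python
import random
from collections import defaultdict

def randomized_block_assignment(units, block_of, treatments, seed=None):
    """Assign treatments at random within each block of similar units."""
    rng = random.Random(seed)
    blocks = defaultdict(list)
    for unit in units:
        blocks[block_of(unit)].append(unit)   # gather similar units together
    assignment = {}
    for members in blocks.values():
        rng.shuffle(members)                  # randomize *within* the block
        for i, unit in enumerate(members):
            assignment[unit] = treatments[i % len(treatments)]
    return assignment

# Hypothetical subjects, blocked by age group: (id, age group)
subjects = [(f"s{i}", "under 40" if i <= 4 else "40 and over") for i in range(1, 9)]
assignment = randomized_block_assignment(
    subjects, block_of=lambda s: s[1], treatments=["drug", "placebo"], seed=42
)
for subject, treatment in sorted(assignment.items()):
    print(subject, treatment)
```

Because every block receives each treatment equally often, differences between blocks cannot masquerade as treatment effects.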

## Adding More Factors

* it's often important to include several factors in the same experiment in order to see what happens when the factor levels are applied in different _combinations_
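
With several factors, each treatment is one combination of levels, so the full set of treatments is the Cartesian product of the factor levels. A short sketch, assuming two made-up factors (`fertilizer` and `watering`):

```python
from itertools import product

# Hypothetical factors and their levels
factors = {
    "fertilizer": ["none", "half dose", "full dose"],
    "watering": ["daily", "weekly"],
}

# Every treatment is one level of each factor: 3 x 2 = 6 treatments
treatments = [dict(zip(factors, levels)) for levels in product(*factors.values())]

for t in treatments:
    print(t)
```

Running every combination is what makes it possible to see whether the effect of one factor depends on the level of another.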

## Confounding

* **confounded**: two factors are confounded when the levels of one factor are associated with the levels of the other, so their effects on the response cannot be separated

## Lurking or Confounding?

* a _lurking variable_: a variable associated with both _y_ and _x_ that makes it appear that _x_ may be causing _y_
* a _confounding variable_: associated in a noncausal way with a factor and affects the response; makes it difficult to tell if an effect was due to an experimental factor/treatment or the confounding variable

## What Can Go Wrong?

* Don't give up just because you can't run an experiment: consider an observational study
* Beware of confounding
* Bad things can happen even to good experiments: protect yourself by recording additional information
* Don't spend your entire budget on the first run; consider a pilot experiment

## What Have We Learned

* [p. 322-325]