# `nb09`: Space Shuttle Challenger disaster

The content of this notebook is adapted from "Bayesian Methods for Hackers" (Chapter 2), by Cam Davidson-Pilon.

The Space Shuttle Challenger disaster occurred on January 28, 1986, when the shuttle broke apart shortly after lift-off, killing all seven crew members. The presidential commission that investigated the accident found that it was caused by the failure of an O-ring in a field joint on the rocket booster. This failure was due to a faulty design that made the O-ring sensitive to outside temperature and other factors. Of the previous 24 flights, data on O-ring failures were available for 23 (one flight's data was lost at sea), but only the data from the 7 flights with damage incidents were considered important. Taken alone, these 7 flights did not show any clear trend.

![](figures/nb12/oring.png)

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
```

# Data

<div class="alert alert-success">
    
**Exercise**. Load the data from `"data/challenger.csv"`.
    
</div>
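
A minimal loading sketch, assuming the CSV has one column of launch temperatures and one column flagging damage incidents. The column names `Temperature` and `Damage Incident` are assumptions (this notebook does not state them) and should be adjusted to the actual file header. The helper `load_flights` is hypothetical; it drops rows whose temperature or incident value is not numeric.

```python
import pandas as pd

def load_flights(path_or_buffer, temp_col="Temperature", incident_col="Damage Incident"):
    """Read the flight records and keep only rows with usable numeric values.

    The default column names are assumptions; check the header of the real file.
    """
    df = pd.read_csv(path_or_buffer)
    # Coerce non-numeric entries (missing or textual values) to NaN.
    df[temp_col] = pd.to_numeric(df[temp_col], errors="coerce")
    df[incident_col] = pd.to_numeric(df[incident_col], errors="coerce")
    # Drop incomplete rows and store the incident flag as an integer (0 or 1).
    df = df.dropna(subset=[temp_col, incident_col]).reset_index(drop=True)
    df[incident_col] = df[incident_col].astype(int)
    return df

# Usage (path taken from the exercise statement):
# data = load_flights("data/challenger.csv")
```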

<div class="alert alert-success">
    
**Exercise**. Visualize the data.
    
</div>

# Probabilistic model

Visual inspection of the data suggests that the probability of an incident increases as the outside temperature decreases.

<div class="alert alert-success">
    
**Exercise**. Propose and implement an adequate probabilistic model.
    
</div>

Let us consider a probabilistic model where the failure $F$ is a Bernoulli random variable whose probability parameter $\rho$ follows a logistic regression model that is a function of the temperature,
$$\rho(t, \alpha, \beta) = \frac{1}{1 + \exp(\beta t + \alpha)},$$
where $t$ is the outside temperature, and $\alpha$ and $\beta$ are the model parameters.
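
The formula above can be sketched directly in NumPy. The function name `incident_probability` and the parameter values below are illustrative assumptions, not fitted values.

```python
import numpy as np

def incident_probability(t, alpha, beta):
    """rho(t, alpha, beta) = 1 / (1 + exp(beta * t + alpha))."""
    return 1.0 / (1.0 + np.exp(beta * t + alpha))

temperatures = np.array([30.0, 50.0, 70.0, 90.0])
# Illustrative parameter values only, not estimated from any data.
rho = incident_probability(temperatures, alpha=-15.0, beta=0.25)
```

With this sign convention, a positive $\beta$ makes the probability of failure decrease as the temperature rises, and $\rho = 1/2$ at $t = -\alpha / \beta$.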

As priors, we first consider the weakly informative choices $p(\alpha) = \mathcal{N}(\alpha | 0, 100)$ and $p(\beta) = \mathcal{N}(\beta | 0, 5)$.

Overall, our probabilistic model is
$$\begin{aligned}
\alpha &\sim \mathcal{N}(\alpha | 0, 100)\\
\beta &\sim \mathcal{N}(\beta | 0, 5)\\
F_i &\sim \text{Bernoulli}\left(\frac{1}{1 + \exp(\beta t_i + \alpha)}\right)
\end{aligned}$$
for the observations $i=1, \dots, N$.
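
One way to implement this model by hand is to write its unnormalized log posterior. The sketch below assumes that the second argument of each normal prior is a standard deviation (the notation above does not say whether it is a standard deviation or a variance). The function name `log_posterior` is hypothetical. Additive constants of the priors are dropped.

```python
import numpy as np

def log_posterior(alpha, beta, t, f, alpha_scale=100.0, beta_scale=5.0):
    """Unnormalized log posterior of (alpha, beta) given temperatures t and outcomes f.

    Assumption: alpha_scale and beta_scale are prior standard deviations.
    """
    z = beta * t + alpha
    # rho = 1 / (1 + exp(z)), so log(rho) = -log(1 + exp(z)) and log(1 - rho) = z - log(1 + exp(z)).
    log_rho = -np.logaddexp(0.0, z)
    log_one_minus_rho = z - np.logaddexp(0.0, z)
    # Bernoulli log-likelihood summed over the observations.
    log_lik = np.sum(f * log_rho + (1 - f) * log_one_minus_rho)
    # Zero-mean normal log-priors, up to additive constants.
    log_prior = -0.5 * (alpha / alpha_scale) ** 2 - 0.5 * (beta / beta_scale) ** 2
    return log_lik + log_prior
```

Using `np.logaddexp` avoids overflow when $\beta t + \alpha$ is large, which can easily happen under such wide priors.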

# Inference

<div class="alert alert-success">
    
**Exercise**. Compute the posterior distribution of the model parameters.
    
</div>
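
As one possible approach, the posterior can be sampled with a random-walk Metropolis algorithm. This is a hand-written sketch, not necessarily the method intended for the exercise; a probabilistic programming library would typically handle this step. The data below are synthetic stand-ins simulated from the model, not the real flight records. The prior scales are assumed to be standard deviations, and the step sizes are arbitrary tuning choices.

```python
import numpy as np

def log_posterior(theta, t, f, alpha_scale=100.0, beta_scale=5.0):
    """Unnormalized log posterior; prior scales are assumed to be standard deviations."""
    alpha, beta = theta
    z = beta * t + alpha
    log_norm = np.logaddexp(0.0, z)  # log(1 + exp(z))
    log_lik = np.sum(-f * log_norm + (1 - f) * (z - log_norm))
    log_prior = -0.5 * (alpha / alpha_scale) ** 2 - 0.5 * (beta / beta_scale) ** 2
    return log_lik + log_prior

def metropolis(t, f, n_steps=5000, step=(0.5, 0.01), seed=0):
    """Random-walk Metropolis over (alpha, beta) with independent Gaussian proposals."""
    rng = np.random.default_rng(seed)
    step = np.asarray(step)
    theta = np.zeros(2)
    current = log_posterior(theta, t, f)
    samples = np.empty((n_steps, 2))
    accepted = 0
    for i in range(n_steps):
        proposal = theta + step * rng.normal(size=2)
        candidate = log_posterior(proposal, t, f)
        # Accept with probability min(1, posterior ratio).
        if np.log(rng.uniform()) < candidate - current:
            theta, current = proposal, candidate
            accepted += 1
        samples[i] = theta
    return samples, accepted / n_steps

# Synthetic stand-in data (NOT the real flight records), simulated from the model.
rng = np.random.default_rng(1)
t_obs = rng.uniform(50.0, 85.0, size=23)
true_alpha, true_beta = -15.0, 0.23  # arbitrary values used only for the simulation
f_obs = rng.binomial(1, 1.0 / (1.0 + np.exp(true_beta * t_obs + true_alpha)))

samples, acceptance_rate = metropolis(t_obs, f_obs)
```

Because $\alpha$ and $\beta$ are strongly correlated when temperatures are not centered, such a simple sampler may mix slowly, and longer runs or tuned proposals may be needed.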

<div class="alert alert-success">
    
**Exercise**. Diagnose the inference results.
    
</div>
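
A common convergence check compares several independent chains through the potential scale reduction factor $\hat{R}$. The sketch below implements the classic Gelman-Rubin formula for one scalar parameter; dedicated diagnostics libraries use refined variants. The function name `gelman_rubin` and the test chains are illustrative assumptions.

```python
import numpy as np

def gelman_rubin(chains):
    """Classic R-hat for an array of shape (n_chains, n_draws) of one scalar parameter."""
    chains = np.asarray(chains, dtype=float)
    n_draws = chains.shape[1]
    # W: average within-chain variance.
    within = chains.var(axis=1, ddof=1).mean()
    # B: between-chain variance of the chain means, scaled by the chain length.
    between = n_draws * chains.mean(axis=1).var(ddof=1)
    # Pooled estimate of the posterior variance.
    var_hat = (n_draws - 1) / n_draws * within + between / n_draws
    return float(np.sqrt(var_hat / within))

# Illustrative chains: four well-mixed chains, and four chains stuck at different levels.
rng = np.random.default_rng(0)
mixed = rng.normal(size=(4, 2000))
stuck = mixed + np.arange(4)[:, None]

rhat_mixed = gelman_rubin(mixed)
rhat_stuck = gelman_rubin(stuck)
```

Values of $\hat{R}$ close to 1 are consistent with convergence, while values well above 1 indicate that the chains disagree.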

<div class="alert alert-success">
    
**Exercise**. Visualize the posterior distribution.
    
</div>

<div class="alert alert-success">
    
**Exercise**. Show the posterior predictive distribution of the probability of an incident.
    
</div>

On the day of the Challenger disaster, the outside temperature was 31 degrees Fahrenheit. What is the posterior distribution of the probability of an incident at this temperature?
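
Given posterior draws of $\alpha$ and $\beta$, this distribution is obtained by pushing each draw through the logistic function at $t = 31$. The sketch below uses placeholder draws with arbitrary values standing in for real inference output; `probability_at` and `summarize` are hypothetical helper names.

```python
import numpy as np

def probability_at(temperature, alpha_samples, beta_samples):
    """One value of rho per posterior draw, for a single temperature."""
    return 1.0 / (1.0 + np.exp(beta_samples * temperature + alpha_samples))

def summarize(rho_samples, level=0.95):
    """Mean and central interval of the sampled probabilities."""
    tail = (1.0 - level) / 2.0
    lower, upper = np.quantile(rho_samples, [tail, 1.0 - tail])
    return {"mean": float(np.mean(rho_samples)), "lower": float(lower), "upper": float(upper)}

# Placeholder draws with arbitrary values; replace them with the actual posterior samples.
# Real draws must be kept as (alpha, beta) pairs, since the two parameters are correlated.
rng = np.random.default_rng(0)
alpha_samples = rng.normal(-15.0, 5.0, size=4000)
beta_samples = rng.normal(0.23, 0.07, size=4000)

rho_31 = probability_at(31.0, alpha_samples, beta_samples)
summary = summarize(rho_31)
```

Note that 31 degrees lies far below typical launch temperatures, so this estimate is an extrapolation of the fitted model.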

# Criticism

## Model comparison

<div class="alert alert-success">
    
**Exercise**. Compare your results using a tighter prior.
    
</div>
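
Before refitting, it can help to look at what each prior implies for $\rho$ at a typical temperature. The sketch below draws from the priors and measures how often the implied probability is essentially 0 or 1. The tighter scales (10 and 0.1) are hypothetical choices, all scales are assumed to be standard deviations, and the helper names are illustrative.

```python
import numpy as np

def prior_predictive_rho(temperature, alpha_scale, beta_scale, n_draws=10000, seed=0):
    """Draw (alpha, beta) from zero-mean normal priors and return the implied rho."""
    rng = np.random.default_rng(seed)
    alpha = rng.normal(0.0, alpha_scale, size=n_draws)
    beta = rng.normal(0.0, beta_scale, size=n_draws)
    z = beta * temperature + alpha
    # rho = 1 / (1 + exp(z)), computed in log space to avoid overflow.
    return np.exp(-np.logaddexp(0.0, z))

def fraction_extreme(rho, eps=0.01):
    """Share of draws with rho below eps or above 1 - eps."""
    return float(np.mean((rho < eps) | (rho > 1.0 - eps)))

# Priors from the text (scales assumed to be standard deviations).
wide = prior_predictive_rho(70.0, alpha_scale=100.0, beta_scale=5.0)
# Hypothetical tighter priors, chosen only for illustration.
tight = prior_predictive_rho(70.0, alpha_scale=10.0, beta_scale=0.1)

extreme_wide = fraction_extreme(wide)
extreme_tight = fraction_extreme(tight)
```

Under the wide priors, most prior mass falls on probabilities very close to 0 or 1, so priors that look weakly informative on $\alpha$ and $\beta$ can be quite opinionated on $\rho$.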

<div class="alert alert-success">
    
**Exercise**. Compare your results to random (coin-flip) predictions.
    
</div>
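
One possible way to make this comparison is to score predicted probabilities with the mean log loss, for which a coin flip (always predicting 0.5) scores exactly $\log 2$. The choice of scoring rule is an assumption, since the exercise does not specify one. The arrays below are toy values for illustration only, not results.

```python
import numpy as np

def mean_log_loss(f, p, eps=1e-12):
    """Average negative Bernoulli log-likelihood of outcomes f under probabilities p."""
    p = np.clip(p, eps, 1.0 - eps)  # guard against log(0)
    return float(-np.mean(f * np.log(p) + (1 - f) * np.log(1 - p)))

# Toy outcomes and predicted probabilities (illustration only).
f_toy = np.array([1, 0, 0, 1, 0])
p_model_toy = np.array([0.8, 0.1, 0.3, 0.6, 0.2])
p_coin = np.full(f_toy.shape, 0.5)

loss_model = mean_log_loss(f_toy, p_model_toy)
loss_coin = mean_log_loss(f_toy, p_coin)
```

A model is only useful if its loss is clearly below the coin-flip baseline. For a fair comparison, the loss should ideally be computed on data not used for fitting.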