# Exploring the Statistics of Milk Production

## Background

The dairy industry is awash with metrics that help dairy farmers produce larger volumes of milk more efficiently, for use as drinking milk, cheese, butter, and other products.  Focusing on a small herd of dairy cattle in Pennsylvania, this analysis explores some relationships in the available data.

The collected data comes from a single herd of dairy cattle.  The lactating portion of this herd is milked twice daily.  Animals in a dry period are not producing milk and are excluded from the data for those dates.  This analysis examines the relationships between milk production, [weather](./milk-production-and-temperature.ipynb), and [numerical ratings of given animals](./milk-production-and-animal-classification.ipynb). This notebook provides a summary of these separate analyses.


## Selected Terms

The following terms and definitions should be useful for understanding the contents of this analysis.  

- **Milk Weight:** The amount of milk produced by an animal.  Measured in pounds of milk. For reference, a gallon of milk weighs approximately 8.6 pounds.

- **Dry Period:** The period when a cow is not producing milk. Often serves as a time of rest following a lactation period.

- **Lactation Period:** The period when a cow is producing milk.

- **Days in Milk:** The number of consecutive days a given cow has been actively producing milk.

- **Linear Classification Score:** An integer score from 1 to 100 given to a milk cow, providing a numerical representation of how well the physical attributes of the animal fit the profile of an 'ideal' milking cow.  It is a weighted summary of 18 or more assessments of the animal.

- **Extreme Temperature Day:** A day on which the maximum temperature is $90^{\circ}F$ or higher, or the minimum temperature is below $10^{\circ}F$.
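
The two numeric definitions above can be written down directly. The following is a minimal sketch, not code from the linked notebooks; the function and constant names are hypothetical and chosen only for illustration.

```python
# Thresholds taken from the Extreme Temperature Day definition (degrees Fahrenheit).
HOT_THRESHOLD_F = 90.0
COLD_THRESHOLD_F = 10.0

# Approximate weight of one gallon of milk, per the Milk Weight definition.
POUNDS_PER_GALLON = 8.6


def is_extreme_temperature_day(t_max_f: float, t_min_f: float) -> bool:
    """Return True if the daily high is >= 90 F or the daily low is < 10 F."""
    return t_max_f >= HOT_THRESHOLD_F or t_min_f < COLD_THRESHOLD_F


def milk_weight_to_gallons(pounds: float) -> float:
    """Convert a milk weight in pounds to an approximate volume in gallons."""
    return pounds / POUNDS_PER_GALLON
```

Note the asymmetry in the definition: a high of exactly 90 counts as extreme, while a low of exactly 10 does not.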


## Tests and Results

### Linear Classification Score and Daily Average Milk Weight

#### Hypotheses

- ~~**$H_{o Class}$:** The linear classification score is not linearly correlated with the average daily milk weight produced.~~
- **$H_{a Class}$:** The linear classification score is linearly correlated with the average daily milk weight produced.

#### Results

The [results](./milk_production_and_animal_classification.ipynb) suggest that there is a statistically significant linear relationship between linear classification score and average daily milk weight.  We therefore reject the null hypothesis in favor of the alternative.
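
As an illustration of how a hypothesis of this form can be tested, the sketch below computes a Pearson correlation coefficient and estimates a two-sided p-value with a permutation test. This is a stand-in technique using only the standard library, not necessarily the method used in the linked notebook, and the numbers are made up for illustration rather than taken from the herd.

```python
import random
from statistics import mean


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def permutation_p_value(xs, ys, n_permutations=2000, seed=0):
    """Two-sided p-value: share of shuffles with |r| at least as large as observed."""
    rng = random.Random(seed)
    observed = abs(pearson_r(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs(pearson_r(xs, shuffled)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_permutations + 1)


# Made-up illustration values, NOT the herd's data.
scores = [72, 75, 78, 80, 83, 85, 88, 90]
avg_daily_milk_lbs = [55.0, 58.5, 57.0, 62.0, 64.5, 63.0, 68.0, 70.5]

r = pearson_r(scores, avg_daily_milk_lbs)
p = permutation_p_value(scores, avg_daily_milk_lbs)
print(f"r = {r:.3f}, permutation p = {p:.4f}")
```

A small p-value here means that shuffling the milk weights rarely produces a correlation as strong as the observed one, which is the sense in which the null hypothesis of no linear correlation is rejected.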


### Maximum Temperature and Milk Weight

#### Hypotheses

- **$H_{o Tmax}$:** The weekly average of maximum temperatures is not linearly correlated with weekly per-capita milk production.
- ~~**$H_{a Tmax}$:** The weekly average of maximum temperatures is linearly correlated with weekly per-capita milk production.~~

#### Results

The [results](./milk-production-and-temperature.ipynb) suggest that there is not a statistically significant linear relationship between the weekly average of maximum temperatures and weekly per-capita milk production.  Therefore we cannot reject the null hypothesis.
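
The weekly quantities named in these hypotheses have to be derived from daily records. The sketch below shows one way that aggregation might look. It assumes that "per-capita" means the total milk weight in a week divided by the number of distinct animals milked that week, which is an assumption rather than a definition taken from the linked notebook. The record layout and all values are hypothetical.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

# Hypothetical daily records, NOT the herd's data:
# (date, animal_id, milk_weight_lbs) and (date, max_temp_f).
milkings = [
    (date(2020, 1, 6), "A", 60.0),
    (date(2020, 1, 6), "B", 50.0),
    (date(2020, 1, 7), "A", 62.0),
    (date(2020, 1, 13), "A", 58.0),
    (date(2020, 1, 13), "B", 52.0),
    (date(2020, 1, 14), "B", 54.0),
]
daily_max_temp_f = [
    (date(2020, 1, 6), 30.0),
    (date(2020, 1, 7), 34.0),
    (date(2020, 1, 13), 40.0),
    (date(2020, 1, 14), 44.0),
]


def iso_week(d):
    """Key a date by (ISO year, ISO week number)."""
    year, week, _ = d.isocalendar()
    return (year, week)


def weekly_per_capita_milk(records):
    """Total weekly milk weight divided by the distinct animals milked that week."""
    totals = defaultdict(float)
    animals = defaultdict(set)
    for day, animal_id, pounds in records:
        key = iso_week(day)
        totals[key] += pounds
        animals[key].add(animal_id)
    return {key: totals[key] / len(animals[key]) for key in totals}


def weekly_mean_temperature(records):
    """Mean of the daily temperature readings within each ISO week."""
    by_week = defaultdict(list)
    for day, temp_f in records:
        by_week[iso_week(day)].append(temp_f)
    return {key: mean(temps) for key, temps in by_week.items()}


per_capita = weekly_per_capita_milk(milkings)
mean_tmax = weekly_mean_temperature(daily_max_temp_f)
```

Dividing by the number of animals milked keeps weeks comparable when cows enter or leave a dry period, since the size of the lactating group changes over time.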

### Minimum Temperature and Milk Weight

#### Hypotheses

- **$H_{o Tmin}$:** The weekly average of minimum temperatures is not linearly correlated with weekly per-capita milk production.
- ~~**$H_{a Tmin}$:** The weekly average of minimum temperatures is linearly correlated with weekly per-capita milk production.~~

#### Results

The [results](./milk-production-and-temperature.ipynb) suggest that there is not a statistically significant linear relationship between the weekly average of minimum temperatures and weekly per-capita milk production.  Therefore we cannot reject the null hypothesis.

### Extreme Temperature Days and Milk Weight

#### Hypotheses

- **$H_{o Tex}$:** The number of extreme temperature days per week is not linearly correlated with per-capita weekly milk production.
- ~~**$H_{a Tex}$:** The number of extreme temperature days per week is linearly correlated with per-capita weekly milk production.~~

#### Results

The [results](./milk-production-and-temperature.ipynb) suggest that there is not a statistically significant linear relationship between the number of extreme temperature days per week and per-capita weekly milk production.  Therefore we cannot reject the null hypothesis.



## Future Considerations

- Is the impact of extreme temperatures offset by one or two days?
- What is the impact of consecutive extreme temperature days?
- Does age have a positive or negative impact on milk production volumes?
- How does the number of days since calving impact milk production?
- Does days-in-milk have predictive weight in milk production?
- How do butterfat percentage and somatic cell count vary with milk production?
- What relationships exist between flow-rate (max and average) and somatic cell count?
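
The first two questions above, on delayed effects and on consecutive extreme days, both come down to simple transformations of a daily series. The following is a rough sketch of those transformations only. The helper names are hypothetical, the values are made up, and nothing here reflects an analysis that has been carried out.

```python
def lagged_pairs(flags, values, lag_days):
    """Pair each day's flag with the value observed `lag_days` later."""
    if lag_days < 0:
        raise ValueError("lag_days must be non-negative")
    # zip stops at the shorter sequence, so trailing flags without a
    # matching later value are dropped.
    return list(zip(flags, values[lag_days:]))


def consecutive_run_lengths(flags):
    """For each day, the number of consecutive flagged days ending on that day."""
    runs, current = [], 0
    for flagged in flags:
        current = current + 1 if flagged else 0
        runs.append(current)
    return runs


# Made-up illustration values, NOT the herd's data.
extreme_day = [False, True, True, False, True, True, True]
herd_milk_lbs = [610.0, 605.0, 590.0, 580.0, 600.0, 585.0, 570.0]

print(lagged_pairs(extreme_day, herd_milk_lbs, lag_days=1))
print(consecutive_run_lengths(extreme_day))
```

Either derived series could then be compared against milk weight in the same way as the temperature variables in the tests above.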
