### Causal Relations

<img src="images/image1.jpeg" width="600" height="400" /> 

#### Most interventions on height will necessarily influence weight, but far fewer interventions on weight will influence height.

<img src="images/image2.jpeg" width="600" height="400" /> <img src="images/image3.jpeg" width="600" height="400" />

#### Height influences weight, and sex influences both height and weight. Height affects weight directly, while sex affects weight both directly and indirectly through its effect on height.

<img src="images/image4.jpeg" width="600" height="400" /> <video controls width="600" height="400" src="files/recording1.mp4" title="Animation"></video>

### Unmeasured Causes
Unmeasured causes are those influences on variables that are not directly observed or included in a model. These can generate variation in each of the measured variables and are typically represented as stochastic variables in a generative model. They are considered ignorable unless they're shared among the measured variables, which could introduce confounds.

### Temperature and Sex Determination in Turtles
In some reptiles, such as many turtles and some lizards, sex is determined by the ambient temperature during egg incubation. For example, in turtles, warmer temperatures tend to produce more females, while cooler temperatures produce more males. Temperature also influences other traits such as body weight, because it affects the ecology and the availability of food, which makes it a common cause of both sex and weight.

### Confounds
A confound is an unmeasured common cause that affects multiple variables within a study. If not accounted for, confounds can lead to incorrect conclusions about the relationships between variables. In causal modeling, it's crucial to consider potential confounds and either measure them or adjust the analysis to account for their influence.
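The effect of a shared cause can be illustrated with a small simulation. This is a minimal sketch with made-up variables: `u` stands for a hypothetical unmeasured common cause, and `x` and `y` are two measured variables that do not influence each other at all.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 10_000

# u is a hypothetical common cause; x and y each depend on u but not on each other.
u = rng.normal(0.0, 1.0, n)
x = u + rng.normal(0.0, 1.0, n)
y = u + rng.normal(0.0, 1.0, n)

# Ignoring u, x and y look associated even though neither causes the other.
naive_corr = np.corrcoef(x, y)[0, 1]

# Removing u's contribution (its coefficient is 1 because we wrote the simulation)
# leaves only the independent noise, so the association disappears.
adjusted_corr = np.corrcoef(x - u, y - u)[0, 1]

print(f"correlation ignoring u:      {naive_corr:.2f}")
print(f"correlation after adjusting: {adjusted_corr:.2f}")
```

In real data `u` is often not measured, which is why the association cannot simply be subtracted away like this.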


<img src="images/image5.jpeg" width="600" height="400" /> <img src="images/image6.jpeg" width="600" height="400" />

--------------------------------

<img src="images/image8.jpeg" width="600" height="400" /> <img src="images/image9.jpeg" width="600" height="400" />

<img src="images/image10.jpeg" width="600" height="400" /> <img src="images/image11.jpeg" width="600" height="400" />

# Understanding Differences in Mean Weights

## Introduction

To understand the difference in mean weights between two categories (e.g., men and women), we analyze their posterior distributions. The key is to compute the contrast, which is the difference between the means of the two categories.

## Posterior Distributions

- **Posterior distributions** represent our updated beliefs about the parameters (mean weights) after incorporating observed data.

## Computing the Contrast

To determine the difference in mean weights:

1. **Simulate from Posterior Distributions**:
   - Draw samples from the posterior distributions of the mean weights for both categories.
   - Calculate the difference in means for each pair of samples.

2. **Assessing Overlap and Difference**:
   - Even if individual weight distributions overlap, it does not mean the means are the same.
   - The key is the distribution of the differences in means, not the overlap of individual values.
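The two steps above can be sketched in Python. This is an illustration under stated assumptions, not the lecture's own code: the weights are synthetic, and the posterior of each mean is approximated by a normal distribution centered on the sample mean with the standard error as its spread. The helper name `posterior_mean_draws` is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)

# Synthetic weights in kg (made-up values). The two groups overlap heavily.
w_women = rng.normal(42.0, 6.0, 200)
w_men = rng.normal(48.0, 6.0, 200)

def posterior_mean_draws(w, n_draws, rng):
    """Approximate posterior draws of a group mean: Normal(sample mean, standard error)."""
    se = w.std(ddof=1) / np.sqrt(len(w))
    return rng.normal(w.mean(), se, n_draws)

mu_women = posterior_mean_draws(w_women, 4000, rng)
mu_men = posterior_mean_draws(w_men, 4000, rng)

# The contrast is computed draw by draw, which yields a distribution of differences.
mu_contrast = mu_men - mu_women

low, high = np.percentile(mu_contrast, [5.5, 94.5])
print(f"mean contrast: {mu_contrast.mean():.1f} kg, 89% interval [{low:.1f}, {high:.1f}]")
```

Even though the individual weights overlap, the interval for the contrast can sit well away from zero, because uncertainty about a mean shrinks with sample size while the spread of individuals does not.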

<img src="images/image16.jpeg" width="600" height="400" /> <img src="images/image17.jpeg" width="600" height="400" />


## Statistical Significance of the Difference

- **Overlap of Distributions**:
  - Overlap in individual distributions does not imply a lack of difference in means.
  - The posterior distribution of the differences shows how large the difference in means is and how certain we are of its sign.

## Practical Implications

1. **Mean Differences**:
   - The computed contrast provides a range for the difference in mean weights between the categories.

2. **Real-Life Context**:
   - Despite overlaps in individual weights, the difference in means can be reliably nonzero and practically meaningful.
   - A related contrast between simulated individual weights (rather than means) estimates the probability that a randomly selected individual from one category weighs more than one from the other.
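The probability that one randomly chosen individual outweighs another comes from simulated individuals, not from the means alone. The sketch below assumes stand-in posterior draws with made-up values (`mu_women`, `mu_men`, `sigma`). In practice these would come from a fitted model.

```python
import numpy as np

rng = np.random.default_rng(3)
n_draws = 10_000

# Stand-in posterior draws (hypothetical values) for each group mean and a shared sigma.
mu_women = rng.normal(42.0, 0.4, n_draws)
mu_men = rng.normal(48.0, 0.4, n_draws)
sigma = np.abs(rng.normal(6.0, 0.3, n_draws))

# Simulate one individual per posterior draw in each group (posterior predictive).
w_women = rng.normal(mu_women, sigma)
w_men = rng.normal(mu_men, sigma)

# Contrast of individuals versus contrast of means.
p_man_heavier = np.mean(w_men - w_women > 0)
p_mean_higher = np.mean(mu_men - mu_women > 0)

print(f"P(random man heavier than random woman): {p_man_heavier:.2f}")
print(f"P(mean for men exceeds mean for women):  {p_mean_higher:.2f}")
```

The two probabilities answer different questions: the second is about the averages, the first is about individuals, and it is pulled toward 0.5 by individual variation.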

## Conclusion

Overlap in individual distributions does not rule out a reliable difference between means. The contrast (the posterior distribution of the difference between means) is crucial for understanding and quantifying the differences between categories. Always compute and interpret this contrast rather than judging by overlap.

<img src="images/image20.jpeg" width="600" height="400" /> <img src="images/image21.jpeg" width="600" height="400" />

## Estimating the Causal Effect of Sex on Weight

We've addressed the total causal effect of sex on weight using two contrasts:
- **Means comparison**: Examines the difference in average weights.
- **Distributional comparison**: Considers the entire weight distribution.

### Causal Effect Analysis
- **Hypothetical Intervention**: Considers the effect of changing someone's birth sex on their weight.
- **Distributional Estimates**: The plots show distributions. A point summary is a decision about what to report, not the scientific estimate. The estimate that carries all the information is the full distribution.

### Direct Causal Effect of Sex on Weight
To understand the direct causal effect of sex on weight, we need a new model to partial out the indirect effect of height. Instead of "control," we use "block" or "stratify."

### Generative Model Parameters
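- **Indirect Effect Parameters**: Govern the path from sex to weight that runs through height, such as the slope relating height to weight.
- **Direct Effect Parameters**: Govern the path from sex to weight that does not pass through height. For example, men are 10 kg heavier on average regardless of height.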

### Simulation and Analysis
In our simulation:
- **Same Slopes for Both Sexes**: The effect of height on weight is identical for men and women.
- **Direct Effect Simulation**: Men are consistently 10 kg heavier regardless of height. Regression lines have the same slope but different intercepts due to this direct effect.
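A generative simulation of this kind might look like the sketch below. The function name `sim_hw`, the mean heights (150 and 160 cm), the slope of 0.5, and the noise levels are all assumptions chosen for illustration. Only the equal slopes and the 10 kg direct effect come from the description above.

```python
import numpy as np

def sim_hw(sex, slope, intercept, rng):
    """Simulate height and weight; sex is coded 1 (female) or 2 (male)."""
    idx = sex - 1  # NumPy arrays are 0-indexed
    height = np.where(sex == 1, 150.0, 160.0) + rng.normal(0.0, 5.0, len(sex))
    weight = intercept[idx] + slope[idx] * height + rng.normal(0.0, 5.0, len(sex))
    return height, weight

rng = np.random.default_rng(4)
sex = rng.integers(1, 3, size=2000)   # values are 1 or 2
slope = np.array([0.5, 0.5])          # same slope for both sexes
intercept = np.array([0.0, 10.0])     # men 10 kg heavier at any height
height, weight = sim_hw(sex, slope, intercept, rng)

# Fit a separate least-squares line per sex; polyfit returns [slope, intercept].
fit_f = np.polyfit(height[sex == 1], weight[sex == 1], 1)
fit_m = np.polyfit(height[sex == 2], weight[sex == 2], 1)

# Vertical gap between the two lines at a common height of 155 cm.
gap_at_155 = np.polyval(fit_m, 155.0) - np.polyval(fit_f, 155.0)
print(f"slopes: F={fit_f[0]:.2f}, M={fit_m[0]:.2f}; gap at 155 cm: {gap_at_155:.1f} kg")
```

Because the slopes match, the gap between the lines is roughly the same at every height, which is what a pure direct effect looks like.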

<img src="images/image23.jpeg" width="600" height="400" />


### Developing a Statistical Model
To model this, we augment our linear model to include height, which allows us to stratify by height. We use a technique called **centering**, subtracting the average height (h̅) to simplify interpretation.

### Centering Explained
- **Alpha (α)**: Expected weight at the average height.
- **Beta (β)**: Effect of height on weight.
- **Benefits of Centering**:
  - Simplifies software calculations.
  - Easier to conceptualize priors.
  - Alpha directly represents average weight at average height.

   <img src="images/image24.jpeg" width="600" height="400" /> 

### Regression Line and Grand Mean
The regression line passes through the grand mean of the data. If an individual's height is average, the best guess for their weight is the average weight. The grand mean, where the average height and weight intersect, is pivotal.
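The link between centering and the grand mean can be checked numerically. The sketch below uses ordinary least squares on synthetic data as a stand-in for the Bayesian model. The data values are made up.

```python
import numpy as np

rng = np.random.default_rng(5)

# Synthetic heights (cm) and weights (kg); values are illustrative only.
height = rng.normal(155.0, 8.0, 500)
weight = 45.0 + 0.6 * (height - 155.0) + rng.normal(0.0, 4.0, 500)

h_bar = height.mean()

# Centered fit: alpha is the expected weight at the average height.
beta, alpha = np.polyfit(height - h_bar, weight, 1)

# Uncentered fit: the intercept is the expected weight at a height of 0 cm.
beta_raw, alpha_raw = np.polyfit(height, weight, 1)

print(f"centered:   alpha = {alpha:.2f}, mean weight = {weight.mean():.2f}")
print(f"uncentered: alpha = {alpha_raw:.2f} (weight at height 0, hard to interpret)")
print(f"slope is unchanged: {beta:.3f} vs {beta_raw:.3f}")
```

The centered intercept equals the mean weight, so the fitted line passes through the grand mean, and a prior for alpha can be set by thinking about typical adult weights.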

 <img src="images/image25.jpeg" width="600" height="400" />

### Summary
- **Points in Data**: Centered around the grand mean.
- **Regression Lines**: Reflect expectations, pivoting around the grand mean.
- **Alpha's Role**: Alpha is the expected weight at the average height, which simplifies modeling.

> **Note**: Centering helps by making alpha the expected weight at average height, streamlining many modeling tasks.

### Stratifying by Categorical Variables
Stratifying by categorical variables, like sex, simplifies modeling tasks. We allow both the slope and intercept (alpha) to vary by sex. This adjustment is straightforward in code, using subscripted variables.

### Example of Stratification
- **Addressing in Vectors**: Each sex has a specific address in the alpha and beta vectors.
  - **Male (S = 2)**: Address 2 in both vectors.
  - **Female (S = 1)**: Address 1 in both vectors.
- **Data Simulation**: Testing with synthetic data to understand the stratified model.
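The addressing scheme can be sketched with NumPy arrays. The parameter values below are hypothetical. Note that NumPy indexes from 0, so the codes 1 and 2 are shifted down by one before they are used as addresses.

```python
import numpy as np

# Hypothetical parameters, one entry per sex:
# position 0 holds S = 1 (female), position 1 holds S = 2 (male).
alpha = np.array([45.0, 55.0])   # expected weight at average height
beta = np.array([0.6, 0.7])      # slope of weight on height
h_bar = 155.0                    # average height used for centering

# Four individuals with their sex codes and heights.
S = np.array([1, 2, 2, 1])
H = np.array([150.0, 160.0, 155.0, 165.0])

# Each individual picks up the alpha and beta stored at their sex's address.
idx = S - 1
mu = alpha[idx] + beta[idx] * (H - h_bar)

# Expected weights in kg: 42.0, 58.5, 55.0, 51.0
print(mu)
```

The same line of code handles any number of categories, since adding a category only lengthens the parameter vectors.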

 <img src="images/image26.jpeg" width="600" height="400" />

### Analyzing the Direct Effect
To estimate the direct effect of sex on weight, we:
1. **Simulate Individuals**: For each height, simulate expected weights for men and women.
2. **Compute Differences**: Calculate the difference in expected weights at each height.

<img src="images/image28.jpeg" width="600" height="400" />

### Simulation Process
- **Link Function**: `link`, from the rethinking package, computes the posterior distribution of the expected weight for the inputs it is given.
- **Data Inputs**: S vector of ones (for females) and twos (for males).
- **Posterior Distribution**: Difference in expected weights between men and women at each height.
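A rough Python analogue of this process is sketched below. It is not a port of `link`: the helper `expected_weight` is hypothetical, and the posterior draws are stand-ins with made-up values centered on equal parameters for both sexes, so the output illustrates the mechanics rather than any result.

```python
import numpy as np

rng = np.random.default_rng(7)
n_draws = 2000
h_bar = 155.0

# Stand-in posterior draws (hypothetical): column 0 is female, column 1 is male.
alpha = rng.normal([45.0, 45.0], 0.5, size=(n_draws, 2))
beta = rng.normal([0.6, 0.6], 0.05, size=(n_draws, 2))

def expected_weight(sex, heights):
    """Posterior expected weight for one sex (1 or 2) at each height.

    Returns an array of shape (n_draws, n_heights).
    """
    j = sex - 1
    return alpha[:, [j]] + beta[:, [j]] * (heights - h_bar)

heights = np.linspace(130.0, 190.0, 50)
mu_f = expected_weight(1, heights)   # S is all ones
mu_m = expected_weight(2, heights)   # S is all twos

# Contrast (F - M) at each height, computed draw by draw.
mu_contrast = mu_f - mu_m

median = np.median(mu_contrast, axis=0)
low, high = np.percentile(mu_contrast, [5.5, 94.5], axis=0)
print(f"at {heights[0]:.0f} cm: median {median[0]:.2f}, 89% interval [{low[0]:.2f}, {high[0]:.2f}]")
```

Plotting `low` and `high` against `heights` gives the kind of band shown in the figures, with a horizontal line at zero for reference.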

### Results Interpretation
- **Posterior Distribution Plot**: Shows differences in expected weights across heights.
  - **Horizontal Line at Zero**: Marks no difference. The posterior stays close to this line at most heights.
  - **Small Differences**: Small differences appear only toward the shortest and tallest heights.

### Key Insight
- **Causal Effect Through Height**: Nearly all the causal effect of sex on weight is mediated through height differences.
  - **Big Mean Differences**: Observable due to men being generally taller.
  - **Direct Effect**: Minimal direct effect of sex on weight after accounting for height.

### Summary
- **Total Causal Effect**: Estimated on the left.
- **Direct Causal Effect**: Estimated on the right, showing almost no direct effect of sex on weight.

> **Conclusion**: The primary factor in the causal effect of sex on weight is height, with negligible direct effect of sex itself.

<img src="images/image30.jpeg" width="600" height="400" />

##### We found that, on average, men weigh more than women. However, after stratifying by height, this weight difference largely disappears. This indicates that the observed weight disparity between men and women can be attributed primarily to differences in height. Interestingly, as height increases, women tend to slightly surpass men in average weight. So, we can conclude that **almost all of the causal effect of sex is through height.**

<img src="images/image31.jpeg" width="600" height="400" />