# Inequalities

Suppose you have a set of independent, bounded random variables $$ X_1, X_2, \ldots, X_n $$ and you're studying their sum $$ S = \sum_{i=1}^n X_i $$. Two standard tail bounds for \( S \) are:

1. **Hoeffding's inequality**:
   $$
   P\left(|S - \mathbb{E}[S]| \geq t\right) \leq 2\exp\left(-\frac{2t^2}{nB^2}\right),
   $$
   where \( B \) is the range of the variables, i.e. each $$ X_i $$ takes values in an interval of length \( B \).

   - **Pros**: Simple to use, doesn't require knowing the variance or distribution.
   - **Cons**: Doesn't leverage variance. If the data has low variance or small residuals, the bound can be far larger than the true probability of deviation.

2. **Bennett's inequality**:
   $$
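   P\left(|S - \mathbb{E}[S]| \geq t\right) \leq 2\exp\left(-\frac{\sigma^2}{B^2} h\left(\frac{Bt}{\sigma^2}\right)\right),
   $$
   where $$ \sigma^2 = \sum_{i=1}^n \operatorname{Var}(X_i) $$ is the variance of \( S \), \( B \) is the range (so that $$ |X_i - \mathbb{E}[X_i]| \leq B $$), and $$ h(u) = (1 + u)\ln(1 + u) - u $$. The factor 2 comes from applying the one-sided bound to both tails.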

   - **Pros**: Incorporates variance, so it provides a tighter bound when $$ \sigma^2 $$ is small.
   - **Cons**: Requires additional information about the variance $$ \sigma^2 $$ (an upper bound on it is enough, since the bound is increasing in $$ \sigma^2 $$).

---

### Practical Meaning
If your model has small residuals or variance, Hoeffding's inequality won't recognize this and will provide a conservative bound. In contrast, Bennett's inequality "takes advantage" of the favorable situation (low variance) to offer a more precise probability bound.

Thus:
- **Hoeffding** is "rough" because it's overly cautious and doesn't adapt to data properties.
- **Bennett** is "stronger" because it uses extra information to sharpen its estimates.
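
To make the comparison concrete, here is a minimal Python sketch that evaluates both bounds for one made-up setting. The numbers (`n = 1000`, range `B = 1`, per-variable variance `0.01`, deviation `t = 20`) are illustrative assumptions, not values from the text, and the Bennett bound is written in its two-sided form with the factor 2.

```python
import math

def hoeffding_bound(t, n, B):
    """Two-sided Hoeffding bound for a sum of n independent variables with range B."""
    return min(1.0, 2.0 * math.exp(-2.0 * t**2 / (n * B**2)))

def bennett_h(u):
    """h(u) = (1 + u) ln(1 + u) - u."""
    return (1.0 + u) * math.log1p(u) - u

def bennett_bound(t, sigma2, B):
    """Two-sided Bennett bound; sigma2 is the total variance of the sum."""
    return min(1.0, 2.0 * math.exp(-(sigma2 / B**2) * bennett_h(B * t / sigma2)))

# Hypothetical low-variance setting (assumed values, for illustration only).
n, B, var_each = 1000, 1.0, 0.01
sigma2 = n * var_each   # total variance of S under independence
t = 20.0

hoeff = hoeffding_bound(t, n, B)
benn = bennett_bound(t, sigma2, B)
print(f"Hoeffding bound: {hoeff:.4g}")
print(f"Bennett bound:   {benn:.4g}")
```

In this low-variance setting the Hoeffding bound is close to 1, while the Bennett bound is several orders of magnitude smaller. Raising `var_each` toward its maximum for the given range narrows the gap.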

To determine whether a Poisson random variable \( X \) is sub-Gaussian, sub-exponential, or neither, we analyze its **moment generating function (MGF)**, which is given as:

$$
M_X(s) = \mathbb{E}[e^{sX}] = e^{\lambda(e^s - 1)},
$$
where $$ \lambda > 0 $$ is the rate parameter of the Poisson distribution.
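
As a quick sanity check on this closed form, the following sketch compares it with a truncated series $$ \sum_k e^{sk} P(X = k) $$. The function names, the truncation at 200 terms, and the test values `lam = 3.0`, `s = 0.5` are choices made for this illustration only.

```python
import math

def poisson_mgf_series(s, lam, terms=200):
    """Approximate E[exp(sX)] by summing exp(s*k) * P(X = k) for k < terms."""
    total = 0.0
    log_pmf = -lam  # log P(X = 0)
    for k in range(terms):
        total += math.exp(s * k + log_pmf)
        # Recurrence: P(X = k + 1) = P(X = k) * lam / (k + 1)
        log_pmf += math.log(lam) - math.log(k + 1)
    return total

def poisson_mgf_closed(s, lam):
    """Closed form exp(lam * (e^s - 1))."""
    return math.exp(lam * math.expm1(s))

lam, s = 3.0, 0.5  # assumed illustrative values
series_value = poisson_mgf_series(s, lam)
closed_value = poisson_mgf_closed(s, lam)
print(series_value, closed_value)
```

The two values should agree up to floating-point error as long as the truncation point is well above $$ \lambda e^s $$.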

### **Sub-Gaussian Random Variables**
A random variable \( X \) is sub-Gaussian if its tails decay at least as fast as a Gaussian distribution. Equivalently, \( X \) is sub-Gaussian if there exists a constant $$ \sigma^2 > 0 $$ such that:

$$
\mathbb{E}[e^{s(X - \mathbb{E}[X])}] \leq e^{\frac{\sigma^2 s^2}{2}} \quad \text{for all } s \in \mathbb{R}.
$$

For the Poisson distribution:
- $$ \mathbb{E}[X] = \lambda $$
- Centering \( X \) by subtracting $$ \mathbb{E}[X] $$, the MGF of $$ X - \mathbb{E}[X] $$ becomes:
  $$
  \mathbb{E}[e^{s(X - \lambda)}] = e^{\lambda(e^s - 1 - s)}.
  $$

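To satisfy the sub-Gaussian criterion, we would need $$ \lambda(e^s - 1 - s) \leq \frac{\sigma^2 s^2}{2} $$ for all \( s \), i.e. the exponent must grow at most quadratically in \( s \). However, the term $$ e^s - 1 - s $$ grows exponentially as $$ s \to \infty $$, faster than any multiple of $$ s^2 $$, so no finite $$ \sigma^2 $$ works and \( X \) is **not sub-Gaussian**.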
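
One way to see this numerically is to compute, for each \( s \), the smallest $$ \sigma^2 $$ that would make the sub-Gaussian inequality hold at that particular \( s \). The sketch below does this for a few values. The helper name `required_sigma2` and the choice `lam = 1.0` are assumptions made for illustration.

```python
import math

def required_sigma2(s, lam):
    """Smallest sigma^2 with lam*(e^s - 1 - s) <= sigma^2 * s^2 / 2 at this s (s != 0)."""
    return 2.0 * lam * (math.expm1(s) - s) / s**2

lam = 1.0  # assumed rate parameter
s_values = [1.0, 5.0, 10.0, 20.0]
needed = [required_sigma2(s, lam) for s in s_values]
for s, v in zip(s_values, needed):
    print(f"s = {s:5.1f}  required sigma^2 >= {v:.4g}")
```

The required value keeps increasing with \( s \), so no single constant can serve for all \( s \).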

---

### **Sub-Exponential Random Variables**
A random variable \( X \) is sub-exponential if its tails decay at least as fast as an exponential distribution. \( X \) is sub-exponential if there exist constants $$ \nu > 0 $$ and $$ \alpha > 0 $$ such that:

$$
\mathbb{E}[e^{s(X - \mathbb{E}[X])}] \leq e^{\frac{\nu^2 s^2}{2}} \quad \text{for } |s| \leq \frac{1}{\alpha}.
$$

For the Poisson distribution:
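- For small \( s \), $$ e^s - 1 - s \approx \frac{s^2}{2} $$, so $$ \mathbb{E}[e^{s(X - \lambda)}] \approx e^{\frac{\lambda s^2}{2}} $$, which matches sub-Gaussian behavior locally. More precisely, $$ e^s - 1 - s \leq s^2 $$ whenever $$ \lvert s \rvert \leq 1 $$, so the condition above holds with $$ \nu^2 = 2\lambda $$ and $$ \alpha = 1 $$.
- For larger \( s \), the exponential term in $$ e^s - 1 - s $$ dominates and no quadratic bound holds, but the sub-exponential definition only requires the bound near the origin. The MGF is still finite for every \( s \), which is consistent with exponential decay in the tails of the Poisson distribution.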

Thus, the Poisson distribution is **sub-exponential** but **not sub-Gaussian**.
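
The sub-exponential condition can also be checked numerically. The sketch below assumes the constants $$ \nu^2 = 2\lambda $$ and $$ \alpha = 1 $$ (one valid choice, following from $$ e^s - 1 - s \leq s^2 $$ on $$ [-1, 1] $$) and an illustrative rate `lam = 4.0`. It tests the inequality on a grid inside the interval and at one point outside it.

```python
import math

def centered_log_mgf(s, lam):
    """log E[exp(s(X - lam))] = lam * (e^s - 1 - s) for X ~ Poisson(lam)."""
    return lam * (math.expm1(s) - s)

def quadratic_bound(s, nu2):
    """Exponent nu^2 * s^2 / 2 from the sub-exponential definition."""
    return nu2 * s**2 / 2.0

lam = 4.0                      # assumed rate parameter
nu2, alpha = 2.0 * lam, 1.0    # assumed constants, not the only valid choice

# Grid over |s| <= 1/alpha; small tolerance absorbs rounding near s = 0.
grid = [-1.0 / alpha + i * (2.0 / alpha) / 2000 for i in range(2001)]
holds_inside = all(
    centered_log_mgf(s, lam) <= quadratic_bound(s, nu2) + 1e-12 for s in grid
)
# The same quadratic bound is not expected to hold far from the origin.
holds_at_3 = centered_log_mgf(3.0, lam) <= quadratic_bound(3.0, nu2)

print("bound holds for |s| <= 1:", holds_inside)
print("bound holds at s = 3:   ", holds_at_3)
```

The bound holding inside the interval but failing at `s = 3` mirrors the conclusion above: the quadratic control is only local, which is what separates sub-exponential from sub-Gaussian behavior.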
