# 2.6 Environment and Distribution Shift


## 2.6.1 Types of Distribution Shift
In one classic setup, we assume that our training data
was sampled from some distribution $p_S(\mathbf{x},y)$
but that our test data will consist
of unlabeled examples drawn from
some different distribution $p_T(\mathbf{x},y)$.
Already, we must confront a sobering reality.
Absent any assumptions on how $p_S$
and $p_T$ relate to each other,
learning a robust classifier is impossible.

Covariate shift, label shift, and concept shift are three common ways to characterize how the data distribution changes. They help us understand when a model is likely to underperform and how to adapt it.

### Covariate Shift
Here, we assume that while the distribution of inputs
may change over time, the labeling function,
i.e., the conditional distribution
$P(y \mid \mathbf{x})$, does not change.
Statisticians call this *covariate shift*
because the problem arises due to a
shift in the distribution of the covariates (features).

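Covariate shift: the distribution of the input data (the features, or covariates) changes between the training set and the test set, while the distribution of the output (the label) given the input stays the same.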
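
Written in terms of the joint distributions $p_S(\mathbf{x}, y)$ and $p_T(\mathbf{x}, y)$, the covariate shift assumption amounts to the factorization below. The input marginals $p_S(\mathbf{x})$ and $p_T(\mathbf{x})$ are notation assumed here for illustration:

$$
p_S(\mathbf{x}, y) = p_S(\mathbf{x})\, P(y \mid \mathbf{x}), \qquad
p_T(\mathbf{x}, y) = p_T(\mathbf{x})\, P(y \mid \mathbf{x}), \qquad
p_S(\mathbf{x}) \neq p_T(\mathbf{x}).
$$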

### Label Shift
*Label shift* describes the converse problem.
Here, we assume that the label marginal $P(y)$
can change
but the class-conditional distribution
$P(\mathbf{x} \mid y)$ remains fixed across domains.
Label shift is a reasonable assumption to make
when we believe that $y$ causes $\mathbf{x}$.

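Label shift: the overall distribution of labels changes between the training data and the test data, while the distribution of the input data given a particular label stays the same.
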
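
Written in terms of the joint distributions $p_S(\mathbf{x}, y)$ and $p_T(\mathbf{x}, y)$, the label shift assumption amounts to the factorization below. The label marginals $p_S(y)$ and $p_T(y)$ are notation assumed here for illustration:

$$
p_S(\mathbf{x}, y) = p_S(y)\, P(\mathbf{x} \mid y), \qquad
p_T(\mathbf{x}, y) = p_T(y)\, P(\mathbf{x} \mid y), \qquad
p_S(y) \neq p_T(y).
$$
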
### Concept Shift
*Concept shift* arises when the very definitions of labels can change.

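Concept shift: the case where the definitions of the labels or classes themselves change. It usually involves changes over time or across geographic locations that cause the meaning of certain words or concepts to shift.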


## 2.6.2 Correction of Distribution Shift

### Empirical Risk and Risk

A model is trained by iterating over the features and labels of the training set and adjusting its parameters to minimize the training loss.
This is called empirical risk minimization.
Ideally, we would minimize the risk, i.e., the expected loss over the entire population, but in practice we only have a sample from that population, so we minimize the empirical risk instead.
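
The two quantities can be written side by side. The symbols are assumptions made for illustration: $f$ is the model, $l$ the loss function, $p(\mathbf{x}, y)$ the true data distribution, and $(\mathbf{x}_i, y_i)$, $i = 1, \ldots, n$, the training examples.

$$
R(f) = E_{(\mathbf{x}, y) \sim p(\mathbf{x}, y)}\left[ l(f(\mathbf{x}), y) \right]
     = \int\!\!\int l(f(\mathbf{x}), y)\, p(\mathbf{x}, y)\, d\mathbf{x}\, dy
$$

$$
\hat{R}(f) = \frac{1}{n} \sum_{i=1}^{n} l(f(\mathbf{x}_i), y_i)
$$

The risk $R(f)$ cannot be computed because $p(\mathbf{x}, y)$ is unknown. The empirical risk $\hat{R}(f)$ replaces the expectation with an average over the sample.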

### Covariate Shift Correction
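
One common way to correct for covariate shift is importance weighting: each training example is reweighted by an estimate of the density ratio between the test and training input distributions, and the model is then fit by minimizing the weighted average loss. The ratio can be estimated by training a binary classifier to tell test inputs from training inputs. The sketch below is a minimal illustration under stated assumptions, not a definitive implementation: inputs are one-dimensional, the domain classifier is a logistic regression fit by plain gradient descent, and all function names, the learning rate, and the clipping constant are hypothetical choices.

```python
import math
import random


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def fit_domain_classifier(source_x, target_x, lr=0.5, epochs=300):
    """Fit 1-D logistic regression with z=0 for source and z=1 for target inputs.

    Returns (w, b) such that P(z=1 | x) = sigmoid(w * x + b).
    """
    data = [(x, 0.0) for x in source_x] + [(x, 1.0) for x in target_x]
    n = len(data)
    w, b = 0.0, 0.0
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, z in data:
            err = sigmoid(w * x + b) - z  # gradient of the log loss w.r.t. the logit
            grad_w += err * x
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


def importance_weights(xs, w, b, n_source, n_target, clip=10.0):
    """Estimate the target/source density ratio at each x, clipped at `clip`.

    The classifier odds exp(w * x + b) are rescaled by n_source / n_target
    to undo any imbalance between the two sample sizes.
    """
    scale = n_source / n_target
    return [min(math.exp(w * x + b) * scale, clip) for x in xs]


def weighted_empirical_risk(losses, weights):
    """Average of per-example losses, each scaled by its importance weight."""
    return sum(wt * loss for wt, loss in zip(weights, losses)) / len(losses)


if __name__ == "__main__":
    random.seed(0)
    # Synthetic data (an assumption for the demo): target inputs are shifted right.
    source = [random.gauss(0.0, 1.0) for _ in range(300)]
    target = [random.gauss(1.0, 1.0) for _ in range(300)]
    w, b = fit_domain_classifier(source, target)
    weights = importance_weights(source, w, b, len(source), len(target))
    print("largest weight:", max(weights), "smallest weight:", min(weights))
```

Training points that look like typical test inputs receive larger weights, so they count for more in the weighted loss. Clipping is one heuristic for keeping a few very large weights from dominating the average.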



## 2.6.3 A Taxonomy of Learning Problems

### Batch Learning

Train a model using a fixed dataset and then deploy it to make predictions on new data that comes from the same distribution. The model isn't updated after deployment.

### Online Learning
Data arrives sequentially. First, you see a new data point, make a prediction based on your current model, then observe the true label and adjust the model accordingly.
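
The predict, observe, update cycle can be sketched as a short loop. This is a minimal illustration under assumed choices: a one-dimensional linear model, squared loss, and a fixed learning rate. The function name `online_learning` and its parameters are hypothetical.

```python
def online_learning(stream, lr=0.05):
    """Run one pass over a stream of (x, y) pairs, updating after every example."""
    w, b = 0.0, 0.0
    losses = []
    for x, y in stream:
        y_hat = w * x + b               # 1. predict with the current model
        loss = 0.5 * (y_hat - y) ** 2   # 2. observe the true label and the loss
        losses.append(loss)
        grad = y_hat - y                # 3. take one gradient step on this example
        w -= lr * grad * x
        b -= lr * grad
    return w, b, losses


if __name__ == "__main__":
    import random

    rng = random.Random(0)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
    stream = ((x, 2.0 * x + 1.0) for x in xs)  # synthetic, noiseless labels
    w, b, losses = online_learning(stream)
    print("w:", w, "b:", b)
```

Each loss is recorded before the update, so the list of losses measures how well the model predicted data it had not yet seen.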

### Bandits
A special case of online learning in which, instead of learning a continuously parametrized function, we have a finite set of options (or "arms") to choose from, and we observe a reward only for the arm we pick. The goal is to maximize the total reward by learning which arms are best over time.
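
One simple strategy for this setting is epsilon-greedy: most of the time pull the arm with the best average reward so far, and occasionally pull a random arm to keep exploring. The sketch below is an illustration of that one strategy, not the only or best bandit algorithm. The function name, the `pull` callback, and the default `epsilon` are hypothetical.

```python
import random


def epsilon_greedy(pull, n_arms, n_rounds, epsilon=0.1, rng=None):
    """Play `n_rounds` of a bandit, where `pull(arm)` returns a numeric reward."""
    rng = rng or random.Random(0)
    counts = [0] * n_arms     # number of times each arm was pulled
    values = [0.0] * n_arms   # running mean reward of each arm
    total = 0.0
    for _ in range(n_rounds):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                        # explore
        else:
            arm = max(range(n_arms), key=lambda a: values[a])  # exploit
        reward = pull(arm)
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]    # incremental mean
        total += reward
    return values, counts, total


if __name__ == "__main__":
    env = random.Random(1)
    probs = [0.2, 0.5, 0.8]  # assumed success probability of each arm
    values, counts, total = epsilon_greedy(
        lambda a: 1.0 if env.random() < probs[a] else 0.0, len(probs), 5000
    )
    print("pull counts:", counts)
```

The value of `epsilon` trades off exploration against exploitation: a larger value finds good arms sooner but keeps spending pulls on arms already known to be poor.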


### Control
Managing a system where the environment remembers past actions. The system's subsequent states are influenced by prior actions.

### Reinforcement Learning
Operating in an environment with memory, where every action leads to a certain reward. The aim is to learn a strategy to maximize rewards over time. The environment can be cooperative or adversarial.

### Considering the Environment
Strategies that work in a static environment might not work in one that adapts. How and when the environment changes determines which strategies and algorithms we can use.


## 2.6.4 Summary

In many cases training and test sets do not come from the same distribution. This is called distribution shift. The risk is the expectation of the loss over the entire population of data drawn from their true distribution. However, this entire population is usually unavailable. Empirical risk is an average loss over the training data to approximate the risk. In practice, we perform empirical risk minimization.

Under the corresponding assumptions, covariate and label shift can be detected and corrected for at test time. Failure to account for this bias can become problematic at test time. In some cases, the environment may remember automated actions and respond in surprising ways. We must account for this possibility when building models and continue to monitor live systems, open to the possibility that our models and the environment will become entangled in unanticipated ways.
