# Part 1
We start from the 1-D Itô SDE
$$
dx_t = f(x_t,t)\,dt + g(x_t,t)\,dW_t,
$$
where \(W_t\) is a standard Brownian motion, and denote the density of \(x_t\) by \(p(x,t)\).

---

### 1. Fokker–Planck equation of the SDE

The density \(p\) satisfies the Fokker–Planck (forward Kolmogorov) equation
$$
\frac{\partial p}{\partial t}
= -\frac{\partial}{\partial x}\big(f(x,t)p(x,t)\big)
  +\frac12 \frac{\partial^2}{\partial x^2}\big(g^2(x,t)p(x,t)\big).
\tag{FP}
$$
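
As a quick numerical illustration of (FP), the sketch below simulates the SDE with the Euler–Maruyama scheme and compares the sample variance with the variance of the known Gaussian solution of (FP). The Ornstein–Uhlenbeck choice \(f=-\theta x\), \(g=\sigma\), the parameter values, and all function names are assumptions made for this example only.

```python
import numpy as np

def euler_maruyama(f, g, x0, t0, t1, n_steps, rng):
    """Simulate dx = f(x,t) dt + g(x,t) dW for an array of particles x0."""
    x = np.array(x0, dtype=float)
    dt = (t1 - t0) / n_steps
    t = t0
    for _ in range(n_steps):
        dw = rng.normal(0.0, np.sqrt(dt), size=x.shape)  # Brownian increment
        x = x + f(x, t) * dt + g(x, t) * dw
        t += dt
    return x

# Assumed example: Ornstein-Uhlenbeck process with x_0 ~ N(0, V0).
THETA, SIGMA, V0, T = 1.0, 0.8, 2.0, 1.0

def ou_variance(t):
    """Variance of the Gaussian solution of (FP): solves dv/dt = -2*THETA*v + SIGMA**2."""
    decay = np.exp(-2.0 * THETA * t)
    return V0 * decay + SIGMA**2 / (2.0 * THETA) * (1.0 - decay)

rng = np.random.default_rng(0)
x0 = rng.normal(0.0, np.sqrt(V0), size=100_000)
x_T = euler_maruyama(
    lambda x, t: -THETA * x,
    lambda x, t: SIGMA * np.ones_like(x),
    x0, 0.0, T, 500, rng,
)
print("empirical variance:", x_T.var())
print("variance from (FP):", ou_variance(T))
```

The two printed numbers should agree up to Monte Carlo and time-discretization error.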

---

### 2. Continuity equation of a deterministic ODE

Consider a deterministic ODE
$$
dx_t = u(x_t,t)\,dt.
$$
If \(x_0\) is random and the resulting solution \(x_t\) also has density \(p(x,t)\), then \(p\) must satisfy the
continuity equation
$$
\frac{\partial p}{\partial t}
= -\frac{\partial}{\partial x}\big(u(x,t)p(x,t)\big).
\tag{CE}
$$

---

### 3. Match the two PDEs

Require that (FP) and (CE) give the same \(\partial_t p\) for the same density \(p\), and equate their right-hand sides:

$$
-\partial_x(up)
=
-\partial_x(fp)
+ \frac12 \partial_x^2\big(g^2 p\big).
$$
Rearrange:
$$
\partial_x\big((u-f)p\big)
= -\frac12 \partial_x^2\big(g^2 p\big).
$$

Integrate with respect to \(x\). The integration constant is a function \(c(t)\) of time only;
assuming \(p\) and \(\partial_x(g^2 p)\) vanish as \(|x|\to\infty\) forces \(c(t)=0\):
$$
(u-f)p = -\frac12 \partial_x\big(g^2 p\big).
$$
Hence
$$
u(x,t) = f(x,t) - \frac{1}{2p(x,t)}\partial_x\big(g^2(x,t)p(x,t)\big).
$$

---

### 4. Expand and simplify

Expand the derivative:
$$
\partial_x\big(g^2 p\big)
= (\partial_x g^2)\,p + g^2\,\partial_x p.
$$
Therefore
$$
u = f - \frac12\partial_x g^2
    -\frac{g^2}{2}\frac{1}{p}\partial_x p.
$$
Since \(\frac{1}{p}\partial_x p = \partial_x \log p\), we obtain

$$
u(x,t)
= f(x,t)
- \frac12 \partial_x g^2(x,t)
- \frac{g^2(x,t)}{2}\,\partial_x \log p(x,t).
$$

Thus the corresponding probability flow ODE is

$$
dx_t =
\Big[
f(x_t,t)
- \frac12 \partial_x g^2(x_t,t)
- \frac{g^2(x_t,t)}{2}\,\partial_x \log p(x_t,t)
\Big]dt.
$$
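
As a sanity check, consider an Ornstein–Uhlenbeck example (an assumption chosen for illustration, not part of the derivation): \(f=-\theta x\), \(g=\sigma\) constant, and \(x_0\sim\mathcal N(0,v_0)\). Then \(p(\cdot,t)=\mathcal N(0,v(t))\) with \(v(t)=v_0e^{-2\theta t}+\frac{\sigma^2}{2\theta}(1-e^{-2\theta t})\), so \(\partial_x g^2=0\), \(\partial_x\log p=-x/v(t)\), and the formula reduces to
$$
u(x,t) = -\theta x + \frac{\sigma^2}{2v(t)}\,x .
$$
At stationarity, \(v=\sigma^2/(2\theta)\) and \(u\equiv 0\): the density no longer changes, so the deterministic flow stops moving particles. The sketch below integrates this ODE with Heun's method and checks that the particle variance follows \(v(t)\); the parameter values and function names are hypothetical.

```python
import numpy as np

THETA, SIGMA, V0, T = 1.0, 0.8, 2.0, 1.0  # assumed example values

def variance(t):
    """Variance v(t) of the Gaussian marginal of the OU example."""
    decay = np.exp(-2.0 * THETA * t)
    return V0 * decay + SIGMA**2 / (2.0 * THETA) * (1.0 - decay)

def score(x, t):
    """d/dx log p(x,t) for the marginal N(0, variance(t))."""
    return -x / variance(t)

def pf_drift(x, t):
    """u = f - 0.5 * d/dx(g^2) - 0.5 * g^2 * score; g is constant, so d/dx(g^2) = 0."""
    return -THETA * x - 0.5 * SIGMA**2 * score(x, t)

def integrate_ode(u, x0, t0, t1, n_steps):
    """Integrate dx/dt = u(x,t) with Heun's (explicit trapezoidal) method."""
    x = np.array(x0, dtype=float)
    dt = (t1 - t0) / n_steps
    t = t0
    for _ in range(n_steps):
        k1 = u(x, t)
        k2 = u(x + dt * k1, t + dt)
        x = x + 0.5 * dt * (k1 + k2)
        t += dt
    return x

rng = np.random.default_rng(0)
x0 = rng.normal(0.0, np.sqrt(V0), size=50_000)
x_T = integrate_ode(pf_drift, x0, 0.0, T, 200)
print("ODE particle variance:", x_T.var())
print("SDE marginal variance:", variance(T))
```

Because the ODE is linear here, each particle is simply rescaled by \(\sqrt{v(T)/v_0}\). No noise is injected after the initial draw, yet the marginal variance matches that of the SDE.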

# Part 2
## 1. Future AI Ability: Fully Autonomous, Accident-Free Driving in Dense Traffic

One important ability that AI cannot currently achieve, but may reasonably accomplish 20 years from now, is **fully autonomous, accident-free driving in dense and highly interactive traffic environments**. Unlike today’s semi-autonomous systems that depend strongly on human supervision, this future capability would allow an AI agent to independently plan, navigate, and make safety-critical decisions amidst unpredictable human drivers.

This ability has major societal significance. Road transportation affects millions of people daily, and most traffic accidents arise from human error. A highly reliable autonomous driving system could drastically reduce collisions, provide mobility for populations who cannot drive, and improve overall traffic efficiency. Achieving such performance requires far more than simple perception; the AI must **predict multi-agent behaviors**, reason under uncertainty, and plan safe trajectories several seconds into the future. These challenges highlight a level of integrated intelligence that current AI systems have not yet achieved.

---

## 2. Learning Paradigms Involved: Supervised Regression + Model-Based RL (via MPC)

Developing such an advanced driving capability requires combining multiple machine-learning paradigms.

First, low-level driving actions, such as **acceleration change** and **steering-angle change**, can be learned through **supervised regression**. Following the methodology of [Separating the World and Ego Models for Self-Driving](https://arxiv.org/abs/2204.07184), large-scale human driving datasets can be used to train a model that maps observed states to continuous control actions, providing human-like priors for basic driving behavior.

However, fully autonomous driving also involves **sequential decision-making under uncertainty**, which is the setting addressed by **reinforcement learning (RL)**. Model-free RL is impractical on real roads because it can require millions of interactions, which are slow, expensive, and potentially dangerous.  
To overcome these limitations, a **model-based RL approach implemented through Model Predictive Control (MPC)** is more suitable. Instead of interacting with the real world, the agent uses a **learned world model** to simulate future trajectories and chooses actions that minimize a cumulative cost. This preserves the core idea of RL, optimizing long-term outcomes, while avoiding unsafe real-world exploration.

Thus, the task naturally involves a hybrid framework:
- **Supervised Learning:** provides action regression and forward dynamics models.  
- **Model-Based RL via MPC:** performs multi-step simulated rollouts and selects actions that minimize the trajectory cost.  

This combination is appropriate because the task requires both large-scale data-driven modeling and safe long-horizon planning.
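
The receding-horizon loop described above can be sketched on a toy 1-D car-following problem. Everything in this block is an assumption made for illustration: the constant-speed `world_model_step` stands in for a learned world model, the cost weights are arbitrary, and random shooting (sampling candidate action sequences and keeping the cheapest) replaces any more sophisticated optimizer.

```python
import numpy as np

DT, HORIZON, N_CANDIDATES = 0.1, 20, 256  # hypothetical settings

def ego_step(pos, vel, accel):
    """Ego dynamics: simple point mass, speed clipped at zero."""
    return pos + vel * DT, np.maximum(vel + accel * DT, 0.0)

def world_model_step(lead_pos, lead_vel):
    """Stand-in for a learned world model: the lead car keeps constant speed."""
    return lead_pos + lead_vel * DT

def trajectory_cost(state, actions, target_speed=15.0, safe_gap=10.0):
    """Cumulative cost of N candidate action sequences; actions has shape (N, H)."""
    ego_pos, ego_vel, lead_pos, lead_vel = state
    n = actions.shape[0]
    pos = np.full(n, ego_pos, dtype=float)
    vel = np.full(n, ego_vel, dtype=float)
    lead = float(lead_pos)
    cost = np.zeros(n)
    for k in range(actions.shape[1]):
        pos, vel = ego_step(pos, vel, actions[:, k])
        lead = world_model_step(lead, lead_vel)
        cost += 100.0 * np.maximum(safe_gap - (lead - pos), 0.0) ** 2  # proximity
        cost += (vel - target_speed) ** 2                              # progress
        cost += 0.1 * actions[:, k] ** 2                               # comfort
    return cost

def mpc_action(state, rng):
    """Sample action sequences, simulate them, return the first action of the best."""
    actions = rng.uniform(-4.0, 2.0, size=(N_CANDIDATES, HORIZON))
    return actions[np.argmin(trajectory_cost(state, actions)), 0]

rng = np.random.default_rng(0)
state = (0.0, 15.0, 30.0, 10.0)  # ego pos, ego vel, lead pos, lead vel
min_gap = np.inf
for _ in range(100):
    a = mpc_action(state, rng)  # replan at every step (receding horizon)
    ego_pos, ego_vel = ego_step(state[0], state[1], a)
    lead_pos = world_model_step(state[2], state[3])
    state = (ego_pos, ego_vel, lead_pos, state[3])
    min_gap = min(min_gap, lead_pos - ego_pos)
print("smallest gap to lead car (m):", min_gap)
```

Only the first action of each plan is executed before replanning, which is what lets the controller correct for errors in the model's predictions.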

---

## 3. First-Step Model Problem: Implementing DFM-KM MPC

As a practical first step toward the long-term goal of fully autonomous driving, I propose implementing the **DFM-KM MPC** approach from *Separating the World and Ego Models for Self-Driving*. This simplified model captures the essential subproblem of **planning safe trajectories using a learned world model**.

### How this simplified task represents the ultimate ability
DFM-KM MPC separates ego dynamics and environmental dynamics:  
- The **ego vehicle** is predicted using a differentiable kinematic model.  
- The **environment** is predicted using a learned stochastic world model.  

This decomposition isolates a key component of full autonomy—understanding how one’s own actions propagate into future states while accounting for surrounding traffic. Although simplified, this setting still requires the agent to handle uncertainty, perform multi-step prediction, and optimize actions over a short planning horizon.
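
To make the ego half of this decomposition concrete, here is a minimal sketch of a kinematic ego update. It uses a standard bicycle-style model, which is an assumption and not necessarily the exact equations of the cited paper; `EgoState`, `kinematic_step`, `rollout`, and the `wheelbase` and `dt` values are hypothetical names and numbers.

```python
import math
from dataclasses import dataclass

@dataclass(frozen=True)
class EgoState:
    x: float        # position (m)
    y: float        # position (m)
    heading: float  # yaw angle (rad)
    speed: float    # forward speed (m/s)

def kinematic_step(s, accel, steer, dt=0.1, wheelbase=2.7):
    """One explicit-Euler step of a kinematic bicycle model."""
    x = s.x + s.speed * math.cos(s.heading) * dt
    y = s.y + s.speed * math.sin(s.heading) * dt
    heading = s.heading + s.speed / wheelbase * math.tan(steer) * dt
    speed = max(s.speed + accel * dt, 0.0)  # no reversing in this sketch
    return EgoState(x, y, heading, speed)

def rollout(s, actions, dt=0.1):
    """Apply a sequence of (accel, steer) actions; return all visited states."""
    states = [s]
    for accel, steer in actions:
        s = kinematic_step(s, accel, steer, dt)
        states.append(s)
    return states

straight = rollout(EgoState(0.0, 0.0, 0.0, 10.0), [(0.0, 0.0)] * 10)
left_turn = rollout(EgoState(0.0, 0.0, 0.0, 10.0), [(0.0, 0.1)] * 10)
print(straight[-1])
print(left_turn[-1])
```

Every operation here is a smooth function of the action apart from the speed clip, so the same update written in an automatic-differentiation framework would let gradients of a trajectory cost flow back to the actions.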

### Testability of the model problem
The problem is highly testable because:
- Ego-state updates are analytically differentiable.  
- The environment model provides predicted rasterized states.  
- The cost function \( C^{km} \) is implemented with differentiable masks, enabling interpretable visualization.  
- Success can be measured using collision rates, smoothness, or minimized cumulative cost.  

These properties make it straightforward to evaluate whether the model is learning the correct driving behavior.
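
The evaluation measures listed above could be computed along the following lines. This is a sketch under assumed conventions: one boolean collision flag per episode, actions stored as a `(T, action_dim)` array, and smoothness defined as the mean squared change between consecutive actions. The function names are hypothetical.

```python
import numpy as np

def collision_rate(collided):
    """Fraction of episodes that ended in a collision."""
    return float(np.mean(np.asarray(collided, dtype=bool)))

def smoothness(actions):
    """Mean squared change between consecutive actions (lower is smoother)."""
    a = np.asarray(actions, dtype=float)
    if len(a) < 2:
        return 0.0
    return float(np.mean(np.sum(np.diff(a, axis=0) ** 2, axis=1)))

def cumulative_cost(step_costs):
    """Total cost accumulated over one episode."""
    return float(np.sum(step_costs))

episodes_collided = [True, False, False, False]
episode_actions = [[0.0, 0.0], [1.0, 0.0], [1.0, 2.0]]  # (accel, steer) per step
print(collision_rate(episodes_collided))
print(smoothness(episode_actions))
print(cumulative_cost([1.0, 2.0, 3.0]))
```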

### Required mathematical and ML tools
Solving this model problem requires:
- Supervised regression for action prediction or forward models.  
- A differentiable kinematic ego-model.  
- A learned world model \( f_\theta^{env} \).  
- Gradient-based trajectory optimization using **Model Predictive Control (MPC)**.  
- Reinforcement-learning principles for multi-step cost minimization in simulation.  

Together, these components form a feasible yet meaningful first research step toward the long-term goal of safe, fully autonomous driving.
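
The gradient-based trajectory optimization in this list can be illustrated on a deliberately tiny problem: choose accelerations over a short horizon so that speed tracks a target. This is a sketch under stated assumptions, not the paper's implementation. Central finite differences stand in for automatic differentiation, and the dynamics, cost weights, and names are hypothetical.

```python
import numpy as np

DT, HORIZON = 0.1, 15  # hypothetical settings

def rollout_cost(actions, vel0=5.0, target_vel=8.0):
    """Simulate the speed forward and accumulate tracking and effort costs."""
    vel, cost = vel0, 0.0
    for a in actions:
        vel = vel + a * DT
        cost += (vel - target_vel) ** 2 + 0.1 * a ** 2
    return cost

def numerical_grad(fn, x, eps=1e-5):
    """Central finite-difference gradient (stand-in for autodiff)."""
    grad = np.zeros_like(x)
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = eps
        grad[i] = (fn(x + step) - fn(x - step)) / (2.0 * eps)
    return grad

def optimize_actions(n_iters=200, lr=0.5):
    """Plain gradient descent on the action sequence."""
    actions = np.zeros(HORIZON)
    for _ in range(n_iters):
        actions = actions - lr * numerical_grad(rollout_cost, actions)
    return actions

initial_cost = rollout_cost(np.zeros(HORIZON))
best_actions = optimize_actions()
final_cost = rollout_cost(best_actions)
print("cost before:", initial_cost)
print("cost after: ", final_cost)
```

In an MPC loop, only `best_actions[0]` would be executed before the optimization is repeated from the next observed state.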

---

# Part 3
If every stochastic diffusion process has a corresponding probability flow ODE that preserves its marginal distributions, does this mean randomness is fundamentally unnecessary for generative modeling?