## From Maxwell's Equations to Scalar Diffraction Theory to Fourier Optics

### Maxwell's Equations

Maxwell's equations in differential form are:

\begin{align}
\nabla \times \vec{E} &= \frac{-\partial \vec{B}}{\partial t} &\mbox{Faraday's Law} \\
\nabla \times \vec{B} &= \mu \mathbf{J} + \mu \epsilon \frac{\partial \vec{E}}{\partial t} &\mbox{Ampere's Law}\\
\nabla \cdot \vec{E} &= \frac{1}{\epsilon} \rho  & \mbox{Gauss's Law}\\
\nabla \cdot \vec{B} &= 0 & \mbox{Gauss's Law for Magnetism}\\
\end{align}

where $\vec{E} \; [V/m]$ is the electric field, $\vec{B} \; [N/A\cdot m]$ is the magnetic field, $\rho \; [C/m^3]$ is the electric charge density, $\mathbf{J} \; [A/m^2]$ is the current density, $\epsilon \; [F/m]$ is the permittivity, and $\mu \; [H/m]$ is the permeability. Here, $(\times)$ is the vector cross product, $(\cdot)$ is the vector dot product, and $\nabla = \frac{\partial}{\partial x}\hat{x} + \frac{\partial}{\partial y}\hat{y} + \frac{\partial}{\partial z}\hat{z}$ where $\hat{x}$, $\hat{y}$, $\hat{z}$ are unit vectors in the $x$, $y$, and $z$ directions respectively. The electric and magnetic fields are related to the electric displacement ($\vec{D} \; [C/m^2]$) and magnetic intensity ($\vec{H} \; [A/m]$) through what are called the constitutive relations:

\begin{align}
\vec{B} &= \mu \vec{H} \\
\vec{D} &= \epsilon \vec{E}.
\end{align}

From this point forward, unless specifically mentioned, $\mu = \mu_r \cdot \mu_0$ and $\epsilon = \epsilon_r \cdot \epsilon_0$, where $\mu_r$ is the relative permeability of the medium of propagation, $\mu_0$ is the permeability of free space, $\epsilon_r$ is the relative permittivity of the medium of propagation, and $\epsilon_0$ is the permittivity of free space. In most cases $\mu_r = 1$ and therefore $\mu = \mu_0$. It's common to write Maxwell's equations in terms of $\vec{E}$ and $\vec{H}$ due to the similarity of units, i.e., $[V/m]$ and $[A/m]$ respectively. In addition, we are interested in free-space propagation of these fields, which means we can assume $\rho = 0$ and $\mathbf{J} = 0$. With these considerations, we can rewrite Maxwell's equations as

\begin{align}
\nabla \times \vec{E} &= -\mu_0  \frac{\partial \vec{H}}{\partial t}\\
\nabla \times \vec{H} &= \epsilon \frac{\partial \vec{E}}{\partial t}\\
\nabla \cdot \vec{E} &= 0\\
\nabla \cdot \mu_0\vec{H} &= 0\\
\end{align}

which form a system of coupled first-order partial differential equations (PDEs). This is the starting point for our scalar wave equation derivation.

### Scalar Wave Equation

The goal of deriving the scalar wave equation is two-fold. First, we would like to decouple the system of PDEs. Second, we would like to remove the vector nature of the equations. 

#### 1) Decoupling the PDEs
Before we can decouple the PDEs, we need to make some broad assumptions about our medium of propagation. Without further justification, we assume the wave is propagating in a dielectric medium that is linear, isotropic, homogeneous, nondispersive, and nonmagnetic. While we won't get into the consequences of these assumptions here, rest assured that they are valid for free-space propagation.

To decouple the PDEs we take the curl of Faraday's law and simplify with the vector identity $\nabla \times (\nabla \times \vec{E}) = \nabla(\nabla \cdot \vec{E}) - \nabla^2 \vec{E}$, Ampere's law, and Gauss's law assuming no charge density: 
\begin{align}
\nabla \times (\nabla \times \vec{E}) &= \nabla \times \left(-\mu \frac{\partial \vec{H}}{\partial t} \right) \\
\nabla(\nabla \cdot \vec{E}) - \nabla^2 \vec{E} &= -\mu \frac{\partial}{\partial t} \left(\nabla \times \vec{H} \right) \\
-\nabla^2 \vec{E} &= -\mu \frac{\partial}{\partial t} \left(\epsilon \frac{\partial \vec{E}}{\partial t}\right) \\
\nabla^2 \vec{E} -\mu \epsilon \left( \frac{\partial^2 \vec{E}}{\partial t^2}\right) &= 0  \\
\end{align}
which is the vector wave equation. Notice that we started with a system of coupled first-order partial differential equations and have now arrived at a single second-order, uncoupled partial differential equation. There is one final simplification that is typical in most textbooks. We first define the index of refraction $n = \sqrt{\frac{\epsilon}{\epsilon_0}}$ and the speed of light in vacuum $c = \frac{1}{\sqrt{\mu_0 \epsilon_0}}$. Since $\mu = \mu_0$ for a nonmagnetic medium, $\mu \epsilon = n^2 / c^2$, and we arrive at the 'standard' vector wave equation
\begin{equation}
\nabla^2 \vec{E} - \frac{n^2}{c^2} \left( \frac{\partial^2 \vec{E}}{\partial t^2}\right) = 0.
\end{equation}
The magnetic field satisfies an identical equation
\begin{equation}
\nabla^2 \vec{H} - \frac{n^2}{c^2} \left( \frac{\partial^2 \vec{H}}{\partial t^2}\right) = 0.
\end{equation}
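
As a quick numerical sanity check on the definitions of $c$ and $n$, the following sketch evaluates $c = 1/\sqrt{\mu_0 \epsilon_0}$ and $n = \sqrt{\epsilon / \epsilon_0}$ in Python. The constants are CODATA 2018 values typed in by hand, the relative permittivity of 2.25 is an assumed, glass-like value chosen only for illustration, and the function names are our own.

```python
import math

# Vacuum constants (CODATA 2018 values, entered by hand as inputs).
MU_0 = 1.25663706212e-6       # permeability of free space [H/m]
EPSILON_0 = 8.8541878128e-12  # permittivity of free space [F/m]


def speed_of_light():
    """c = 1 / sqrt(mu_0 * epsilon_0)."""
    return 1.0 / math.sqrt(MU_0 * EPSILON_0)


def refractive_index(epsilon_r):
    """n = sqrt(epsilon / epsilon_0) = sqrt(epsilon_r) for a nonmagnetic medium."""
    return math.sqrt(epsilon_r)


def phase_velocity(epsilon_r):
    """Speed c / n implied by the n^2 / c^2 coefficient in the wave equation."""
    return speed_of_light() / refractive_index(epsilon_r)


c = speed_of_light()
n_glass = refractive_index(2.25)  # epsilon_r = 2.25 is an assumed example value
v_glass = phase_velocity(2.25)
print(f"c = {c:.6e} m/s, n = {n_glass:.2f}, v = {v_glass:.6e} m/s")
```

The coefficient $n^2/c^2$ in the wave equation is the inverse square of this phase velocity, which is why a larger index corresponds to slower propagation.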

#### 2) Removing the Vectorial Dependence
Both the electric field and magnetic field can be decomposed into their Cartesian components, i.e., $\vec{E} = E_x\hat{x} + E_y\hat{y} + E_z\hat{z}$ and $\vec{H} = H_x\hat{x} + H_y\hat{y} + H_z\hat{z}$. Because the unit vectors $\hat{x}$, $\hat{y}$, $\hat{z}$ are constant, the Laplacian acts on each component separately, so each component must satisfy the same wave equation, e.g.,

\begin{equation}
\nabla^2 E_x - \frac{n^2}{c^2} \left( \frac{\partial^2 E_x}{\partial t^2}\right) = 0
\end{equation}
and similarly for $E_y$, $E_z$, $H_x$, $H_y$, and $H_z$. We can therefore summarize the behavior of all the components through a single scalar wave equation,
\begin{equation}
\nabla^2 u(P,t) - \frac{n^2}{c^2} \left( \frac{\partial^2 u(P,t)}{\partial t^2}\right) = 0,
\end{equation}
where $u(P,t)$ represents any of the scalar field components, and we've explicitly introduced the dependence of $u$ on both position $P$ and time $t$.
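
To make the scalar wave equation concrete, here is a minimal sketch that checks numerically that a one-dimensional plane wave $u(z,t) = \cos(kz - \omega t)$ with $\omega = kc/n$ satisfies it. The plane-wave form, the central-difference approximation, and all parameter values are our own illustrative assumptions rather than part of the derivation.

```python
import math

C = 299_792_458.0  # speed of light in vacuum [m/s]


def plane_wave(z, t, wavelength, n):
    """u(z, t) = cos(k z - omega t), with k = 2 pi / wavelength and omega = k c / n."""
    k = 2.0 * math.pi / wavelength
    omega = k * C / n
    return math.cos(k * z - omega * t)


def wave_equation_residual(z, t, wavelength, n, steps_per_cycle=1000):
    """Central-difference estimate of d2u/dz2 - (n/c)^2 d2u/dt2, divided by k^2."""
    k = 2.0 * math.pi / wavelength
    dz = wavelength / steps_per_cycle  # spatial step: a small fraction of a wavelength
    dt = dz * n / C                    # temporal step: the same fraction of a period
    u0 = plane_wave(z, t, wavelength, n)
    d2u_dz2 = (plane_wave(z + dz, t, wavelength, n) - 2.0 * u0
               + plane_wave(z - dz, t, wavelength, n)) / dz**2
    d2u_dt2 = (plane_wave(z, t + dt, wavelength, n) - 2.0 * u0
               + plane_wave(z, t - dt, wavelength, n)) / dt**2
    return (d2u_dz2 - (n / C) ** 2 * d2u_dt2) / k**2


# Assumed example: 500 nm wavelength in a medium with n = 1.5.
residual = wave_equation_residual(z=1.0e-6, t=2.0e-15, wavelength=500e-9, n=1.5)
print(f"normalized residual: {residual:.2e}")
```

Each second derivative on its own is of order one after dividing by $k^2$, so a residual many orders of magnitude smaller indicates that the two terms cancel as the wave equation requires.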

### Helmholtz Equation

As we derived above, let the light disturbance at position $P$ and time $t$ be represented by the scalar function $u(P,t)$. For a monochromatic (single-wavelength) wave, the scalar field may be written explicitly as

\begin{equation}
u(P,t) = A(P)\cos[2\pi \nu t - \phi(P)]
\end{equation}
where $A(P)$ is the amplitude of the wave at position $P$, $\phi(P)$ is the phase of the wave at position $P$, and $\nu$ is the optical frequency. We can rewrite this in complex notation,

\begin{equation}
u(P,t) = \Re \left \{ U(P)\exp(-j 2 \pi \nu t) \right \}
\end{equation}
where $\Re \{ \cdot \}$ signifies "real part of", and $U(P) = A(P)\exp[j\phi(P)]$ is a complex function of position sometimes called a phasor. Because the time dependence of the disturbance $u(P,t)$ is known a priori, $U(P)$ is an adequate description of the disturbance. Substituting $U(P)$ into our scalar wave equation, it follows that $U$ must obey the time-independent equation

\begin{equation}
(\nabla^2 + k^2)U = 0
\end{equation}
where $k = 2\pi / \lambda$ is called the wavenumber and $\lambda$ is the wavelength in the medium of propagation. This relation is known as the Helmholtz equation; the complex amplitude of any monochromatic optical disturbance propagating in vacuum or in a homogeneous dielectric medium must obey it.
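
The substitution step can be sketched explicitly. Inserting the complex form $U(P)\exp(-j 2\pi\nu t)$ into the scalar wave equation, each time derivative contributes a factor $-j2\pi\nu$, so that

\begin{align}
\nabla^2\left[U(P)e^{-j2\pi\nu t}\right] - \frac{n^2}{c^2}\frac{\partial^2}{\partial t^2}\left[U(P)e^{-j2\pi\nu t}\right] &= 0 \\
\left[\nabla^2 U(P) + \frac{n^2 (2\pi\nu)^2}{c^2} U(P)\right] e^{-j2\pi\nu t} &= 0 \\
\left(\nabla^2 + k^2\right)U(P) &= 0, \qquad k = \frac{2\pi n \nu}{c} = \frac{2\pi}{\lambda},
\end{align}

where the last line uses $\lambda = c/(n\nu)$ for the wavelength in the medium. Because the wave equation is linear with real coefficients, the real part of any complex solution is also a solution, which is why we may work with the complex amplitude throughout.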

### Separable Solution to the Helmholtz Equation

We start by looking for solutions of the Helmholtz equation that are separable, i.e., of the form
\begin{equation}
u_s(x,y,z) = f_x(x) \times f_y(y) \times f_z(z).
\end{equation}
The Laplacian of this separable solution can be written as
\begin{equation}
\nabla^2 u_s = \frac{\partial^2 u_s}{\partial x^2} + \frac{\partial^2 u_s}{\partial y^2} + \frac{\partial^2 u_s}{\partial z^2}.
\end{equation}
After applying the derivatives, substituting into the Helmholtz equation, and rearranging terms we arrive at the following:
\begin{equation}
\frac{f^{\prime \prime}_x(x)}{f_x(x)} + \frac{f^{\prime \prime}_y(y)}{f_y(y)} + \frac{f^{\prime \prime}_z(z)}{f_z(z)} + k^2 = 0.
\end{equation}
It follows from standard PDE analysis that each quotient in the above equation must be a constant: each one depends on a different variable, so the three can sum to the constant $-k^2$ for all $x$, $y$, and $z$ only if each is itself constant. We denote these constants as $-k_x^2$, $-k_y^2$, and $-k_z^2$ respectively. We now arrive at equations for each of our components along with one separation condition:

\begin{align}
\frac{d^2}{dx^2}f_x(x) & + k^2_xf_x(x) = 0 \\
\frac{d^2}{dy^2}f_y(y) & + k^2_yf_y(y) = 0 \\
\frac{d^2}{dz^2}f_z(z) & + k^2_zf_z(z) = 0 \\
k_x^2 + k_y^2 &+ k_z^2 = k^2. \\
\end{align}

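Each of these equations has a solution in the form of complex exponentials, and the separation condition fixes $k_z = \pm\sqrt{k^2 - k_x^2 - k_y^2}$. Writing $i$ for the imaginary unit (denoted $j$ above) and combining back into our separable solution, we have

\begin{align}
u_s(x,y,z) &= Ae^{ik_xx}e^{ik_yy}e^{ik_zz} \\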
&= Ae^{i(k_xx + k_yy)}e^{\pm iz \sqrt{k^2 - k_x^2 - k_y^2}}.
\end{align}
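
The role of the square root in the separable solution can be illustrated with a short sketch. It evaluates $k_z = \sqrt{k^2 - k_x^2 - k_y^2}$ with a complex square root and takes the $+$ sign in the exponent; when $k_x^2 + k_y^2 > k^2$ the result is imaginary, so the solution decays with $z$ instead of oscillating. The function names and the numerical values are assumptions chosen for illustration.

```python
import cmath
import math


def longitudinal_wavenumber(kx, ky, wavelength):
    """Return k_z = sqrt(k^2 - kx^2 - ky^2); imaginary when kx^2 + ky^2 > k^2."""
    k = 2.0 * math.pi / wavelength
    return cmath.sqrt(complex(k**2 - kx**2 - ky**2))


def separable_solution(x, y, z, kx, ky, wavelength, amplitude=1.0):
    """Evaluate A exp[i(kx x + ky y)] exp(+i k_z z), i.e. the '+' sign choice."""
    kz = longitudinal_wavenumber(kx, ky, wavelength)
    return amplitude * cmath.exp(1j * (kx * x + ky * y)) * cmath.exp(1j * kz * z)


wavelength = 500e-9                 # assumed example wavelength [m]
k = 2.0 * math.pi / wavelength
z = 1.0e-6                          # assumed propagation distance [m]

# kx < k: real k_z, so the magnitude stays at the amplitude A = 1.
u_propagating = separable_solution(0.0, 0.0, z, 0.5 * k, 0.0, wavelength)
# kx > k: imaginary k_z, so the magnitude decays exponentially with z.
u_decaying = separable_solution(0.0, 0.0, z, 2.0 * k, 0.0, wavelength)
print(abs(u_propagating), abs(u_decaying))
```

Solutions of the second kind are usually called evanescent waves; they satisfy the Helmholtz equation but carry negligible amplitude beyond a few wavelengths from $z = 0$.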

To get the general solution, we must take a linear combination of all separable solutions. As there are infinitely many, one for each pair $(k_x, k_y)$, the sum becomes an integral:
\begin{align}
u(x,y,z) &= \iint^{\infty}_{-\infty} u_s(x,y,z,k_x,k_y)dk_xdk_y \\
&= \iint^{\infty}_{-\infty} A(k_x, k_y)e^{i(k_xx + k_yy)}e^{\pm iz \sqrt{k^2 - k_x^2 - k_y^2}} dk_xdk_y.
\end{align}

Setting $z = 0$, we make the following observation

\begin{equation}
u(x,y,0) = \iint^{\infty}_{-\infty} A(k_x, k_y)e^{i(k_xx + k_yy)} dk_xdk_y
\end{equation}

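which implies that $A(k_x,k_y)$ is the Fourier transform of the field in the plane $z = 0$ (up to a constant normalization factor that depends on the Fourier transform convention), i.e.,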

\begin{equation}
A(k_x, k_y) = \mathcal{F}\left\{ u(x,y,0) \right\}.
\end{equation}

With this result in mind, we return to our original expression
\begin{equation}
u(x,y,z) = \iint^{\infty}_{-\infty} A(k_x, k_y)e^{i(k_xx + k_yy)}e^{\pm iz \sqrt{k^2 - k_x^2 - k_y^2}} dk_xdk_y
\end{equation}
which we now realize to be an inverse Fourier transform of $A(k_x, k_y)e^{\pm iz \sqrt{k^2 - k_x^2 - k_y^2}}$, which we can write explicitly
\begin{equation}
u(x,y,z) = \mathcal{F}^{-1} \left \{ A(k_x, k_y)e^{\pm iz \sqrt{k^2 - k_x^2 - k_y^2}} \right \}.
\end{equation}
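
The final expression translates almost line by line into a discrete computation. Below is a minimal sketch using NumPy's FFT as a stand-in for the continuous Fourier transform. It assumes a square sampling grid with spacing `dx`, takes the $+$ sign in the exponent (propagation toward $+z$), and inherits the periodic boundary conditions of the FFT; the function name `angular_spectrum_propagate` and all numerical values are our own illustrative choices.

```python
import numpy as np


def angular_spectrum_propagate(u0, wavelength, dx, z):
    """Propagate a sampled complex field u0(x, y, 0) to the plane at distance z.

    u0 is a 2D array sampled with spacing dx [m] along both axes.
    """
    ny, nx = u0.shape
    k = 2.0 * np.pi / wavelength
    # Angular spatial frequencies k_x, k_y in the ordering used by np.fft.
    kx = 2.0 * np.pi * np.fft.fftfreq(nx, d=dx)
    ky = 2.0 * np.pi * np.fft.fftfreq(ny, d=dx)
    kxx, kyy = np.meshgrid(kx, ky)
    # Complex square root: real for propagating components, imaginary otherwise.
    kz = np.sqrt((k**2 - kxx**2 - kyy**2).astype(complex))
    transfer = np.exp(1j * kz * z)   # propagation transfer function
    spectrum = np.fft.fft2(u0)       # discrete counterpart of A(k_x, k_y)
    return np.fft.ifft2(spectrum * transfer)


# Usage: propagate a Gaussian spot (all sizes here are illustrative assumptions).
n_samples, dx, wavelength = 128, 1.0e-6, 500e-9
coords = (np.arange(n_samples) - n_samples // 2) * dx
xx, yy = np.meshgrid(coords, coords)
u0 = np.exp(-(xx**2 + yy**2) / (10e-6) ** 2).astype(complex)
u_z = angular_spectrum_propagate(u0, wavelength, dx, z=100e-6)

# With this sampling every sampled frequency is propagating (|transfer| = 1),
# so the total power in the two planes should agree.
power_in = np.sum(np.abs(u0) ** 2)
power_out = np.sum(np.abs(u_z) ** 2)
print(power_in, power_out)
```

NumPy's `ifft2` uses the $e^{+i(\cdot)}$ kernel, which matches the sign of the exponent in the integral for $u(x,y,z)$, so `fft2` plays the role of $\mathcal{F}$ and `ifft2` the role of $\mathcal{F}^{-1}$. Because the FFT treats the field as periodic, light that spreads beyond the grid wraps around; in practice the grid is usually padded to keep this from contaminating the result.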

### Concluding Thoughts

The above derivations form the basis (read: starting point) for Fourier optics. They state rather concisely that the scalar field at a distance $z$ is simply the inverse Fourier transform of the product of the Fourier transform of the field at $z=0$ and a propagation transfer function. Fourier optics combines electromagnetics, sampling theory, and signal processing into a functional description of the wave nature of light.