# Chapter 2: Transmon Qubits in Cavity QED

Superconducting qubits are promising physical realizations of quantum bits. These devices are electrical circuits made of superconducting elements (capacitors, inductors, and Josephson junctions) with quantized energy levels and very low dissipation. Interestingly, the physics of such circuits closely parallels that of **cavity quantum electrodynamics (cavity QED)**, which deals with atoms interacting with quantized electromagnetic modes in a cavity. In this chapter, we introduce the basic cavity QED model (the Jaynes-Cummings model) and then describe how it is implemented in superconducting circuits (circuit QED) with a focus on the transmon qubit. We will see how the transmon's low anharmonicity necessitates treating higher energy levels and how coupling a transmon to a resonator leads to important effects like **dispersive shifts**, which we will derive perturbatively.

## 2.1 Cavity QED and the Jaynes–Cummings Model

**Cavity QED** studies the interaction between atoms and the electromagnetic field inside a reflective cavity. A cavity resonator supports discrete electromagnetic modes (standing waves), so that an atom inside interacts predominantly with these quantized modes of specific resonant frequencies. Photons emitted by the atom are confined in the cavity and can be reabsorbed, leading to coherent oscillations (Rabi oscillations) between atomic states and photon states.

Mathematically, a single mode of the cavity field with frequency $\omega_c$ is described as a quantum harmonic oscillator. Its Hamiltonian is:

$$
H_{\text{field}} = \omega_c \left( a^\dagger a + \frac{1}{2} \right) \,,
$$

where $a^\dagger$ and $a$ are the photon creation and annihilation operators for the mode, satisfying $[a, a^\dagger] = 1$. We will often drop the constant vacuum energy term $\frac{1}{2}\omega_c$, since it only adds an irrelevant global energy offset.

Now, consider an atom with two energy levels (a qubit-like two-level atom) interacting with this single cavity mode. Let $|0\rangle$ denote the atomic ground state and $|1\rangle$ the excited state, with transition frequency $\omega_a$. The Hamiltonian of the isolated two-level atom can be written as:

$$
H_{\text{atom}} = -\frac{\omega_a}{2}\,\sigma_z \,,
$$

where $\sigma_z = |0\rangle\langle 0| - |1\rangle\langle 1|$ is the Pauli $Z$ operator for the two-level system (with eigenvalues $+1$ on $|0\rangle$ and $-1$ on $|1\rangle$). The atom’s ground state has energy $-\omega_a/2$ and the excited state $+\omega_a/2$ under this Hamiltonian, so that $\omega_a$ indeed represents the energy splitting between $|1\rangle$ and $|0\rangle$.

If the atom is coupled to the cavity field mode, the total Hamiltonian is often modeled by the **Jaynes–Cummings (JC) Hamiltonian**, which includes a dipole-like interaction term. The full system Hamiltonian (prior to making any approximations) can be written as:

$$
H = H_{\text{field}} + H_{\text{atom}} + H_{\text{int}} 
  = \omega_c\,a^\dagger a \;-\; \frac{\omega_a}{2}\,\sigma_z \;+\; g\,(a^\dagger + a)\,(\sigma_+ + \sigma_-)\,,
$$

where $g$ is the atom–field coupling strength. Here $\sigma_+ = |1\rangle\langle 0|$ and $\sigma_- = |0\rangle\langle 1|$ are the atomic raising and lowering operators, respectively. Expanding $(a^\dagger + a)(\sigma_+ + \sigma_-)$ gives four terms. Two of them exchange one excitation between the atom and the field: the $a^\dagger \sigma_-$ term corresponds to the atom de-exciting ($|1\rangle \to |0\rangle$) and emitting a photon into the field mode (photon creation $a^\dagger$), while the $a\,\sigma_+$ term corresponds to the atom being excited ($|0\rangle \to |1\rangle$) by absorbing a photon from the field (photon annihilation $a$). The other two, $a^\dagger \sigma_+$ and $a\,\sigma_-$, create or destroy two excitations at once and do not conserve the total excitation number.

It is often useful to move into a rotating reference frame at the atomic and cavity frequencies to simplify the interaction. We perform a unitary transformation to the interaction picture with $U(t) = \exp\bigl[i(H_{\text{field}} + H_{\text{atom}})t\bigr]$.

First we split the Hamiltonian:

$$
H \;=\;H_{\text{field}}+H_{\text{atom}}+H_{\text{int}}
      \;=\; \underbrace{\Bigl[\,
           \omega_c\,a^{\dagger}a-\frac{\omega_a}{2}\sigma^{z}\Bigr]}_{H^{0}}
        \;+\; \underbrace{g\,(a^{\dagger}+a)(\sigma^{+}+\sigma^{-})}_{H_{\text{int}}}.
$$

We will treat
$H^{0}=H_{\text{field}}+H_{\text{atom}}$ as the “free” part and
$H_{\text{int}}$ as the perturbation.

Then, take

$$
U(t)=e^{\,iH^{0}t},\qquad 
H^{0}=\omega_c a^{\dagger}a-\frac{\omega_a}{2}\sigma^{z}.
$$

Because $H^{0}$ is time-independent we have

$$
\dot U(t)=iH^{0}U(t),\quad i\dot U\,U^{\dagger}=-H^{0}.
$$


For *any* unitary $U(t)$

$$
H_{I}(t)=UHU^{\dagger}+i\dot U\,U^{\dagger}.
$$



To understand the formula above, consider the Schrödinger-picture state $\,|\psi_S(t)\rangle$ which obeys

$$
i\,\frac{d}{dt}|\psi_S(t)\rangle = H\,|\psi_S(t)\rangle,
\tag{1}
$$

with (possibly time-dependent) Hamiltonian $H$ and $\hbar=1$. Choose a unitary operator $U(t)$ (invertible, $U^\dagger U = UU^\dagger = \mathbb{I}$).
Define the *interaction-picture* state

$$
|\psi_I(t)\rangle \;\equiv\; U(t)\,|\psi_S(t)\rangle.
\tag{2}
$$

Because $U$ is unitary, the map $|\psi_S\rangle \mapsto |\psi_I\rangle$ is norm-preserving. Differentiate (2):

$$
\frac{d}{dt}|\psi_I(t)\rangle
      = \dot U(t)\,|\psi_S(t)\rangle
      + U(t)\,\frac{d}{dt}|\psi_S(t)\rangle,
\tag{3}
$$

where the dot denotes $d/dt$. Insert (1) for $\frac{d}{dt}|\psi_S\rangle$:

$$
\frac{d}{dt}|\psi_I(t)\rangle
      = \dot U\,|\psi_S\rangle
      - i\,U\,H\,|\psi_S\rangle.
\tag{4}
$$

Now eliminate $|\psi_S\rangle$ in favour of $|\psi_I\rangle$ using (2):
$ |\psi_S\rangle = U^{\dagger}|\psi_I\rangle.$

Plug in:

$$
\frac{d}{dt}|\psi_I(t)\rangle
      = \dot U\,U^{\dagger}\,|\psi_I\rangle
      - i\,U\,H\,U^{\dagger}\,|\psi_I\rangle.
\tag{5}
$$


Multiply (5) by $i$ and rearrange into Schrödinger form:

$$
i\,\frac{d}{dt}|\psi_I(t)\rangle
      = \bigl[\,U\,H\,U^{\dagger}
              + i\,\dot U\,U^{\dagger}\bigr]\,|\psi_I\rangle.
\tag{6}
$$

Hence the *interaction-picture Hamiltonian* is

$$
\boxed{\,H_I(t)=U(t)\,H\,U^{\dagger}(t)+i\,\dot U(t)\,U^{\dagger}(t)\,}.
$$


If $H=H_0+H_{\text{int}}$ and one chooses

$$
U(t)=e^{iH_0t},
$$

then

$$
\dot U\,U^{\dagger}=iH_0,
\qquad
U\,H\,U^{\dagger}=H_0 + e^{iH_0t}H_{\text{int}}e^{-iH_0t},
$$

so

$$
H_I(t)=H_0 + H_{\text{int},I}(t) - H_0
      = H_{\text{int},I}(t),
$$

with $H_{\text{int},I}(t)\equiv e^{iH_0t}H_{\text{int}}e^{-iH_0t}$.
The $H_0$ terms cancel—the familiar result that the interaction picture removes the “free” evolution generated by $H_0$ from the Hamiltonian seen by the state.

With the choice above,

$$
H_{I}(t)=U(H^{0}+H_{\text{int}})U^{\dagger}-H^{0}
        =UH_{\text{int}}U^{\dagger},
$$

so only the interaction term needs to be transformed. Use the Baker–Campbell–Hausdorff identity together with the commutators
$[a,a^{\dagger}]=1$ and $[\sigma^{z},\sigma^{\pm}]=\mp2\sigma^{\pm}$ (the sign reflects our convention $\sigma^{z}=|0\rangle\langle 0|-|1\rangle\langle 1|$ with $\sigma^{+}=|1\rangle\langle 0|$):

$$
\begin{aligned}
U(t)\,a\,U^{\dagger}(t) &= e^{-i\omega_c t}\,a,
&\quad 
U(t)\,a^{\dagger}\,U^{\dagger}(t) &= e^{+i\omega_c t}\,a^{\dagger},\\[6pt]
U(t)\sigma^{+}U^{\dagger}(t) &= e^{+i\omega_a t}\,\sigma^{+},
&\quad
U(t)\sigma^{-}U^{\dagger}(t) &= e^{-i\omega_a t}\,\sigma^{-}.
\end{aligned}
$$

*(The atomic signs follow from $H_{\text{atom}}=-\tfrac{\omega_a}{2}\sigma^{z}$: the state $|1\rangle$ lies $\omega_a$ above $|0\rangle$, so the raising operator $\sigma^{+}$ acquires the phase $e^{+i\omega_a t}$, just as $a^{\dagger}$ acquires $e^{+i\omega_c t}$.)*

Write $H_{\text{int}}$ as a sum of four products and dress each one:

$$
\begin{aligned}
U(a^{\dagger}\sigma^{+})U^{\dagger} &= 
      e^{+i\omega_c t}\,e^{+i\omega_a t}\,a^{\dagger}\sigma^{+}
      = a^{\dagger}\sigma^{+}e^{\,i\omega_\Sigma t},\\
U(a^{\dagger}\sigma^{-})U^{\dagger} &= 
      e^{+i\omega_c t}\,e^{-i\omega_a t}\,a^{\dagger}\sigma^{-}
      = a^{\dagger}\sigma^{-}e^{\,i\Delta t},\\
U(a\sigma^{+})U^{\dagger} &=
      e^{-i\omega_c t}\,e^{+i\omega_a t}\,a\sigma^{+}
      = a\sigma^{+}e^{-i\Delta t},\\
U(a\sigma^{-})U^{\dagger} &=
      e^{-i\omega_c t}\,e^{-i\omega_a t}\,a\sigma^{-}
      = a\sigma^{-}e^{-i\omega_\Sigma t},
\end{aligned}
$$

where $\displaystyle\Delta\equiv\omega_c-\omega_a$ is the atom–cavity detuning (the difference between the cavity mode frequency and the atomic transition frequency) and $\omega_\Sigma\equiv\omega_c+\omega_a$ is the sum frequency.

Multiplying by the overall coupling $g$ and collecting the four terms:

$$
\boxed{\,
H_{I}(t)
  = g\Bigl(
      a^{\dagger}\sigma^{+}e^{\,i\omega_\Sigma t}
      +a^{\dagger}\sigma^{-}e^{\,i\Delta t}
      +a\sigma^{+}e^{-i\Delta t}
      +a\sigma^{-}e^{-i\omega_\Sigma t}
     \Bigr)\,}.
$$

In deriving $H_I(t)$, we used $a(t) = a\,e^{-i\omega_c t}$, $a^\dagger(t) = a^\dagger e^{+i\omega_c t}$ for the field operators and $\sigma_-(t) = \sigma_- e^{-i\omega_a t}$, $\sigma_+(t) = \sigma_+ e^{+i\omega_a t}$ for the atomic operators, then substituted into the interaction term and simplified.
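The phase factors acquired by the field and atomic operators can be checked numerically. The following is a minimal sketch, assuming a truncated Fock space of dimension `N` and illustrative values for `wc`, `wa`, and `t` (none of which come from the text); it uses the document's convention $\sigma_z = |0\rangle\langle 0| - |1\rangle\langle 1|$ and $\hbar = 1$.

```python
import numpy as np

N = 6                        # Fock-space truncation (assumed)
wc, wa, t = 5.0, 4.2, 0.37   # illustrative frequencies and time, hbar = 1

# Operators on the joint space (field tensor atom); atomic basis order is (|0>, |1>).
a = np.kron(np.diag(np.sqrt(np.arange(1, N)), k=1), np.eye(2))   # photon annihilation
sp = np.kron(np.eye(N), np.array([[0.0, 0.0], [1.0, 0.0]]))      # sigma_+ = |1><0|
sz = np.kron(np.eye(N), np.diag([1.0, -1.0]))                    # sigma_z = |0><0| - |1><1|
ad, sm = a.T, sp.T                                               # real matrices: dagger = transpose

H0 = wc * ad @ a - 0.5 * wa * sz
# H0 is diagonal in this basis, so U = exp(i H0 t) is a diagonal matrix of phases.
U = np.diag(np.exp(1j * np.diag(H0) * t))

def dress(X):
    """Return U X U^dagger for U = exp(i H0 t)."""
    return U @ X @ U.conj().T

delta, sigma = wc - wa, wc + wa
checks = {
    "a":         np.allclose(dress(a), np.exp(-1j * wc * t) * a),
    "sigma_+":   np.allclose(dress(sp), np.exp(+1j * wa * t) * sp),
    "a^dag s_-": np.allclose(dress(ad @ sm), np.exp(+1j * delta * t) * ad @ sm),
    "a s_+":     np.allclose(dress(a @ sp), np.exp(-1j * delta * t) * a @ sp),
    "a^dag s_+": np.allclose(dress(ad @ sp), np.exp(+1j * sigma * t) * ad @ sp),
    "a s_-":     np.allclose(dress(a @ sm), np.exp(-1j * sigma * t) * a @ sm),
}
print(checks)
```

Because $H^0$ is diagonal, these relations hold exactly even in the truncated space, so every entry of `checks` should be `True`.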

In the above expression, we see two kinds of terms oscillating with different frequencies:

* Terms like $a\,\sigma_+\,e^{-i\Delta t}$ and $a^\dagger\,\sigma_-\,e^{+i\Delta t}$, which oscillate with frequency $\Delta$ (the **detuning**), and
* Terms like $a\,\sigma_-\,e^{-i\omega_\Sigma t}$ and $a^\dagger\,\sigma_+\,e^{+i\omega_\Sigma t}$, which oscillate with the much higher frequency $\omega_\Sigma = \omega_c + \omega_a$ (the sum of the atomic and cavity frequencies).

If the atom and cavity are near resonance or only modestly detuned (such that $|\Delta| \ll \omega_\Sigma$), then $\omega_\Sigma$ is a very large frequency compared to $\Delta$. In this case, the terms oscillating at $\omega_\Sigma$ (often called **counter-rotating terms**) average out to nearly zero effect over timescales of interest, because they rotate *very fast*. Neglecting these rapidly oscillating terms is known as the **rotating wave approximation (RWA)**. The rationale is that over a timescale long compared to $1/\omega_\Sigma$, these terms produce negligible net interaction (their effect *destructively interferes* due to the rapid phase oscillation). We retain only the near-resonant terms oscillating with the smaller frequency $\Delta$.
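The averaging argument can be made concrete with a small sketch. It evaluates the magnitude of the time average of $e^{i\omega t}$ over a window $T$, which has the closed form $|e^{i\omega T}-1|/(\omega T)$, for the slow frequency $\Delta$ and the fast frequency $\omega_\Sigma$. The numerical values are illustrative assumptions, and this is only a heuristic: a full justification of the RWA also requires $g \ll \omega_\Sigma$.

```python
import cmath

def window_average(omega, T):
    """|(1/T) * integral_0^T exp(i*omega*t) dt| = |exp(i*omega*T) - 1| / (omega*T)."""
    return abs(cmath.exp(1j * omega * T) - 1.0) / (omega * T)

wc, wa = 5.0, 4.8      # illustrative cavity and atom frequencies (assumed)
T = 2.0                # averaging window: long compared with 1/(wc + wa), short compared with 1/(wc - wa)

slow = window_average(wc - wa, T)   # terms oscillating at the detuning
fast = window_average(wc + wa, T)   # counter-rotating terms

print(f"detuning terms survive with weight {slow:.3f}")
print(f"counter-rotating terms are suppressed to {fast:.3f}")
```

For these values the slowly rotating terms keep almost their full weight while the counter-rotating terms are suppressed to a few percent.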

After applying the RWA and then transforming back to the stationary frame, we obtain the **Jaynes–Cummings Hamiltonian**:

$$
H_{\text{JC}} \;=\; \omega_c\,a^\dagger a \;-\; \frac{\omega_a}{2}\,\sigma_z \;+\; g\Big(a\,\sigma_+ + a^\dagger\,\sigma_- \Big) \,.
$$

This simplified Hamiltonian $H_{\text{JC}}$ captures the essence of the resonant (or near-resonant) atom-cavity interaction, excluding the counter-rotating processes. The coupling term $g(a\,\sigma_+ + a^\dagger\,\sigma_-)$ describes the exchange of a quantum of excitation between the atom and the field: $a^\dagger \sigma_-$ creates a photon while lowering the atom (emission), and $a\,\sigma_+$ annihilates a photon while raising the atom (absorption).
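One practical consequence of dropping the counter-rotating terms is that the total excitation number becomes a conserved quantity. The sketch below checks this with truncated matrices: it builds both the full interaction $g(a^\dagger + a)(\sigma_+ + \sigma_-)$ and the Jaynes–Cummings coupling and evaluates their commutators with $a^\dagger a + \sigma_+\sigma_-$. The truncation `N` and parameter values are assumptions chosen for illustration.

```python
import numpy as np

N = 8                        # Fock-space truncation (assumed)
wc, wa, g = 5.0, 4.2, 0.1    # illustrative values, hbar = 1

a = np.kron(np.diag(np.sqrt(np.arange(1, N)), k=1), np.eye(2))   # photon annihilation
sm = np.kron(np.eye(N), np.array([[0.0, 1.0], [0.0, 0.0]]))      # sigma_- = |0><1|
sz = np.kron(np.eye(N), np.diag([1.0, -1.0]))                    # sigma_z = |0><0| - |1><1|
ad, sp = a.T, sm.T

H_free = wc * ad @ a - 0.5 * wa * sz
H_jc = H_free + g * (a @ sp + ad @ sm)             # after the RWA
H_full = H_free + g * (ad + a) @ (sp + sm)         # before the RWA

# Total excitation number: photons plus atomic excitation (sigma_+ sigma_- = |1><1|).
N_exc = ad @ a + sp @ sm

def commutator(X, Y):
    return X @ Y - Y @ X

jc_conserves = np.allclose(commutator(H_jc, N_exc), 0.0)
full_conserves = np.allclose(commutator(H_full, N_exc), 0.0)
print("JC conserves excitations:  ", jc_conserves)
print("full model conserves them: ", full_conserves)
```

The first check should succeed and the second should fail, which is why $H_{\text{JC}}$ can be diagonalized block by block while the full model cannot.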

**Energy eigenstates (dressed states):** The Jaynes–Cummings Hamiltonian can be diagonalized within each subspace of total excitation number. Because $H_{\text{JC}}$ conserves the total number of excitations ($n_{\text{photons}} +$ atomic excitation), it couples only the states $|n+1, 0\rangle$ ($n+1$ photons, atom in the ground state) and $|n, 1\rangle$ ($n$ photons, atom in the excited state) for each non-negative integer $n$; the ground state $|0,0\rangle$ is uncoupled. In the basis $\{\,|n+1,0\rangle,\;|n,1\rangle\,\}$, the Hamiltonian restricted to this two-dimensional subspace is:

$$
H_{\text{JC}}^{(n)} = \begin{pmatrix}
\omega_c (n+1) - \frac{\omega_a}{2} & g\sqrt{\,n+1\,} \\
g\sqrt{\,n+1\,} & \omega_c n + \frac{\omega_a}{2}
\end{pmatrix}\!,
$$

where the diagonal entries are the energies of $|n+1,0\rangle$ and $|n,1\rangle$ respectively (with the $-\omega_a/2$ or $+\omega_a/2$ coming from the atomic term), and the off-diagonal terms $g\sqrt{n+1}$ are the coupling matrix elements (the factor $\sqrt{n+1}$ arises from $\langle n|a|n+1\rangle = \sqrt{n+1}$, since the coupling connects states that differ by one photon).

Diagonalizing this $2\times2$ matrix yields the eigenstates, often called **dressed states** because they are superpositions of atom+photon states. The dressed energy eigenstates for the subspace with total excitation $n+1$ are:

$$
|n,+\rangle \;=\; \cos\frac{\vartheta_n}{2}\,|n+1,0\rangle \;+\; \sin\frac{\vartheta_n}{2}\,|n,1\rangle \,, 
$$

$$
|n,-\rangle \;=\; -\sin\frac{\vartheta_n}{2}\,|n+1,0\rangle \;+\; \cos\frac{\vartheta_n}{2}\,|n,1\rangle \,,
$$

where the mixing angle $\vartheta_n$ is defined by $\tan \vartheta_n = \frac{2g\sqrt{\,n+1\,}}{\Delta}$ (it measures the degree of mixing between the two basis states). Physically, $\vartheta_n$ is small (so that $|n,+\rangle \approx |n+1,0\rangle$ and $|n,-\rangle \approx |n,1\rangle$) when the atom and cavity are far off-resonance with $\Delta > 0$ ($g\sqrt{n+1} \ll \Delta$), but becomes $\pi/2$ at exact resonance $\Delta=0$ (maximal mixing).

The corresponding eigenenergies of these dressed states are found by solving the $2\times2$ matrix eigenvalue problem. They are:

$$
E_{|n,\pm\rangle} \;=\; \omega_c\Big(n+\frac{1}{2}\Big) \;\pm\; \frac{\Delta}{2}\sqrt{\,1 + \frac{4g^2(n+1)}{\Delta^2}\,}\,\,,
$$

where $\omega_c(n+\tfrac{1}{2})$ is the mean of the two diagonal entries of $H_{\text{JC}}^{(n)}$ (the zero-point energy remains dropped) and we have assumed $\Delta > 0$, so that $|n,+\rangle$ is the upper branch. For either sign of $\Delta$, including $\Delta = 0$, the two eigenvalues are $\omega_c(n+\tfrac{1}{2}) \pm \tfrac{1}{2}\sqrt{\Delta^2 + 4g^2(n+1)}$. These energies $E_{|n,\pm\rangle}$ are plotted qualitatively in Fig. 2.1 as a function of the detuning $\Delta$. At exact resonance ($\Delta=0$), the two uncoupled states $|n+1,0\rangle$ and $|n,1\rangle$ are degenerate, and the coupling $g$ splits the degeneracy (an **avoided crossing**), yielding a symmetric and an antisymmetric combination separated by the Rabi splitting $2g\sqrt{n+1}$. For large detuning ($|\Delta| \gg g\sqrt{n+1}$), the mixing is weak and the dressed energies approach those of the uncoupled states.
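The block diagonalization can be verified directly. The sketch below builds the $2\times2$ block in the basis $\{|n+1,0\rangle, |n,1\rangle\}$, forms the eigenvalues $\omega_c(n+\tfrac12) \pm \tfrac12\sqrt{\Delta^2+4g^2(n+1)}$, and checks that the trial vectors built from the mixing angle satisfy the eigenvalue equation. It assumes the convention that $|n,+\rangle$ tends to $|n+1,0\rangle$ for large positive detuning; the function names and numerical values are illustrative.

```python
import math

def jc_block(n, wc, wa, g):
    """2x2 Jaynes-Cummings block in the basis (|n+1,0>, |n,1>)."""
    G = g * math.sqrt(n + 1)
    return [[wc * (n + 1) - wa / 2, G], [G, wc * n + wa / 2]]

def dressed(n, wc, wa, g):
    """Return ((E_plus, v_plus), (E_minus, v_minus)) from the closed-form solution."""
    delta = wc - wa
    G = g * math.sqrt(n + 1)
    split = math.sqrt(delta**2 + 4 * G**2)
    theta = math.atan2(2 * G, delta)          # tan(theta) = 2 g sqrt(n+1) / delta
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    mean = wc * (n + 0.5)
    return (mean + split / 2, (c, s)), (mean - split / 2, (-s, c))

def residual(H, E, v):
    """Largest component of H v - E v; zero for an exact eigenpair."""
    return max(abs(H[i][0] * v[0] + H[i][1] * v[1] - E * v[i]) for i in range(2))

n, wc, wa, g = 2, 5.0, 4.5, 0.1               # illustrative values, hbar = 1
H = jc_block(n, wc, wa, g)
(E_p, v_p), (E_m, v_m) = dressed(n, wc, wa, g)
print("residuals:", residual(H, E_p, v_p), residual(H, E_m, v_m))

# On resonance the splitting reduces to 2 g sqrt(n+1).
(E_p0, _), (E_m0, _) = dressed(3, 5.0, 5.0, 0.1)
print("resonant splitting:", E_p0 - E_m0)
```

Using `atan2` rather than a plain arctangent keeps the mixing angle well defined at $\Delta = 0$, where it evaluates to $\pi/2$.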

### 2.1.1 Dispersive Regime

The **dispersive regime** refers to the situation where the atom and cavity are far off-resonant relative to their coupling—meaning the detuning $|\Delta|$ is much larger than the coupling strength $g$ (formally, $|g/\Delta| \ll 1$). In this regime, the atom and photon do not exchange energy efficiently; instead, the interaction manifests primarily as a perturbative shift of the energy levels. The dressed states $|n,\pm\rangle$ then tend towards the bare states $|n+1,0\rangle$ and $|n,1\rangle$ with only small admixture.

Mathematically, for large $|\Delta|$ we can expand the square root in the energy formula above to see the leading perturbative effect. For $|\Delta| \gg g\sqrt{n+1}$, we use $\sqrt{1+x} \approx 1 + \frac{x}{2} - \frac{x^2}{8} + \cdots$. Let $x = \frac{4g^2(n+1)}{\Delta^2}$, which is small in the dispersive regime. Then to first order in $x$:

$$
\sqrt{\,1 + \frac{4g^2(n+1)}{\Delta^2}\,} \;\approx\; 1 + \frac{2g^2(n+1)}{\Delta^2}\,.
$$

Substituting this into $E_{|n,\pm\rangle}$ and simplifying, we get approximate expressions for the dressed energies. Specifically, each of $E_{|n,\pm\rangle}$ approaches one of the uncoupled energies plus a small shift. For the *lower* dressed state $|n,-\rangle$ (which correlates with $|n,1\rangle$ for large detuning) we find:

$$
E_{|n,-\rangle} \;\approx\; \omega_c\,n + \frac{\omega_a}{2} \;-\; \frac{g^2(n+1)}{\Delta}\,,
$$

and for the *upper* dressed state $|n,+\rangle$ (correlating with $|n+1,0\rangle$):

$$
E_{|n,+\rangle} \;\approx\; \omega_c\,(n+1) - \frac{\omega_a}{2} \;+\; \frac{g^2(n+1)}{\Delta}\,. 
$$

In deriving these, we used $\Delta = \omega_c - \omega_a$, so that $\omega_c\big(n+\tfrac{1}{2}\big) \pm \tfrac{\Delta}{2}$ reproduces the uncoupled energies, and the $\pm \frac{g^2(n+1)}{\Delta}$ term is the shift. We can re-express the above results more clearly as modifications to the energies of the uncoupled states $|n+1,0\rangle$ and $|n,1\rangle$:

* The state $|n+1,0\rangle$ (one extra photon with atom in ground state), which **unperturbed** would have energy $E_{|n+1,0\rangle} = \omega_c(n+1) - \frac{\omega_a}{2}$, now has an extra upward shift of approximately $+\frac{g^2(n+1)}{\Delta}$. Thus its energy in the dispersive regime is $E_{|n,+\rangle} \approx \omega_c(n+1) - \frac{\omega_a}{2} + \frac{g^2(n+1)}{\Delta}$.

* The state $|n,1\rangle$ (atom excited, one fewer photon), with unperturbed energy $E_{|n,1\rangle} = \omega_c n + \frac{\omega_a}{2}$, acquires a downward shift of approximately $-\frac{g^2(n+1)}{\Delta}$. So $E_{|n,-\rangle} \approx \omega_c n + \frac{\omega_a}{2} - \frac{g^2(n+1)}{\Delta}$.

We see that **to leading order in $g/\Delta$ the only effect of the atom–cavity coupling is to shift the energies of the bare states**, by an amount of order $g^2/\Delta$. Importantly, the magnitude of the energy shift depends on the state of both the atom and the field: the factor $n+1$ is the photon number itself when the atom is in the ground state (the state $|n+1,0\rangle$), and the photon number plus one when the atom is excited (the state $|n,1\rangle$).

In other words, in the dispersive regime the coupling effectively yields a photon-number-dependent shift of the atomic transition frequency and an atom-state-dependent shift of the cavity frequency. The former is usually called the **AC Stark shift** (its photon-independent part is the **Lamb shift**), and the latter the **dispersive shift** of the cavity. Concretely, from the above expressions one can read off that the cavity mode frequency $\omega_c$ is *renormalized* to $\omega_c + \chi$ when the atom is in $|0\rangle$, but to $\omega_c - \chi$ when the atom is in $|1\rangle$, where

$$
\chi \;\equiv\; \frac{g^2}{\Delta}\,
$$

to leading order. Equivalently, the atomic transition frequency at fixed photon number $n$, $E_{|n,1\rangle} - E_{|n,0\rangle}$, is shifted from $\omega_a$ to $\omega_a - (2n+1)\chi$. These shifts are small compared to the base frequencies (since $g \ll |\Delta|$), but they are measurable and in fact extremely useful.
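As a numerical sanity check on these shifts, the sketch below compares the exact dressed energies with the dispersive predictions. It assumes $\Delta > 0$, so that the branch $E_+$ of block $n$ is the state adiabatically connected to $|n+1,0\rangle$ and $E_-$ the one connected to $|n,1\rangle$. From the approximate energies above, the cavity frequency should be $\omega_c \pm \chi$ and the atomic frequency at photon number $n$ should be $\omega_a - (2n+1)\chi$. The parameter values are illustrative.

```python
import math

def e_plus(n, wc, wa, g):
    """Exact energy of the dressed state connected to |n+1,0> (for wc > wa)."""
    return wc * (n + 0.5) + 0.5 * math.sqrt((wc - wa) ** 2 + 4 * g * g * (n + 1))

def e_minus(n, wc, wa, g):
    """Exact energy of the dressed state connected to |n,1> (for wc > wa)."""
    return wc * (n + 0.5) - 0.5 * math.sqrt((wc - wa) ** 2 + 4 * g * g * (n + 1))

wc, wa, g = 5.0, 4.0, 0.02        # illustrative values with g << wc - wa, hbar = 1
chi = g**2 / (wc - wa)
n = 1

# Cavity frequency = energy cost of one more photon at fixed atomic state.
cavity_atom0 = e_plus(n, wc, wa, g) - e_plus(n - 1, wc, wa, g)      # |n,0> -> |n+1,0>
cavity_atom1 = e_minus(n + 1, wc, wa, g) - e_minus(n, wc, wa, g)    # |n,1> -> |n+1,1>
# Atomic frequency at fixed photon number n: |n,0> -> |n,1>.
atom_freq = e_minus(n, wc, wa, g) - e_plus(n - 1, wc, wa, g)

print(f"cavity, atom in |0>: {cavity_atom0:.7f}  vs  wc + chi = {wc + chi:.7f}")
print(f"cavity, atom in |1>: {cavity_atom1:.7f}  vs  wc - chi = {wc - chi:.7f}")
print(f"atom at n = {n}:      {atom_freq:.7f}  vs  wa - (2n+1) chi = {wa - (2 * n + 1) * chi:.7f}")
```

The residual differences are of order $g^4/\Delta^3$, the next term in the expansion of the square root.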

**Physical significance (dispersive readout):** The state-dependent frequency shift means that by measuring the cavity frequency, one can infer the state of the atom (qubit) without directly measuring the atom. In practice, one probes the cavity with a microwave tone and sees a slight change in resonance frequency depending on whether the qubit is in $|0\rangle$ or $|1\rangle$. This is the basis of **quantum nondemolition readout** in circuit QED: the qubit state can be detected by the phase shift or transmission of a microwave tone through the cavity, which differs between the two qubit states because of the dispersive shift. (Here $2\chi = 2g^2/\Delta$ is the difference in cavity frequency for the qubit in $|0\rangle$ vs $|1\rangle$.)

In summary, the dispersive limit allows a perturbative treatment of the atom–cavity interaction, yielding an **effective Hamiltonian** where the coupling term is replaced by an interaction that shifts energies depending on the qubit state. We will derive such effective Hamiltonians more systematically in the next chapter, but the simple expansion above already gave us a leading-order approximation of the **dispersive Hamiltonian**:

$$
H_{\text{disp}} \;\approx\; \omega_c\,a^\dagger a \;-\; \frac{\omega_a + \delta\omega_a}{2}\,\sigma_z \;+\; \chi\,a^\dagger a\,\sigma_z \,,
$$

where $\chi = g^2/\Delta$ and $\delta\omega_a$ is a small shift in the qubit frequency (often called the Lamb shift); a constant energy offset $-\chi/2$ has been dropped. The sign of the $\sigma_z$ term follows our convention $H_{\text{atom}} = -\frac{\omega_a}{2}\sigma_z$. Comparing to the expressions above, $\delta\omega_a \approx -\frac{g^2}{\Delta}$ (so the qubit frequency is slightly reduced by the interaction when $\Delta = \omega_c - \omega_a$ is positive, as is typical in circuit QED setups). The $a^\dagger a\,\sigma_z$ term indicates that the cavity's photon number influences the qubit energy splitting and vice versa.

The key point is that in the dispersive regime, the coupling does not swap excitations (since off-resonant) but instead results in **state-dependent frequency shifts**. These shifts are exploited for qubit readout (as described above, measuring the cavity frequency reveals the qubit state). We shall revisit these effective Hamiltonians with more rigorous perturbation theory soon.

## 2.2 Circuit QED

We now turn to **circuit QED**, the implementation of cavity QED concepts using superconducting circuits. In circuit QED, a superconducting microwave resonator plays the role of the cavity, and superconducting qubits (circuits with two or more energy levels) play the role of atoms. The strong coupling between a qubit and the resonator photons can reach the regime analogous to cavity QED, enabling phenomena like Rabi oscillations and dispersive shifts, but now in a solid-state device.

### 2.2.1 The Transmon Qubit

First, we need a superconducting element that can act as a qubit. A simple LC oscillator circuit is not suitable as a qubit because its energy levels are equally spaced (harmonic oscillator); if we tried to use two levels of a harmonic oscillator as $|0\rangle$ and $|1\rangle$, any drive resonant with the $0\to1$ transition would also excite $1\to2$, $2\to3$, etc., making it impossible to address only two levels. We need an **anharmonic** oscillator, one with unequal level spacings, so that one pair of levels can serve as a qubit distinct from higher levels. This is achieved by introducing a **Josephson junction** into the circuit.

A **Josephson junction** consists of two superconductors separated by a thin insulating barrier through which Cooper pairs (paired superconducting electrons) can quantum tunnel. The junction acts as a nonlinear inductor with inductance depending on current/phase, providing the needed anharmonicity. The transmon qubit is a specific design of a superconducting qubit that is essentially a capacitively shunted Josephson junction. It is derived from the earlier **Cooper pair box** qubit design.

In a Cooper pair box (CPB), we have a small superconducting island connected via a Josephson junction (with Josephson energy $E_J$) to a superconducting reservoir, and the island also has a capacitance $C$ to ground (and often a gate capacitance $C_g$ to a voltage source to tune the charge). The relevant degrees of freedom are the number $n$ of Cooper pairs on the island (which is related to charge $q = -2en$ on the island) and the superconducting phase difference $\varphi$ across the junction. The CPB Hamiltonian can be written as:

$$
H_{\text{CPB}} = 4E_C\,(n - n_g)^2 \;-\; E_J \cos\!\varphi \,,
$$

where $E_C = \frac{e^2}{2C}$ is the charging energy of a single electron (the factor of 4 in the first term accounts for a Cooper pair carrying charge $2e$), with $C$ the total relevant capacitance of the island, including $C_g$ and the junction’s capacitance $C_J$, and $n_g = C_g V_g/(2e)$ is the dimensionless gate charge (the offset charge controlled by an applied gate voltage $V_g$ on the gate capacitor $C_g$). The first term $4E_C (n-n_g)^2$ is the electrostatic energy (parabolic in $n$) and the second term $-E_J \cos\varphi$ is the Josephson tunneling energy (which is $2\pi$-periodic in the phase $\varphi$ and provides the anharmonicity). The operators $n$ and $\varphi$ are canonically conjugate (with $[\varphi, n] = i$), analogous to position and momentum.

For the transmon qubit, one operates in the regime $E_J \gg E_C$ (the Josephson energy is large compared to the charging energy). In this regime, the qubit is less sensitive to charge noise (hence improved coherence) at the cost of reduced anharmonicity. The transmon can be thought of as a weakly anharmonic oscillator: its energy levels resemble those of a harmonic oscillator but with a slight nonlinearity. Perturbatively, the transmon’s level spacings are given by $\hbar\omega_{10} \approx \sqrt{8E_J E_C} - E_C$ for the $0\to1$ transition, and the next transition is $\hbar\omega_{21} \approx \sqrt{8E_J E_C} - 2E_C$ (the difference, $E_C$, is the magnitude of the anharmonicity). In summary, keeping the lowest levels and measuring energies from the ground state, the transmon’s Hamiltonian can be written as:

$$
H_{\text{transmon}} \approx \omega_{10}\,|1\rangle\langle 1| + (2\omega_{10} - \delta)\,|2\rangle\langle 2| + \cdots,
$$

where $\delta \approx E_C/\hbar$ (often denoted $|\alpha|$ in the literature) is the anharmonicity: the $1\to2$ transition is lower in frequency by $\delta$ compared to the $0\to1$ transition, so that level $|2\rangle$ sits at $\omega_{10} + \omega_{21} = 2\omega_{10} - \delta$. In more familiar oscillator language, one sometimes models the transmon as a **Duffing oscillator** (an oscillator with a quartic nonlinearity): the transmon can be approximated by a Hamiltonian of the form

$$
H \approx \omega_q\, c^\dagger c \;-\; \frac{\delta}{2}\,c^{\dagger 2} c^2 \,,
$$

where $c^\dagger, c$ are bosonic creation/annihilation operators for excitations of the transmon (with the understanding that this is only valid for low excitation numbers due to the finite-level nature of the transmon). The $\delta$ term ensures that each subsequent excitation costs slightly less energy than the previous one, thus preventing equal spacing.

In the transmon qubit, because $E_J \gg E_C$, the anharmonicity $\delta$ is small compared to the fundamental frequency $\omega_{10}$. This **low anharmonicity** is a hallmark of the transmon: it improves coherence but requires us to account for the possibility of populating levels beyond $|1\rangle$ when we apply strong drives or have couplings (since transitions to $|2\rangle$ are only slightly off-resonant from those to $|1\rangle$). Throughout this thesis, we will **not** treat the transmon as a strict two-level system but rather include at least the first few levels in analyses.
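The perturbative level spacings can be compared with a direct numerical diagonalization of $H_{\text{CPB}}$. The sketch below is one common way to do this, not necessarily the method used elsewhere in this thesis: it writes the Hamiltonian in the charge basis, where $\cos\varphi$ couples neighbouring Cooper-pair numbers with amplitude $1/2$, and truncates to $|n| \le$ `ncut`. The function name `cpb_levels`, the cutoff, and the ratio $E_J/E_C = 50$ are illustrative assumptions.

```python
import numpy as np

def cpb_levels(EJ, EC, ng=0.0, ncut=30, nlevels=3):
    """Lowest eigenenergies of H = 4 EC (n - ng)^2 - EJ cos(phi) in the charge basis."""
    n = np.arange(-ncut, ncut + 1)
    H = np.diag(4.0 * EC * (n - ng) ** 2)
    # cos(phi) = (e^{i phi} + e^{-i phi}) / 2 shifts the Cooper-pair number by +/- 1.
    hop = -0.5 * EJ * np.ones(2 * ncut)
    H = H + np.diag(hop, k=1) + np.diag(hop, k=-1)
    return np.linalg.eigvalsh(H)[:nlevels]

EC, EJ = 1.0, 50.0                       # energies in units of EC (assumed ratio)
E0, E1, E2 = cpb_levels(EJ, EC)
w10, w21 = E1 - E0, E2 - E1
anharmonicity = w21 - w10                # negative for a transmon

w10_pert = np.sqrt(8 * EJ * EC) - EC     # leading-order estimate of the 0->1 spacing
print(f"w10 = {w10:.3f} EC  (perturbative estimate {w10_pert:.3f} EC)")
print(f"w21 - w10 = {anharmonicity:.3f} EC  (leading-order estimate -1 EC)")

# Charge-noise insensitivity: the 0->1 spacing barely depends on the offset charge.
E0h, E1h, _ = cpb_levels(EJ, EC, ng=0.5)
print(f"change of w10 between ng = 0 and ng = 0.5: {abs((E1h - E0h) - w10):.2e} EC")
```

The numerical spacings differ from the leading-order formulas by corrections that shrink as $E_J/E_C$ grows, while the dependence on $n_g$ is exponentially small in this regime.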

**Flux tuning (briefly):** Many superconducting qubits, including certain transmon variants, allow tuning of $E_J$ (and thus the qubit frequency) by applying magnetic flux through a superconducting loop that contains the Josephson junction (or multiple junctions). This is discussed in Appendix A. By adjusting the flux, one can change the qubit frequency $\omega_q$ in situ. Fixed-frequency transmons have a single junction, while flux-tunable transmons have a SQUID (two junctions in a loop) that effectively acts as a single junction with flux-dependent $E_J(\Phi) = E_{J,\text{max}}|\cos(\pi\Phi/\Phi_0)|$, where $\Phi_0 = h/(2e)$ is the flux quantum. Flux tuning is used, for example, in the iSWAP gate described later (Section 4.2.2), to bring qubits in and out of resonance.
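Combining the flux-dependent Josephson energy with the perturbative spacing $\hbar\omega_{10} \approx \sqrt{8E_JE_C} - E_C$ gives a rough tuning curve. The sketch below is only that estimate, under the assumption of a symmetric SQUID; it loses validity near $\Phi = \Phi_0/2$, where $E_J \to 0$ and the transmon condition $E_J \gg E_C$ fails. The function names and the values of `ej_max` and `ec` (in GHz) are illustrative.

```python
import math

def ej_flux(phi_over_phi0, ej_max):
    """Effective Josephson energy of a symmetric SQUID at reduced flux Phi/Phi_0."""
    return ej_max * abs(math.cos(math.pi * phi_over_phi0))

def qubit_freq(phi_over_phi0, ej_max, ec):
    """Leading-order transmon 0->1 frequency, sqrt(8 EJ EC) - EC (same units as inputs)."""
    return math.sqrt(8.0 * ej_flux(phi_over_phi0, ej_max) * ec) - ec

ej_max, ec = 15.0, 0.3   # GHz, assumed values giving EJ/EC = 50 at zero flux
for flux in (0.0, 0.1, 0.2, 0.3):
    print(f"Phi/Phi0 = {flux:.1f}:  f_q ~ {qubit_freq(flux, ej_max, ec):.3f} GHz")
```

The frequency is largest at zero flux and decreases as the flux moves toward half a flux quantum.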

### 2.2.2 Resonator and Qubit Coupling in Circuit QED

A superconducting **transmission line resonator** acts as the cavity in circuit QED. It can be approximated as a lumped-element LC oscillator for a single mode. If $L$ and $C$ are the effective inductance and capacitance of a particular resonant mode, the resonant frequency is $\omega_r = 1/\sqrt{LC}$. The Hamiltonian of the resonator mode can be written analogous to Eq. (2.1):

$$
H_{\text{res}} = \omega_r \left( a^\dagger a + \frac{1}{2} \right) \,,
$$

where $a^\dagger$ creates an excitation (photon) in the resonator mode. The inductive energy and capacitive energy correspond to the conjugate variables of flux $\Phi$ through the inductor and charge $q$ on the capacitor. Specifically, one can write the resonator Hamiltonian as $q^2/(2C) + \Phi^2/(2L)$, and relate these to the $a, a^\dagger$ through $q \propto i\sqrt{\frac{C\hbar\omega_r}{2}}(a^\dagger - a)$ and $\Phi \propto \sqrt{\frac{L\hbar\omega_r}{2}}(a^\dagger + a)$. We will omit the constant $\omega_r/2$ term as usual. In practice, a transmission line resonator has many modes, but typically only the lowest-frequency mode (or one particular mode) is near resonance with the qubit, and higher modes are far off-resonant and can be ignored for the most part, justifying the single-mode approximation.

The coupling between a transmon qubit and the resonator is typically through a capacitive interaction. If the transmon is placed near a voltage antinode of the resonator (where the electric field of the mode is largest), an interaction of the form $g (c + c^\dagger)(a + a^\dagger)$ arises, where $c$ ($c^\dagger$) is the annihilation (creation) operator for the qubit (treated as an oscillator or using the Jaynes-Cummings two-level approximation) and $a$ ($a^\dagger$) for the resonator mode. For a two-level approximation of the qubit, and after the rotating wave approximation, this reduces to a Jaynes-Cummings type term $g(a\,\sigma_+ + a^\dagger\,\sigma_-)$ as we introduced. Thus, the physics of a transmon coupled to a resonator mode is formally the same as the cavity QED Jaynes-Cummings model described earlier.

The coupling strength $g$ depends on the capacitances and electric field participation of the transmon in the resonator mode. Typical strong coupling values in circuit QED are on the order of tens to hundreds of MHz, while $\omega_r$ and $\omega_q$ are in the GHz range. Thus, $g/\omega_{r,q} \sim 10^{-2}$ or $10^{-3}$, but still $g$ can be larger than the resonator and qubit linewidths, achieving the strong coupling regime where coherent oscillations are observable.

### 2.2.3 Measurement and Readout

As mentioned, in the dispersive regime of circuit QED, the qubit-state-dependent frequency shift of the resonator enables quantum non-demolition readout of the qubit. If the qubit-resonator detuning $\Delta = \omega_r - \omega_q$ is large, one can derive an effective Hamiltonian (we will do so in Chapter 3) of the form:

$$
H_{\text{disp}} = \Big(\omega_r + \chi\,\sigma_z\Big)a^\dagger a \;-\; \frac{\widetilde{\omega}_q}{2}\,\sigma_z \,,
$$

where $\chi = g^2/\Delta$ as before, and $\widetilde{\omega}_q$ is the slightly shifted qubit frequency. In this Hamiltonian, when the qubit is in $|0\rangle$ ($\sigma_z=+1$), the resonator frequency is $\omega_r + \chi$; when the qubit is in $|1\rangle$ ($\sigma_z=-1$), the resonator frequency is $\omega_r - \chi$. To perform a readout, we can send a probe microwave at a frequency near $\omega_r$. The phase or amplitude of the reflected/transmitted microwave will differ depending on the qubit state because the effective resonant frequency of the resonator is qubit-state-dependent. By measuring this difference (often via homodyne detection of the cavity's output field), we infer the qubit state with high fidelity.

As a simple illustration, consider the following scenario: The resonator is prepared in a coherent state $|\alpha\rangle$ (a state with a well-defined amplitude $\alpha$, e.g., by driving the cavity with a weak microwave so it has a small coherent excitation). The joint state of the qubit-resonator system might be $|\Psi(0)\rangle = |0\rangle \otimes |\alpha\rangle$ or $|1\rangle \otimes |\alpha\rangle$ depending on the qubit state (here $|0\rangle, |1\rangle$ refer to qubit states and $|\alpha\rangle$ is the resonator state). We then let this state evolve for some time $t$ under the dispersive Hamiltonian. Because $H_{\text{disp}}$ is diagonal in the qubit basis and depends on the resonator only through the photon number $a^\dagger a$, the evolution simply rotates the coherent state at a qubit-state-dependent rate, according to $e^{-i\theta a^\dagger a}|\alpha\rangle = |\alpha e^{-i\theta}\rangle$. Working in a frame rotating at $\omega_r$ and dropping a global phase, if initially $|\Psi(0)\rangle = |0\rangle|\alpha\rangle$, then under $H_{\text{disp}}$:

$$
|\Psi(t)\rangle = |0\rangle \otimes e^{-i\chi\,a^\dagger a\,t}\,|\alpha\rangle = |0\rangle \otimes |\alpha e^{-i\chi t}\rangle \,,
$$

whereas if $|\Psi(0)\rangle = |1\rangle|\alpha\rangle$:

$$
|\Psi(t)\rangle = |1\rangle \otimes |\alpha e^{+i\chi t}\rangle \,.
$$

Thus the coherent state of the resonator picks up a phase rotation of $-\chi t$ or $+\chi t$ in phase space, depending on the qubit state. After some interaction time, the two possible cavity states ($|\alpha e^{-i\chi t}\rangle$ vs $|\alpha e^{+i\chi t}\rangle$) diverge in phase. By homodyne measuring the phase of the resonator’s field, one can distinguish these two cases and thereby determine the qubit state. In practice, one chooses a measurement time such that the phase difference between the two qubit-conditioned cavity states is $\pi$ (maximizing distinguishability).
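The rotation of the coherent state and the resulting distinguishability of the two pointer states can be checked in a truncated Fock basis. The sketch below works in the frame rotating at $\omega_r$, applies $e^{\mp i\chi a^\dagger a t}$ to the Fock amplitudes of $|\alpha\rangle$, and compares the overlap of the two results with the standard coherent-state formula $|\langle\beta_0|\beta_1\rangle|^2 = e^{-|\beta_0-\beta_1|^2} = e^{-4|\alpha|^2\sin^2(\chi t)}$. The truncation `nmax` and the values of `alpha`, `chi`, and `t` are illustrative assumptions.

```python
import cmath
import math

def coherent(alpha, nmax=60):
    """Fock amplitudes c_n = exp(-|alpha|^2 / 2) alpha^n / sqrt(n!) for n < nmax."""
    amps, c = [], cmath.exp(-abs(alpha) ** 2 / 2)
    for n in range(nmax):
        amps.append(c)
        c = c * alpha / math.sqrt(n + 1)
    return amps

def rotate(amps, freq, t):
    """Apply exp(-i freq a^dagger a t), which is diagonal in the Fock basis."""
    return [c * cmath.exp(-1j * freq * n * t) for n, c in enumerate(amps)]

def overlap(u, v):
    return sum(x.conjugate() * y for x, y in zip(u, v))

alpha, chi, t = 2.0, 0.01, 30.0                  # assumed values, chi * t = 0.3 rad
state_q0 = rotate(coherent(alpha), +chi, t)      # qubit in |0>: cavity at omega_r + chi
state_q1 = rotate(coherent(alpha), -chi, t)      # qubit in |1>: cavity at omega_r - chi

# The rotated state is again a coherent state with amplitude alpha * exp(-i chi t).
expected_q0 = coherent(alpha * cmath.exp(-1j * chi * t))
max_dev = max(abs(x - y) for x, y in zip(state_q0, expected_q0))

pointer_overlap = abs(overlap(state_q0, state_q1)) ** 2
predicted = math.exp(-4 * abs(alpha) ** 2 * math.sin(chi * t) ** 2)
print("max deviation from |alpha e^{-i chi t}>:", max_dev)
print("pointer-state overlap:", pointer_overlap, "predicted:", predicted)
```

The overlap falls as the relative angle $2\chi t$ between the two pointer states grows and as $|\alpha|^2$ increases, which is why a larger probe amplitude or a longer interaction time improves distinguishability in this idealized, lossless picture.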

In summary, circuit QED provides a powerful architecture: the resonator serves both as a mediating bus for two-qubit interactions and as a readout device for the qubit. The transmon qubit embedded in circuit QED retains good coherence (due to the transmon’s insensitivity to charge noise) while allowing strong coupling to microwave photons for control and measurement. In the next chapter, we will formalize the derivation of effective Hamiltonians like the dispersive Hamiltonian above using systematic perturbative methods (including the Schrieffer–Wolff transformation). This will deepen our understanding of phenomena like the Stark shift and also set the stage for analyzing multi-qubit gate mechanisms in Chapter 4.




# Chapter 3: Deriving Effective Hamiltonians

In a system where a **small perturbative coupling** connects two (or more) parts of a Hamiltonian, it is often useful to derive an **effective Hamiltonian** that describes the low-energy dynamics without explicitly including high-energy virtual excitations. In this chapter, we present and compare several methods for deriving effective Hamiltonians perturbatively: the **Schrieffer–Wolff (SW) transformation** (a unitary perturbation theory approach), **adiabatic elimination**, **time-averaging (Magnus or averaging techniques)**, and ordinary perturbation theory. We will illustrate these methods using the Jaynes–Cummings model from Chapter 2 as a running example, deriving the dispersive Hamiltonian and related shifts systematically. We pay special attention to the Schrieffer–Wolff method, as it will be crucial for analyzing two-qubit gate schemes in the next chapter.

## 3.1 Schrieffer–Wolff Transformation

The **Schrieffer–Wolff transformation** is a method from perturbation theory (sometimes called **quasi-degenerate perturbation theory**) that block-diagonalizes a Hamiltonian up to a given order in a small parameter. By performing a carefully chosen unitary transformation, one can “remove” the effects of coupling between certain subspaces (such as between a low-energy subspace and a high-energy subspace) to a desired order in perturbation, yielding an effective Hamiltonian that acts within one subspace (e.g., the low-energy subspace) only.

The general setup is as follows. We have a Hamiltonian that can be decomposed as

$$
H = H_0 + H' = H_0 + \varepsilon V\,,
$$

where $H_0$ is a piece we know how to diagonalize (the “unperturbed” Hamiltonian) and $H' = \varepsilon V$ is a perturbation. We treat $\varepsilon$ as a formal small parameter to keep track of orders (at the end, one can set $\varepsilon = 1$ after expanding perturbatively). We assume the spectrum of $H_0$ is such that its eigenstates can be separated into two groups (subspaces) $A$ and $B$ with an energy gap $\Delta$ between any state in $A$ and any state in $B$. In other words, $H_0$ has eigenvalues $E_i^{(0)}$ and eigenstates $|i\rangle$ such that we can split the index $i$ into two sets (A and B) and if $|p\rangle \in A$ and $|q\rangle \in B$, then $|E_p^{(0)} - E_q^{(0)}| \ge \Delta > 0$. The perturbation $H'$ causes weak mixing between these subspaces but (for sufficiently small $\varepsilon$) not enough to close the gap or cause large changes in energies. This situation is common in dispersive regimes (like the qubit+resonator with large detuning: low-energy subspace = qubit in ground state, high-energy = qubit excited).

The Schrieffer–Wolff method seeks a unitary transformation $U = e^{-S}$ (with $S$ an anti-Hermitian operator, $S^\dagger = -S$) that approximately block-diagonalizes $H$ to a given order in $\varepsilon$. Specifically, we want $U$ such that the **effective Hamiltonian**

$$
H_{\text{eff}} = e^{-S} H\, e^{S}
$$

has no coupling between subspace $A$ and $B$ up to some order in $\varepsilon$. Equivalently, $H_{\text{eff}}$ can be written as a block-diagonal matrix (blocks corresponding to $A$ and $B$) to that perturbative order. We also demand (for convenience) that the transformation does not modify $H_0$ to zeroth order, i.e., we expect $H_{\text{eff}}$ to reduce to $H_0$ when $\varepsilon \to 0$.

To find $S$, we use the Baker–Campbell–Hausdorff (BCH) expansion for $H_{\text{eff}}$:

$$
H_{\text{eff}} = e^{-S}He^{S} = H + [H, S] + \frac{1}{2!}[[H, S], S] + \frac{1}{3!}[[[H,S],S],S] + \cdots\,.
$$

This infinite series expansion expresses $H_{\text{eff}}$ in terms of nested commutators of $H$ with $S$. We choose $S$ order by order in $\varepsilon$ such that all *block off-diagonal* parts of $H_{\text{eff}}$ vanish to that order. Let us denote by $H_{\text{eff}}^{\text{off}}$ the off-block-diagonal part of $H_{\text{eff}}$ (i.e., terms that connect states from $A$ to $B$). Our goal is to impose $H_{\text{eff}}^{\text{off}} = 0$ up to the desired order.
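As a sanity check on the BCH series, the following sketch compares the truncated sum of nested commutators with the exact $e^{-S} H e^{S}$ for a small random generator. The matrices, their size, and the scale of $S$ are assumptions chosen only for illustration; the matrix exponential is built from an eigendecomposition so that only NumPy is needed.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_hermitian(dim):
    """Random Hermitian matrix (illustrative test data only)."""
    m = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    return (m + m.conj().T) / 2

def comm(x, y):
    return x @ y - y @ x

def exp_antihermitian(s):
    """exp(S) for anti-Hermitian S, via the eigendecomposition of the Hermitian matrix -iS."""
    evals, evecs = np.linalg.eigh(-1j * s)
    return (evecs * np.exp(1j * evals)) @ evecs.conj().T

def bch_truncated(h, s, order):
    """H + [H,S] + [[H,S],S]/2! + ... keeping `order` nested commutators."""
    total = np.array(h, dtype=complex)
    term = np.array(h, dtype=complex)
    for k in range(1, order + 1):
        term = comm(term, s) / k      # builds the 1/k! prefactor step by step
        total = total + term
    return total

dim = 4
h = random_hermitian(dim)
s = 0.01j * random_hermitian(dim)     # small anti-Hermitian generator, S^dagger = -S

exact = exp_antihermitian(-s) @ h @ exp_antihermitian(s)
errors = [np.linalg.norm(exact - bch_truncated(h, s, k)) for k in range(4)]
print(errors)
```

Each additional nested commutator should reduce the error by roughly one power of $\|S\|$, which is what makes an order-by-order solution for $S$ meaningful.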

To facilitate solving this perturbatively, we also split the perturbation $H'$ into its block-diagonal part $H_1$ and block-off-diagonal part $H_2$ with respect to $(A,B)$ subspaces. That is,

* $H_1$ acts within $A$ or within $B$ (it has zero matrix elements between $A$ and $B$),
* $H_2$ has only off-diagonal matrix elements (only connects states from $A$ to $B$ and vice versa).

In many cases, like the Jaynes–Cummings model, if we choose $A$ = “atom in ground state” and $B$ = “atom in excited state”, the perturbation $H'$ might itself be purely off-diagonal ($H_1=0, H_2=H'$), because it takes one between those sectors by flipping the qubit. But in general one can always decompose $V = H_1 + H_2$ accordingly.

Given this decomposition, the condition that $H_{\text{eff}}$ has no block-off-diagonal terms to first order means we want to cancel the $H_2$ by the transformation. We expand $S$ as well in a power series:

$$
S = \sum_{j=1}^\infty S^{(j)}\,,
$$

where $S^{(j)}$ is of order $\varepsilon^j$. The transformation is chosen such that at each order, the off-diagonal part is eliminated. The first few conditions one derives (by collecting terms order by order in $\varepsilon$ and setting off-diagonal parts to zero) are:

* **Order $\varepsilon^1$:** $[H_0, S^{(1)}] = -H_2$.

* **Order $\varepsilon^2$:** $[H_0, S^{(2)}] = -[H_1, S^{(1)}]$.

* **Order $\varepsilon^3$:** $[H_0, S^{(3)}] = -[H_1, S^{(2)}] - \frac{1}{3} [[H_2, S^{(1)}], S^{(1)}]$.

and so on. These equations come from requiring that the sum of off-diagonal terms at each order cancels out. We see that to first order, $S^{(1)}$ must satisfy $[H_0, S^{(1)}] = -H_2$. Since $H_0$ is diagonal in the chosen basis, this equation can be solved by choosing the matrix elements of $S^{(1)}$ between any $|p\rangle \in A$ and $|q\rangle \in B$ as:

$$
\langle p|S^{(1)}|q\rangle = -\,\frac{\langle p|H_2|q\rangle}{E_p^{(0)} - E_q^{(0)}} \,,
$$

and $\langle q|S^{(1)}|p\rangle$ is the negative conjugate to keep $S$ anti-Hermitian. In other words, in the energy basis of $H_0$, the off-diagonal elements of $S^{(1)}$ are given by the off-diagonal elements of $H_2$ divided by the energy differences. This ensures $H_2 + [H_0, S^{(1)}] = 0$ at order $\varepsilon$, cancelling the first-order mixing. We see that this formula is well-defined because $E_p^{(0)} - E_q^{(0)} \neq 0$ (no degeneracy) by assumption of a gap.
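The energy-denominator formula translates directly into a few array operations. The sketch below uses an assumed four-level toy spectrum (two states in $A$, two in $B$) and an assumed off-diagonal coupling; `first_order_generator` is a hypothetical helper name. It builds $S^{(1)}$ and checks that $[H_0, S^{(1)}] + H_2$ vanishes.

```python
import numpy as np

def first_order_generator(h0_diag, h2):
    """S1 with <p|S1|q> = -<p|H2|q> / (E_p - E_q) on the nonzero entries of H2."""
    energy_diff = h0_diag[:, None] - h0_diag[None, :]
    s1 = np.zeros_like(h2, dtype=complex)
    mask = h2 != 0                      # H2 only connects A and B, where E_p != E_q
    s1[mask] = -h2[mask] / energy_diff[mask]
    return s1

# Assumed toy spectrum: block A = states 0,1 and block B = states 2,3.
h0_diag = np.array([0.0, 0.1, 5.0, 5.3])
h0 = np.diag(h0_diag)

# Assumed block-off-diagonal perturbation H2 (Hermitian).
v_ab = np.array([[0.2, 0.1j], [0.05, -0.3]])
h2 = np.zeros((4, 4), dtype=complex)
h2[:2, 2:] = v_ab
h2[2:, :2] = v_ab.conj().T

s1 = first_order_generator(h0_diag, h2)
residual = h0 @ s1 - s1 @ h0 + h2       # should vanish: [H0, S1] = -H2
print(np.max(np.abs(residual)))
```

Anti-Hermiticity of $S^{(1)}$ comes out automatically here, because $H_2$ is Hermitian and the energy difference changes sign when $p$ and $q$ are swapped.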

At second order, after $S^{(1)}$ has eliminated the original $H_2$, new off-diagonal terms can be generated by the commutator of $H_1$ with $S^{(1)}$. One chooses $S^{(2)}$ such that $[H_0, S^{(2)}] = -[H_1, S^{(1)}]$, and so on. We won't solve all of these by hand here, but the equations can be solved systematically: each $S^{(j)}$ is chosen to cancel the remaining off-diagonal terms at order $j$, using the same energy-denominator construction as for $S^{(1)}$. It can be shown that this procedure yields a unique $S$, provided we fix the gauge by requiring that $S$ itself is strictly block-off-diagonal (which we typically do).

Having determined $S$ up to the desired order, we can then find the effective Hamiltonian up to that order. For example, up to second order in $\varepsilon$, the effective Hamiltonian (within block $A$ or block $B$) comes out to:

$$
H_{\text{eff}} = H_0 + H_1 + \frac{1}{2}[H_2, S^{(1)}] + O(\varepsilon^3)\,.
$$

Notice that $H_2$ does not appear explicitly (it has been transformed away to this order), but its influence remains in the last term $\frac{1}{2}[H_2, S^{(1)}]$, which is a second-order correction. This term collects two contributions of the BCH series: $[H_2, S^{(1)}]$ from the first commutator and $\frac{1}{2}[[H_0, S^{(1)}], S^{(1)}] = -\frac{1}{2}[H_2, S^{(1)}]$ from the second. Using $[H_0, S^{(1)}] = -H_2$ once more, it can equivalently be written as $-\frac{1}{2}[[H_0,S^{(1)}], S^{(1)}]$. Either way, this formula gives the effective Hamiltonian correct to second order.
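To see that the second-order formula is accurate up to $O(\varepsilon^3)$, one can compare the eigenvalues of the low-energy block of $H_{\text{eff}} = H_0 + H_1 + \frac{1}{2}[H_2, S^{(1)}]$ with exact diagonalization. The sketch below does this for an assumed four-level toy model (all matrix entries are illustrative assumptions) at two values of $\varepsilon$; halving $\varepsilon$ should reduce the error by roughly a factor of eight.

```python
import numpy as np

def comm(x, y):
    return x @ y - y @ x

# Assumed toy model: block A = states 0,1 (low energy), block B = states 2,3.
h0_diag = np.array([0.0, 0.1, 5.0, 5.3])
h0 = np.diag(h0_diag)

h1 = np.zeros((4, 4))                   # block-diagonal part of the perturbation
h1[0, 1] = h1[1, 0] = 0.3
h1[2, 3] = h1[3, 2] = -0.2

h2 = np.zeros((4, 4))                   # block-off-diagonal part of the perturbation
h2[:2, 2:] = [[0.5, 0.2], [-0.1, 0.4]]
h2[2:, :2] = h2[:2, 2:].T

def low_block_error(eps):
    """Max deviation between second-order SW and exact energies in block A."""
    energy_diff = h0_diag[:, None] - h0_diag[None, :]
    s1 = np.zeros((4, 4))
    mask = h2 != 0
    s1[mask] = -eps * h2[mask] / energy_diff[mask]
    h_eff = h0 + eps * h1 + 0.5 * comm(eps * h2, s1)
    approx = np.linalg.eigvalsh(h_eff[:2, :2])
    exact = np.linalg.eigvalsh(h0 + eps * (h1 + h2))[:2]
    return np.max(np.abs(approx - exact))

err_large, err_small = low_block_error(0.1), low_block_error(0.05)
print(err_large, err_small, err_large / err_small)
```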

In summary, the Schrieffer–Wolff method provides a formal way to derive $H_{\text{eff}}$ order by order. It essentially encodes the intuitive result from second-order perturbation theory that virtual transitions via intermediate states lead to energy shifts and effective interactions in the subspace of interest.

### 3.1.1 Schrieffer–Wolff for Full Diagonalization

The original formulation above aimed for block-diagonalization (decoupling subspace $A$ and $B$ but not necessarily diagonalizing each block completely). There is a variant where one can attempt to completely diagonalize the Hamiltonian (not just block-diagonalize) perturbatively, often by treating each energy eigenstate rather than whole blocks. This is discussed, for example, in Gambetta *et al.* [14 Appendix] and by Winkler [8] in a modified way.

In the Gambetta approach (as described in [14], which we reference in the thesis), one introduces $U = e^{iS}$ with a Hermitian generator $S$ (note the different convention: they use $e^{iS}$ instead of $e^{-S}$, so our anti-Hermitian generator corresponds to $-iS$ in their notation) and expands $H_{\text{eff}} = e^{iS} H e^{-iS} = \sum_{m=0}^\infty \varepsilon^m H^{(m)}$. They set up recurrence relations similar to those above. They also allow choosing $S$ such that not only the off-block elements vanish, but *all off-diagonal matrix elements in the energy eigenbasis of $H_0$ vanish*, which means $H_{\text{eff}}$ becomes fully diagonal in that basis (i.e., they directly aim for perturbative eigenvalues). The condition for full diagonalization is $\langle p|H_{\text{eff}}|q\rangle = 0$ for all $p \neq q$. From the perturbative expansion at order $m>0$, one finds (written here in the anti-Hermitian convention of the previous section):

$$
\langle p|S^{(m)}|q\rangle = -\,\frac{\langle p|H_x^{(m)}|q\rangle}{E^{(0)}_p - E^{(0)}_q} \qquad (p\neq q)\,,
$$

where $H_x^{(m)}$ includes the cumulative nested commutators up to order $m$ except those already diagonal. This generalizes the earlier formula for $S^{(1)}$. By choosing $S$ this way, one can iteratively eliminate off-diagonal elements and solve for corrections to eigenvalues.
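At lowest nontrivial order, full diagonalization reproduces the familiar second-order energies of non-degenerate perturbation theory, $E_p \approx E_p^{(0)} + \sum_{q\neq p} |V_{pq}|^2/(E_p^{(0)} - E_q^{(0)})$ for a purely off-diagonal $V$. The sketch below compares this with exact diagonalization; the spectrum and coupling matrix are assumed toy values.

```python
import numpy as np

e0 = np.array([0.0, 1.0, 2.5, 4.5])           # assumed non-degenerate bare energies
v = 0.05 * np.array([[0.0, 1.0, 0.5, 0.2],
                     [1.0, 0.0, 0.8, 0.3],
                     [0.5, 0.8, 0.0, 1.0],
                     [0.2, 0.3, 1.0, 0.0]])    # assumed weak, purely off-diagonal coupling

gaps = e0[:, None] - e0[None, :]
np.fill_diagonal(gaps, np.inf)                 # excludes q = p from the sum
second_order = e0 + np.sum(np.abs(v) ** 2 / gaps, axis=1)

exact = np.linalg.eigvalsh(np.diag(e0) + v)
err_bare = np.max(np.abs(exact - e0))          # error if the coupling is ignored
err_second = np.max(np.abs(exact - second_order))
print(err_bare, err_second)
```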

While this full diagonalization approach is conceptually a bit more involved, it yields formulas for energy shifts of individual levels. In practice, though, doing full diagonalization perturbatively to high order can be very cumbersome (the expressions blow up combinatorially). For our purposes, block-diagonalization (yielding $H_{\text{eff}}$ block diagonal) is sufficient to get the effective Hamiltonian acting on the subspace of interest (e.g. the qubit subspace) and that is what we will use for analyzing gates.

We will primarily use the block-diagonal SW (the first method described) to derive effective Hamiltonians. However, we mention the full diagonalization variant here because in Appendices C and D the authors use a Mathematica script based on Gambetta’s approach to get effective Hamiltonians for specific gate drives. The difference is mainly whether one stops at a block-diagonal effective Hamiltonian or continues to diagonalize each block.

### 3.1.2 Example: Jaynes–Cummings Hamiltonian via Schrieffer–Wolff

To see the Schrieffer–Wolff method in action, we apply it to the Jaynes–Cummings model introduced earlier. We will derive the dispersive (effective) Hamiltonian to second order in $g/\Delta$ using SW and see that it matches our earlier intuitive result.

The Jaynes–Cummings Hamiltonian (in the RWA) can be written as:

$$
H = H_0 + H' \,,
$$

where we choose:

* **Unperturbed part $H_0$:**

  $$
  H_0 = \omega_r\,a^\dagger a \;-\; \frac{\omega_q}{2}\,\sigma_z \,.
  $$

  Here $\omega_r$ is the resonator frequency (we now use the subscript $r$ for the resonator instead of $c$, and $\omega_q$ for the qubit instead of $\omega_a$, to align with typical circuit QED notation). $H_0$ is diagonal in the basis of states $|n\rangle_{\text{photon}}\otimes|s\rangle_{\text{qubit}} \equiv |n,s\rangle$ with $s \in \{0,1\}$, since $a^\dagger a |n\rangle = n|n\rangle$, $\sigma_z |0\rangle = +|0\rangle$ and $\sigma_z |1\rangle = -|1\rangle$ as before. The eigenenergies of $H_0$ are $E_{n,s}^{(0)} = \omega_r n - \frac{\omega_q}{2}(1 - 2\delta_{s,1})$; for instance $E_{n,0}^{(0)} = \omega_r n - \omega_q/2$ and $E_{n,1}^{(0)} = \omega_r n + \omega_q/2$.

* **Perturbation $H'$:**

  $$
  H' = g\Big(a\,\sigma_+ + a^\dagger\,\sigma_- \Big) \,.
  $$

  This term $H'$ causes transitions between the subspace where the qubit is in $|0\rangle$ and the subspace where it is in $|1\rangle$ by exchanging a photon. We identify $H_2 = H'$ entirely in this case, since $H'$ only connects states of different qubit excitation (it is purely off-diagonal in the qubit basis). There is no $H_1$ because $H'$ has no block-diagonal terms: $\langle n,s|H'|n',s'\rangle$ is nonzero only if $s \neq s'$.

Now, we define the two subspaces (blocks) $A$ and $B$ between which we want to eliminate coupling:

* Let $A$ be the subspace where the qubit is in $|0\rangle$ (ground state). This consists of the states $\{|n,0\rangle: n=0,1,2,\dots\}$.
* Let $B$ be the subspace where the qubit is in $|1\rangle$ (excited state). This consists of the states $\{|n,1\rangle: n=0,1,2,\dots\}$.

These blocks are labeled by the qubit state, and a state in $B$ lies $\omega_q$ above the state in $A$ with the same photon number. Strictly speaking, the spectra of the two blocks interleave, because the photon number adds energy in steps of $\omega_r$. What matters for the transformation is the energy difference between the states that $H'$ actually couples, $|m+1,0\rangle$ and $|m,1\rangle$, which is the detuning $\Delta = \omega_r - \omega_q$. As long as the coupling matrix elements are small compared to this splitting, $g\sqrt{m+1} \ll |\Delta|$, we can treat $g$ perturbatively.

**First-order SW (no effect):** At order $\varepsilon$ (with $\varepsilon$ corresponding to $g$ here), we solve $[H_0, S^{(1)}] = -H'$. In matrix element form:

$$
\langle n,0|S^{(1)}|m,1\rangle = - \frac{\langle n,0|H'|m,1\rangle}{E_{n,0}^{(0)} - E_{m,1}^{(0)}} 
$$

and $\langle m,1|S^{(1)}|n,0\rangle = -(\langle n,0|S^{(1)}|m,1\rangle)^*$ (so that $S^{(1)}$ is anti-Hermitian). Now, $\langle n,0|H'|m,1\rangle = g\,\langle n,0|a\,\sigma_+|m,1\rangle + g\,\langle n,0|a^\dagger\,\sigma_-|m,1\rangle$. The first piece vanishes identically, because $\sigma_+|1\rangle = 0$. The second piece is nonzero only when $n = m+1$ (because $a^\dagger$ increases the photon number by 1 and $\sigma_-$ takes $|1\rangle$ to $|0\rangle$). Conversely, $a\,\sigma_+$ supplies the Hermitian-conjugate element $\langle m,1|H'|m+1,0\rangle$ (because $a$ lowers the photon number by 1 and $\sigma_+$ takes $|0\rangle$ to $|1\rangle$). In both cases, $H'$ connects $|m,1\rangle \leftrightarrow |m+1, 0\rangle$ (this is the same pair we identified earlier as the JC coupling). So the only nonzero matrix elements are $\langle m+1,0|H'|m,1\rangle = g\sqrt{m+1}$ and $\langle m,1|H'|m+1,0\rangle = g\sqrt{m+1}$, using the harmonic oscillator matrix elements $\langle m+1|a^\dagger|m\rangle = \langle m|a|m+1\rangle = \sqrt{m+1}$.

Thus for each $m\ge0$, we have:

$$
\langle m+1,0|S^{(1)}|m,1\rangle = -\,\frac{g\sqrt{m+1}}{E_{m+1,0}^{(0)} - E_{m,1}^{(0)}} \,.
$$

Now, $E_{m+1,0}^{(0)} - E_{m,1}^{(0)} = [\omega_r (m+1) - \frac{\omega_q}{2}] - [\omega_r m + \frac{\omega_q}{2}] = \omega_r - \omega_q = \Delta$. So this difference is just the detuning $\Delta$, independent of $m$, and $\langle m+1,0|S^{(1)}|m,1\rangle = -\frac{g\sqrt{m+1}}{\Delta}$. The element $\langle m,1|S^{(1)}|m+1,0\rangle$ is the negative complex conjugate, $+\frac{g\sqrt{m+1}}{\Delta}$. We can simplify by writing $S^{(1)}$ in operator form. We recognize $\sum_{m\ge0} \sqrt{m+1}\,|m,1\rangle\langle m+1,0| = a\,\sigma_+$ (it removes a photon from $|m+1,0\rangle$ and excites the qubit, giving $|m,1\rangle$) and its Hermitian adjoint $\sum_{m\ge0} \sqrt{m+1}\,|m+1,0\rangle\langle m,1| = a^\dagger \sigma_-$. Thus, we deduce:

$$
S^{(1)} = -\,\frac{g}{\Delta}\Big( a^\dagger\,\sigma_- - a\,\sigma_+ \Big)\,. \tag{3.10}
$$

This is the generator $S$ to first order: an anti-Hermitian combination of $a^\dagger \sigma_-$ and $a \sigma_+$. Intuitively, this $S^{(1)}$ is proportional to the part of $H'$ that we want to cancel, with a phase ($-$ sign) and scaled by the energy denominator $\Delta$.
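Eq. (3.10) can be verified numerically on a truncated Fock space. In the sketch below the parameter values and the truncation `n_fock` are assumptions for illustration; the basis ordering is photon number first, then qubit state, with $\sigma_z|0\rangle = +|0\rangle$ as in the text. The defining relation $[H_0, S^{(1)}] = -H'$ holds exactly even with truncation, because $H_0$ is diagonal.

```python
import numpy as np

def comm(x, y):
    return x @ y - y @ x

omega_r, omega_q, g = 7.0, 5.0, 0.1       # assumed illustrative values
delta = omega_r - omega_q
n_fock = 6                                 # assumed Fock-space truncation

a_res = np.diag(np.sqrt(np.arange(1, n_fock)), k=1)   # truncated annihilation operator
sz = np.diag([1.0, -1.0])                  # basis (|0>, |1>), sigma_z|0> = +|0>
sp = np.array([[0.0, 0.0], [1.0, 0.0]])    # sigma_+ = |1><0|
sm = sp.T                                  # sigma_- = |0><1|

# Composite operators on (resonator) x (qubit).
a = np.kron(a_res, np.eye(2))
ad = a.T
sigma_z = np.kron(np.eye(n_fock), sz)
sigma_p = np.kron(np.eye(n_fock), sp)
sigma_m = np.kron(np.eye(n_fock), sm)

h0 = omega_r * ad @ a - 0.5 * omega_q * sigma_z
h_int = g * (a @ sigma_p + ad @ sigma_m)
s1 = -(g / delta) * (ad @ sigma_m - a @ sigma_p)      # Eq. (3.10)

residual = comm(h0, s1) + h_int            # should vanish: [H0, S1] = -H'
print(np.max(np.abs(residual)))
```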

Now we compute the effective Hamiltonian to second order. Using Eq. (3.6) specialized to our case ($H_1=0, H_2=H'$), we have:

$$
H_{\text{eff}} = H_0 + \frac{1}{2}[H', S^{(1)}] + O(g^3)\,. 
$$

We plug in $S^{(1)}$ and $H'$. Remember $H' = g(a\sigma_+ + a^\dagger \sigma_-)$ and $S^{(1)} = -(g/\Delta)(a^\dagger \sigma_- - a\sigma_+)$. So:

$$
[H', S^{(1)}] = \left[g(a\sigma_+ + a^\dagger\sigma_-) \,,\, -\frac{g}{\Delta}(a^\dagger\sigma_- - a\sigma_+) \right] \,.
$$

We can factor $g \cdot -(g/\Delta) = -g^2/\Delta$ outside. So

$$
[H', S^{(1)}] = -\,\frac{g^2}{\Delta} \Big[\, a\sigma_+ + a^\dagger\sigma_- \,,\, a^\dagger\sigma_- - a\sigma_+ \Big]\,.
$$

Now we calculate the commutator inside. We treat $a$ and $\sigma$ as independent sets (bosonic operators commute with the $\sigma$ operators acting on the qubit Hilbert space). So:

$$
[a\sigma_+,\, a^\dagger\sigma_-] = a a^\dagger\,\sigma_+ \sigma_- - a^\dagger a\, \sigma_- \sigma_+ \,.
$$

We know $a a^\dagger = a^\dagger a + 1$ and $\sigma_+ \sigma_- = |1\rangle\langle1| = \frac{1}{2}(I - \sigma_z)$, $\sigma_- \sigma_+ = |0\rangle\langle0| = \frac{1}{2}(I + \sigma_z)$. Substituting:

$$
a a^\dagger\,\sigma_+\sigma_- = (a^\dagger a + 1)\,\frac{1}{2}(I - \sigma_z)\,,
$$

$$
a^\dagger a\,\sigma_-\sigma_+ = a^\dagger a\,\frac{1}{2}(I + \sigma_z)\,.
$$

So

$$
[a\sigma_+,\, a^\dagger\sigma_-] = \frac{1}{2}\Big[(a^\dagger a + 1)(I - \sigma_z) - a^\dagger a (I + \sigma_z)\Big] \,.
$$

Expanding the bracket gives $(a^\dagger a + 1)(I - \sigma_z) - a^\dagger a (I + \sigma_z) = a^\dagger a - a^\dagger a\,\sigma_z + I - \sigma_z - a^\dagger a - a^\dagger a\,\sigma_z = I - \sigma_z - 2a^\dagger a\,\sigma_z$: the two $a^\dagger a$ terms cancel and the two $-a^\dagger a\,\sigma_z$ terms add. So

$$
[a\sigma_+,\, a^\dagger\sigma_-] = \frac{1}{2}\Big( I - \sigma_z - 2a^\dagger a\,\sigma_z \Big)\,.
$$

The remaining terms follow from linearity, $[A+B,\, C+D] = [A,C] + [A,D] + [B,C] + [B,D]$, with $A = a\sigma_+$, $B = a^\dagger \sigma_-$, $C = a^\dagger\sigma_-$, $D = -a\sigma_+$:

* $[A,C] = [a\sigma_+, a^\dagger\sigma_-]$ is the commutator computed above.
* $[A,D] = -[a\sigma_+, a\sigma_+] = 0$, since any operator commutes with itself.
* $[B,C] = [a^\dagger\sigma_-, a^\dagger\sigma_-] = 0$ for the same reason.
* $[B,D] = -[a^\dagger\sigma_-, a\sigma_+] = +[a\sigma_+, a^\dagger\sigma_-]$, by antisymmetry of the commutator.

The two nonvanishing contributions are equal, so

$$
\Big[\, a\sigma_+ + a^\dagger\sigma_- \,,\, a^\dagger\sigma_- - a\sigma_+ \Big] = 2\,[a\sigma_+, a^\dagger\sigma_-]\,.
$$

Restoring the prefactor $-g^2/\Delta$:

$$
[H', S^{(1)}] = \; -\frac{g^2}{\Delta} \,2\,[a\sigma_+, a^\dagger\sigma_-] 
            = -\frac{2g^2}{\Delta} \cdot \frac{1}{2}(I - \sigma_z - 2a^\dagger a\,\sigma_z)\,,
$$

since we found $[a\sigma_+, a^\dagger\sigma_-] = \frac{1}{2}(I - \sigma_z - 2a^\dagger a\,\sigma_z)$.

The $2$ cancels the $\frac{1}{2}$, leaving:

$$
[H', S^{(1)}] = -\frac{g^2}{\Delta}\Big( I - \sigma_z - 2a^\dagger a\,\sigma_z \Big)\,.
$$
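The commutator algebra can be double-checked with explicit matrices. One caveat, stated as an assumption of this sketch: on a truncated Fock space the relation $a a^\dagger = a^\dagger a + 1$ fails in the highest level, so the comparison is restricted to the block below it. Parameter values and the truncation are illustrative.

```python
import numpy as np

def comm(x, y):
    return x @ y - y @ x

omega_r, omega_q, g = 7.0, 5.0, 0.1       # assumed illustrative values
delta = omega_r - omega_q
n_fock = 8                                 # assumed Fock-space truncation

a = np.kron(np.diag(np.sqrt(np.arange(1, n_fock)), k=1), np.eye(2))
ad = a.T
sigma_z = np.kron(np.eye(n_fock), np.diag([1.0, -1.0]))               # sigma_z|0> = +|0>
sigma_p = np.kron(np.eye(n_fock), np.array([[0.0, 0.0], [1.0, 0.0]]))  # |1><0|
sigma_m = sigma_p.T
identity = np.eye(2 * n_fock)

h_int = g * (a @ sigma_p + ad @ sigma_m)
s1 = -(g / delta) * (ad @ sigma_m - a @ sigma_p)

diag_structure = identity - sigma_z - 2 * ad @ a @ sigma_z
single = comm(a @ sigma_p, ad @ sigma_m)   # expected: diag_structure / 2
full = comm(h_int, s1)                     # expected: -(g^2 / delta) * diag_structure

keep = 2 * (n_fock - 1)                    # drop the top Fock level (truncation artifact)
err_single = np.max(np.abs((single - 0.5 * diag_structure)[:keep, :keep]))
err_full = np.max(np.abs((full + (g**2 / delta) * diag_structure)[:keep, :keep]))
print(err_single, err_full)
```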

Now the effective Hamiltonian to second order is:

$$
H_{\text{eff}} = H_0 + \frac{1}{2}[H', S^{(1)}] = \omega_r a^\dagger a - \frac{\omega_q}{2}\sigma_z \;-\; \frac{g^2}{2\Delta}( I - \sigma_z - 2a^\dagger a\,\sigma_z)\,. 
$$

Distribute the $-\frac{g^2}{2\Delta}$:

$$
H_{\text{eff}} = \omega_r a^\dagger a - \frac{\omega_q}{2}\sigma_z - \frac{g^2}{2\Delta} I + \frac{g^2}{2\Delta}\sigma_z + \frac{g^2}{\Delta} a^\dagger a\,\sigma_z\,.
$$

We can drop the $-\frac{g^2}{2\Delta}I$ term, since it's just an overall constant energy shift (it commutes with everything and does not affect dynamics). Then:

$$
H_{\text{eff}} = \omega_r a^\dagger a + \frac{g^2}{\Delta}\,a^\dagger a\,\sigma_z \;+\; \Big(\frac{g^2}{2\Delta} - \frac{\omega_q}{2}\Big)\sigma_z \,. \tag{3.11}
$$

It is clearer to rewrite that constant in front of $\sigma_z$: $ \frac{g^2}{2\Delta} - \frac{\omega_q}{2} = -\frac{1}{2}\Big(\omega_q - \frac{g^2}{\Delta}\Big)$. So

$$
H_{\text{eff}} = \omega_r a^\dagger a + \frac{g^2}{\Delta} a^\dagger a\,\sigma_z \;-\; \frac{\omega_q - \frac{g^2}{\Delta}}{2}\,\sigma_z \,.
$$

This is exactly of the form we expected: compare to the form we anticipated in Chapter 2 (dispersive regime):

* The resonator term $\omega_r a^\dagger a$ is modified by an additional $\frac{g^2}{\Delta}\sigma_z$, meaning the resonator frequency effectively becomes $\omega_r \pm \frac{g^2}{\Delta}$ depending on the qubit state ($\sigma_z = \pm 1$). In other words, the cavity frequency is pulled by $\chi = \frac{g^2}{\Delta}$ depending on the qubit being excited or not. This is the **AC Stark shift / dispersive shift** on the cavity mode.

* The qubit term $-\frac{\omega_q}{2}\sigma_z$ is modified to $-\frac{1}{2}(\omega_q - \frac{g^2}{\Delta})\sigma_z$. That is, the qubit frequency is renormalized from $\omega_q$ to $\omega_q - \frac{g^2}{\Delta}$. This is often called the **Lamb shift** (the qubit’s own energy is shifted due to interaction with the virtual photons in the cavity). It is simply a small correction to the qubit frequency (notice this is independent of photon number, it’s just $\sigma_z$ with a new coefficient).

Thus, we can identify $\chi = \frac{g^2}{\Delta}$ as the dispersive coupling strength (or Stark shift per photon). We can rewrite the effective Hamiltonian in a perhaps cleaner way as:

$$
H_{\text{eff}} = \Big(\omega_r + \chi\,\sigma_z\Big) a^\dagger a \;-\; \frac{\omega_q - \chi}{2}\,\sigma_z \,,
$$

where $\chi = \frac{g^2}{\Delta}$. Expanding this out confirms it matches Eq. (3.11) (up to dropping constant terms). This **effective Hamiltonian** is valid to second order in $g/\Delta$ and is exactly the form of a dispersive Hamiltonian we expected: no term remains that directly flips $\sigma$ or creates/annihilates $a$ (those were the off-diagonals eliminated); instead, the coupling manifests as a $\sigma_z a^\dagger a$ term and a small shift in the $\sigma_z$ term (qubit frequency).

This result derived using the Schrieffer–Wolff formalism is consistent with what we found in Chapter 2 by intuitive reasoning: the cavity frequency is shifted by $\pm g^2/\Delta$ depending on the qubit state (enabling dispersive readout), and the qubit frequency is slightly shifted by $-g^2/\Delta$ (Lamb shift). The dispersive approximation is valid as long as $|g/\Delta| \ll 1$ (more precisely, $g\sqrt{n+1} \ll |\Delta|$ for the photon numbers $n$ that are populated), in which case the neglected higher-order terms are very small. If $|\Delta|$ is not large compared to $g\sqrt{n+1}$, the perturbation series converges poorly, and one would have to include more terms or numerically diagonalize the small blocks.
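The quality of the dispersive approximation can be gauged by comparing the exact Jaynes–Cummings spectrum with the energies predicted by $H_{\text{eff}}$, including the constant $-g^2/(2\Delta)$ that was dropped above. The sketch below is illustrative only: parameter values and truncation are assumptions, and dressed levels are matched to bare states by largest overlap, which is reasonable only deep in the dispersive regime.

```python
import numpy as np

omega_r, omega_q, g = 7.0, 5.0, 0.05      # assumed illustrative values, g/Delta = 0.025
delta = omega_r - omega_q
chi = g**2 / delta
n_fock = 10                                # assumed Fock-space truncation

a = np.kron(np.diag(np.sqrt(np.arange(1, n_fock)), k=1), np.eye(2))
ad = a.T
sigma_z = np.kron(np.eye(n_fock), np.diag([1.0, -1.0]))               # sigma_z|0> = +|0>
sigma_p = np.kron(np.eye(n_fock), np.array([[0.0, 0.0], [1.0, 0.0]]))  # |1><0|
sigma_m = sigma_p.T

h_jc = omega_r * ad @ a - 0.5 * omega_q * sigma_z + g * (a @ sigma_p + ad @ sigma_m)
evals, evecs = np.linalg.eigh(h_jc)

def dressed_energy(n, s):
    """Exact eigenvalue whose eigenvector has the largest weight on the bare state |n, s>."""
    bare_index = 2 * n + s
    return evals[np.argmax(np.abs(evecs[bare_index, :]))]

def dispersive_energy(n, s):
    """Energy of |n, s> under the second-order effective Hamiltonian (constant included)."""
    z = 1.0 if s == 0 else -1.0
    return omega_r * n + chi * n * z - 0.5 * (omega_q - chi) * z - 0.5 * chi

max_error = max(abs(dressed_energy(n, s) - dispersive_energy(n, s))
                for n in range(3) for s in (0, 1))
print(f"chi = {chi:.3e}, max deviation = {max_error:.3e}")
```

For these values the deviation is well below $\chi$ itself; increasing `g` or the photon number makes it grow, reflecting the $g\sqrt{n+1} \ll |\Delta|$ condition.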

Finally, note that if we had multiple excitations, this SW method still effectively decouples the subspaces with fixed excitation number, yielding similar results for each manifold. The result we have is already general: $a^\dagger a$ is the photon number operator, so it covers manifolds with arbitrary $n$. Also, it is interesting that if one had two qubits in the cavity, a similar SW approach (treating qubit excitations as separate blocks) would yield an effective qubit–qubit interaction mediated by the virtual photons (at second order in $g$, an exchange term of the form $\sigma_{+}^{(1)}\sigma_{-}^{(2)} + \text{h.c.}$), as we shall see later in Section 4.2.1 by another method.

### 3.1.3 Including Counter-Rotating Terms: Bloch–Siegert Shift

The Jaynes–Cummings model used the rotating wave approximation (RWA) to drop the counter-rotating terms. What if we include those terms (i.e., consider the full interaction $g(a + a^\dagger)(\sigma_+ + \sigma_-)$ without the RWA)? In that case, the Hamiltonian (often called the **Rabi model** when not making RWA) is:

$$
H_{\text{full}} = \omega_c\,a^\dagger a - \frac{\omega_a}{2}\sigma_z + g\,(a + a^\dagger)(\sigma_+ + \sigma_-)\,. 
$$

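This can be written as $H_{\text{full}} = H_0 + H'_{\text{full}}$, where $H_0$ is the same as before (we revert here to the Chapter 2 symbols, with $\omega_c \equiv \omega_r$, $\omega_a \equiv \omega_q$, and $\Delta = \omega_c - \omega_a$) and now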

$$
H'_{\text{full}} = g(a\sigma_+ + a^\dagger\sigma_- + a\sigma_- + a^\dagger\sigma_+)\,.
$$

The extra terms compared to $H'$ are $a\sigma_-$ and $a^\dagger\sigma_+$, which oscillate at frequency $\omega_\Sigma = \omega_c+\omega_a$ in the interaction picture. We can attempt the SW transformation on this full Hamiltonian as well. The first-order $S^{(1)}$ would now be chosen to eliminate *both* the resonant and the non-resonant terms to first order. That is, we would have a condition $[H_0, S^{(1)}] = -H'_{\text{full}}$. In matrix elements, it means:

$$
\langle p|S^{(1)}|q\rangle = -\frac{\langle p|H'_{\text{full}}|q\rangle}{E_p^{(0)} - E_q^{(0)}}\,.
$$

Now $H'_{\text{full}}$ connects not only $|m+1,0\rangle \leftrightarrow |m,1\rangle$ (resonant terms) but also $|m,0\rangle \leftrightarrow |m+1,1\rangle$ (the counter-rotating terms, which create a photon *and* excite the qubit, or annihilate a photon *and* de-excite the qubit). For example, $a\sigma_-$ acting on $|m,1\rangle$ gives $\sqrt{m}\,|m-1,0\rangle$ and $a^\dagger\sigma_+$ on $|m,0\rangle$ gives $\sqrt{m+1}\,|m+1,1\rangle$. These processes virtually couple states differing by two excitations, so the energy denominator for them is large: $E_{m+1,1}^{(0)} - E_{m,0}^{(0)} = \omega_c+\omega_a = \omega_\Sigma$. By doing the algebra (which is a bit lengthy but straightforward), one finds that the effective Hamiltonian to second order acquires an extra term from these counter-rotating processes.

The final effective Hamiltonian (to second order) including counter-rotating effects can be written, dropping constants and the two-photon terms $\propto a^2\sigma_z,\, a^{\dagger 2}\sigma_z$ that arise from cross terms between the resonant and counter-rotating parts, as:

$$
H_{\text{eff(full)}} = \omega_c a^\dagger a - \frac{\omega_a}{2}\sigma_z + \frac{g^2}{\Delta}\,a^\dagger a\,\sigma_z + \frac{g^2}{2\Delta}\sigma_z - \frac{g^2}{2\omega_\Sigma}(2a^\dagger a + 1)\,\sigma_z \,. \tag{3.12}
$$

The first four terms reproduce the RWA result, Eq. (3.11); the additional term $-\frac{g^2}{2\omega_\Sigma}(2a^\dagger a + 1)\sigma_z$ comes from the counter-rotating terms. Its $(2a^\dagger a + 1)\sigma_z$ structure contains both a piece proportional to $a^\dagger a\, \sigma_z$ and a piece proportional to $\sigma_z$ (from the $+1$). Specifically, it changes the coefficient of $\sigma_z$ by $-\frac{g^2}{2\omega_\Sigma}$, which raises the qubit frequency by $\frac{g^2}{\omega_\Sigma}$ (recall that the qubit term is $-\frac{\omega_a}{2}\sigma_z$), and it changes the dispersive coefficient of $a^\dagger a\,\sigma_z$ by $-\frac{g^2}{\omega_\Sigma}$.

If one simplifies $H_{\text{eff(full)}}$, grouping terms:

* The coefficient in front of $a^\dagger a\, \sigma_z$ becomes $\frac{g^2}{\Delta} - \frac{g^2}{\omega_\Sigma}$. Relative to the RWA value $g^2/\Delta$, the correction is of order $\Delta/\omega_\Sigma$; for $|\Delta| \ll \omega_c$ we have $\omega_\Sigma \approx 2\omega_c$, so it is small.

* The coefficient in front of $\sigma_z$ becomes $-\frac{1}{2}\Big(\omega_a - \frac{g^2}{\Delta} + \frac{g^2}{\omega_\Sigma}\Big)$, i.e., the qubit frequency is renormalized to $\omega_a - \frac{g^2}{\Delta} + \frac{g^2}{\omega_\Sigma}$. If $\omega_\Sigma \gg |\Delta|$, the additional shift is very small compared to the Lamb shift.

Thus, the main qualitative effect of including counter-rotating terms is an extra small positive shift $+\frac{g^2}{\omega_\Sigma}$ of the qubit frequency, known as the **Bloch–Siegert shift** (the term $-\frac{g^2}{2\omega_\Sigma}\sigma_z$), and a tiny correction to the dispersive photon shift (the $-\frac{g^2}{\omega_\Sigma}a^\dagger a\, \sigma_z$ part). The Bloch–Siegert shift is often negligible in the dispersive regime because $\omega_\Sigma \gg |\Delta|$. However, when the detuning $|\Delta|$ becomes a sizable fraction of $\omega_\Sigma$, one might want to include it for accuracy. It is a second-order effect of the counter-rotating coupling.

In summary, the Schrieffer–Wolff method can be extended to include the non-resonant terms, and it yields corrections such as the Bloch–Siegert shift. In most of our discussions, we assume the RWA (justified when $g \ll \omega_r,\omega_q$ and $|\Delta| \ll \omega_r + \omega_q$, as is typical in transmon devices), but it is good to know what is dropped. The SW method gives us a systematic handle to recover those terms if needed.
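The size of the counter-rotating correction can be estimated by diagonalizing the Rabi model numerically and comparing the dressed qubit frequency at zero photons with two second-order predictions: the RWA value $\omega_q - g^2/\Delta$ and the value including the counter-rotating contribution, $\omega_q - g^2/\Delta + g^2/\omega_\Sigma$. The parameters and truncation below are assumptions for illustration, and levels are again matched to bare states by largest overlap.

```python
import numpy as np

omega_r, omega_q, g = 5.0, 4.0, 0.05      # assumed illustrative values
delta = omega_r - omega_q
omega_sigma = omega_r + omega_q
n_fock = 20                                # assumed Fock-space truncation

a = np.kron(np.diag(np.sqrt(np.arange(1, n_fock)), k=1), np.eye(2))
ad = a.T
sigma_z = np.kron(np.eye(n_fock), np.diag([1.0, -1.0]))               # sigma_z|0> = +|0>
sigma_p = np.kron(np.eye(n_fock), np.array([[0.0, 0.0], [1.0, 0.0]]))  # |1><0|
sigma_m = sigma_p.T

# Full Rabi coupling, without the rotating wave approximation.
h_rabi = omega_r * ad @ a - 0.5 * omega_q * sigma_z + g * (a + ad) @ (sigma_p + sigma_m)
evals, evecs = np.linalg.eigh(h_rabi)

def dressed_energy(bare_index):
    """Exact eigenvalue whose eigenvector has the largest weight on the given bare state."""
    return evals[np.argmax(np.abs(evecs[bare_index, :]))]

# Bare index 2*n + s: index 0 is |0 photons, qubit 0>, index 1 is |0 photons, qubit 1>.
qubit_freq = dressed_energy(1) - dressed_energy(0)

predicted_rwa = omega_q - g**2 / delta
predicted_full = predicted_rwa + g**2 / omega_sigma
print(f"numerical      : {qubit_freq:.7f}")
print(f"RWA only       : {predicted_rwa:.7f}")
print(f"with CR terms  : {predicted_full:.7f}")
```

The remaining discrepancy of the full prediction is a fourth-order effect, so it shrinks rapidly as `g` is reduced.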

### 3.1.4 Effective Hamiltonian for the Full Transmon–Resonator System

Up to now, our example treated the qubit as a two-level system. But the transmon is really a multi-level (anharmonic oscillator) system. Can we derive an effective Hamiltonian for a transmon coupled to a resonator without truncating the transmon to two levels? Yes, we can apply SW to the full multi-level system, treating the coupling as a perturbation. The approach is the same but algebraically heavier because the Hilbert space of the transmon is infinite-dimensional (or we take a finite number of levels).

One strategy (used in the thesis and by Gambetta et al. [14]) is to treat the transmon as an anharmonic oscillator (Duffing oscillator) with Hamiltonian:

$$
H_{\text{transmon}} = \omega_q\, b^\dagger b + \frac{\eta}{2} b^{\dagger 2} b^2 \,,
$$

where $b, b^\dagger$ are transmon lowering/raising operators and $\eta$ is the anharmonicity (negative for a transmon, $|\eta| \ll \omega_q$). The resonator is $H_{\text{res}} = \omega_r a^\dagger a$. The coupling is something like $g(b + b^\dagger)(a + a^\dagger)$ (assuming a linear coupling). One then splits $H_0 = \omega_r a^\dagger a + \omega_q b^\dagger b + \frac{\eta}{2}b^{\dagger2}b^2$ and $H' = g(b + b^\dagger)(a + a^\dagger)$.

If the detuning between the resonator and *each* transition of the transmon is large compared to $g$, one can perform SW. The result will be an effective Hamiltonian that is diagonal in the number basis of both oscillator and transmon. Essentially, it will be of the form (generalizing the two-level result):

$$
H_{\text{eff}} = \omega_r a^\dagger a + \sum_{n} \Big(\omega_n + \chi_n\, a^\dagger a\Big) |n\rangle\langle n| + \text{const}\,,
$$

where $|n\rangle$ is the transmon’s $n$th level, $\omega_n$ its energy relative to the ground state (so $\omega_1 \approx \omega_q$, $\omega_2 \approx 2\omega_q + \eta$, etc., up to Lamb shifts from the coupling), and $\chi_n$ are dispersive shifts that depend on the transmon state $n$. In other words, the cavity frequency will shift by an amount that depends on whether the transmon is in state $|0\rangle$ or $|1\rangle$ or $|2\rangle$, etc. This is essentially a multi-level generalization of Eq. (3.11).

One concrete result follows from second-order perturbation theory in the rotating-wave approximation (see e.g. Koch *et al.*, PRA 76, 042319 (2007), or Gambetta’s work). The ladder operators connect neighboring transmon levels $|n\rangle$ and $|n+1\rangle$ with matrix element $\sqrt{n+1}\,g$, so a transmon in state $|n\rangle$ shifts the resonator by $\chi_n = g^2 \left(\frac{n+1}{\Delta_{n+1}} - \frac{n}{\Delta_n}\right)$, where $\Delta_n = \omega_r - (\omega_n - \omega_{n-1})$ is the detuning between the resonator and the $n-1 \to n$ transition of the transmon. For example, $\chi_0 = g^2/\Delta_1$ and $\chi_1 = g^2(2/\Delta_2 - 1/\Delta_1)$, so the resonator frequency differs by $\chi_1 - \chi_0 = 2g^2(1/\Delta_2 - 1/\Delta_1)$ between transmon states $|1\rangle$ and $|0\rangle$. When the anharmonicity is large ($|\Delta_2| \to \infty$), these formulas reduce to the two-level case, $\chi_0 = +g^2/\Delta$ and $\chi_1 = -g^2/\Delta$. In the transmon the anharmonicity is small, so the $1 \to 2$ transition partially cancels the $0 \to 1$ contribution and the state-dependent shift is reduced. The $\chi_n$ also differ for different $n$: if the transmon accidentally reaches $|2\rangle$, the cavity shift changes again (which could be used for qutrit readout, or cause errors if not accounted for).
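As a numerical sanity check of the multi-level shifts, the sketch below diagonalizes a Duffing-oscillator transmon coupled to a resonator and compares the exact cavity pull with the second-order expression $\chi_n = g^2\left[(n+1)/\Delta_{n+1} - n/\Delta_n\right]$. It is illustrative only: the coupling is taken in the rotating-wave form $g(a b^\dagger + a^\dagger b)$, and the parameter values (in GHz), truncation sizes, and function names are assumptions made for this example.

```python
import numpy as np

def lowering(dim):
    """Truncated lowering operator for an oscillator with `dim` levels."""
    return np.diag(np.sqrt(np.arange(1, dim)), k=1)

def cavity_pulls(omega_r, omega_q, eta, g, n_levels=6, n_photons=6, n_max=3):
    """Return (exact, second-order) cavity pulls chi_n for n = 0 .. n_max-1."""
    b, a = lowering(n_levels), lowering(n_photons)
    Ib, Ia = np.eye(n_levels), np.eye(n_photons)
    nb = b.T @ b
    h_transmon = omega_q * nb + 0.5 * eta * nb @ (nb - Ib)   # Duffing oscillator
    H0 = np.kron(h_transmon, Ia) + omega_r * np.kron(Ib, a.T @ a)
    V = g * (np.kron(b.T, a) + np.kron(b, a.T))               # RWA coupling (assumption)
    evals, evecs = np.linalg.eigh(H0 + V)

    def dressed(n, m):
        # Dressed level with the largest overlap on the bare state |n, m>.
        bare_index = n * n_photons + m
        return evals[np.argmax(np.abs(evecs[bare_index, :]))]

    def detuning(n):
        # Delta_n = omega_r - (omega_n - omega_{n-1}) for the Duffing spectrum.
        return omega_r - (omega_q + eta * (n - 1))

    exact = np.array([dressed(n, 1) - dressed(n, 0) - omega_r for n in range(n_max)])
    approx = np.array([g**2 * ((n + 1) / detuning(n + 1) - n / detuning(n))
                       for n in range(n_max)])
    return exact, approx

# Hypothetical parameters in GHz: resonator 7, transmon 5, anharmonicity -0.3.
chi_exact, chi_pt = cavity_pulls(omega_r=7.0, omega_q=5.0, eta=-0.3, g=0.05)
```

The two arrays agree up to fourth-order corrections in $g/\Delta$, and the shifts are visibly different for $n = 0, 1, 2$.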

We won't derive the full multi-level formula in detail here, but it is good to know the SW formalism can handle it. Essentially one uses the same commutator approach but now sums over many levels. The appendix of [14] by Gambetta provides these formulas, and Appendix B of this thesis collects some useful commutator identities to facilitate such calculations (e.g., formulas for nested commutators used in SW).

In conclusion, the Schrieffer–Wolff transformation is a powerful perturbative tool to derive effective Hamiltonians. We demonstrated it on the Jaynes–Cummings model, reproducing the dispersive Hamiltonian and identifying key effects (Stark shift, Lamb shift, Bloch–Siegert shift). We hinted at the multi-level generalization for a transmon. This sets the stage for understanding how two-qubit gates work: in Chapter 4, we will use the SW idea to derive effective interactions between qubits. Before that, however, we will cross-check SW against two other methods (adiabatic elimination and time-averaging) to deepen confidence in these results.

## 3.2 Adiabatic Elimination

**Adiabatic elimination** is another perturbative approach commonly used in physics and quantum optics to eliminate the effect of a rapidly decaying or highly detuned level. The idea is to treat the dynamics of certain degrees of freedom as instantaneous (or very fast) compared to others, so that those degrees of freedom remain approximately in steady-state (or unexcited) and can be "eliminated" from the equations of motion, leaving an effective interaction among the remaining degrees.

In the context of the Jaynes–Cummings model, one can *adiabatically eliminate* the photon mode if the detuning is large and the photons only get virtually excited. Alternatively, one can eliminate the qubit’s excited state in a lambda system, etc. The typical procedure for adiabatic elimination in a Hamiltonian setting is:

* Write down the equations of motion (Heisenberg equations or Schrödinger picture with time-dependent perturbation theory).
* Assume the variable to be eliminated has zero derivative (steady state or negligible dynamics) and solve for it in terms of others.
* Back-substitute to get an effective evolution for the remaining variables.

Textbook illustrations of adiabatic elimination often use a three-level system in a lambda configuration whose upper level is far detuned (as in a Raman process); we return to that case in Section 3.2.2. Here we illustrate the method by re-deriving the dispersive shift of the Jaynes–Cummings model from the equations of motion, which is more intuitive than the SW algebra.

### 3.2.1 Jaynes–Cummings via Adiabatic Elimination

We revisit the JC Hamiltonian:

$$
H_{\text{JC}} = \omega_r a^\dagger a - \frac{\omega_q}{2}\sigma_z + g(a\sigma_+ + a^\dagger \sigma_-)\,.
$$

In the dispersive regime ($\Delta = \omega_r - \omega_q$ large), the photon number changes only virtually. If we start in a state with no photon and qubit excited, for example, the qubit will emit a photon only to reabsorb it—there is no resonant exchange. We can derive an effective Hamiltonian by eliminating these virtual processes.

We use the Heisenberg equations of motion, $\dot{O} = i[H, O]$ (with $\hbar = 1$). In the convention of $H_{\text{JC}}$ above, the ground state has $\sigma_z = +1$ and $\sigma_-$ lowers the qubit, so that $[\sigma_z, \sigma_-] = 2\sigma_-$ and $[\sigma_+, \sigma_-] = -\sigma_z$. The commutators are:

* $[H, a]$: the resonator term gives $[\omega_r a^\dagger a, a] = -\omega_r a$, the qubit term commutes with $a$, and in the coupling $[a\sigma_+, a] = 0$ while $[a^\dagger \sigma_-, a] = [a^\dagger, a]\sigma_- = -\sigma_-$. Hence $[H, a] = -\omega_r a - g\,\sigma_-$.
* $[H, \sigma_-]$: the resonator term commutes with $\sigma_-$ (different spaces), the qubit term gives $[-\frac{\omega_q}{2}\sigma_z, \sigma_-] = -\omega_q \sigma_-$, and in the coupling $[a^\dagger\sigma_-, \sigma_-] = 0$ while $[a\sigma_+, \sigma_-] = a[\sigma_+, \sigma_-] = -a\,\sigma_z$. Hence $[H, \sigma_-] = -\omega_q \sigma_- - g\,a\,\sigma_z$.

The equations of motion are therefore:

$$
\dot{a} = -i \omega_r a - i g\, \sigma_- \,,
$$

$$
\dot{\sigma}_- = -i \omega_q \sigma_- - i g\,a\,\sigma_z \,.
$$

The cavity field evolves at frequency $\omega_r$ and is driven by the qubit's lowering operator, and vice versa. Both $a$ and $\sigma_-$ oscillate rapidly (at $\omega_r$ and $\omega_q$), so setting $\dot{\sigma}_- = 0$ directly in the lab frame is not justified: it would give a response proportional to $g/\omega_q$ rather than $g/\Delta$. What is slow is the envelope. We therefore move to a frame rotating at $\omega_r$, writing $a = \tilde{a}\,e^{-i\omega_r t}$ and $\sigma_- = \tilde{\sigma}_-\,e^{-i\omega_r t}$:

$$
\dot{\tilde{a}} = -i g\, \tilde{\sigma}_- \,, \qquad \dot{\tilde{\sigma}}_- = i \Delta\, \tilde{\sigma}_- - i g\,\tilde{a}\,\sigma_z \,.
$$

For $|\Delta| \gg g$ the qubit coherence $\tilde{\sigma}_-$ oscillates at the large frequency $\Delta$ around a value that follows the slowly varying field. Setting $\dot{\tilde{\sigma}}_- \approx 0$ (and treating $\sigma_z$ as constant on this time scale) gives

$$
\tilde{\sigma}_- \approx \frac{g}{\Delta}\, \tilde{a}\, \sigma_z \,,
$$

which is small when $\Delta$ is large, as expected for an off-resonant response. Substituting this into the field equation and returning to the lab frame:

$$
\dot{a} \approx -i\omega_r a - i \frac{g^2}{\Delta}\, \sigma_z\, a = -i\Big(\omega_r + \frac{g^2}{\Delta}\sigma_z\Big) a \,.
$$

This is exactly the equation of motion generated by $\omega_r a^\dagger a + \frac{g^2}{\Delta} a^\dagger a\, \sigma_z$, since $i[\frac{g^2}{\Delta} a^\dagger a\,\sigma_z, a] = -i\frac{g^2}{\Delta}\sigma_z a$. The cavity frequency is thus pulled by $\pm g^2/\Delta$ depending on the qubit state. Treating the qubit equation in the same way, with care for operator ordering, additionally shifts the qubit frequency by $-g^2/\Delta$ (the Lamb shift). Altogether, up to a constant,

$$
H_{\text{eff}} = \omega_r a^\dagger a - \frac{1}{2}\Big(\omega_q - \frac{g^2}{\Delta}\Big)\sigma_z + \frac{g^2}{\Delta}\, a^\dagger a\, \sigma_z \,.
$$

Thus, adiabatic elimination yields the same effective dispersive Hamiltonian as SW. It is less systematic but conceptually clear: the qubit coherence follows the cavity field adiabatically, and this off-resonant response feeds back as a qubit-state-dependent frequency shift.
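The elimination can be checked on the linearized equations of motion. The sketch below replaces $\sigma_z$ by a c-number $s_z = \pm 1$ (an assumption that holds only while the qubit population is frozen), uses the sign convention of $H_{\text{JC}}$ above (ground state $s_z = +1$, so that $\dot{\sigma}_- = -i\omega_q\sigma_- - i g\,a\,\sigma_z$), and compares the cavity-like eigenfrequency of the coupled equations with $\omega_r + s_z\, g^2/\Delta$. Parameter values and the function name are illustrative.

```python
import numpy as np

def cavity_like_frequency(omega_r, omega_q, g, s_z):
    """Eigenfrequency closest to omega_r of the linearized system

        d/dt (a, sigma_-) = -i M (a, sigma_-),   M = [[omega_r, g], [g*s_z, omega_q]],

    where sigma_z has been replaced by the c-number s_z (assumption).
    """
    M = np.array([[omega_r, g], [g * s_z, omega_q]], dtype=complex)
    freqs = np.linalg.eigvals(M)
    return freqs[np.argmin(np.abs(freqs - omega_r))].real

omega_r, omega_q, g = 7.0, 5.0, 0.1          # hypothetical values in GHz
delta = omega_r - omega_q
pull_ground = cavity_like_frequency(omega_r, omega_q, g, +1) - omega_r
pull_excited = cavity_like_frequency(omega_r, omega_q, g, -1) - omega_r
```

Here `pull_ground` and `pull_excited` come out close to $+g^2/\Delta$ and $-g^2/\Delta$, with deviations of order $g^4/\Delta^3$.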

### 3.2.2 Raman Processes and Comparison to Schrieffer–Wolff

Adiabatic elimination is often used in multi-step processes like Raman transitions. For example, suppose you have two lower levels and one far-detuned upper level; if two lasers couple the lower levels to the upper one, you can eliminate the upper (which is never significantly populated if detuned) and get an effective direct coupling between the two lower levels (this is how a two-photon Raman transition works). The effective coupling (and Stark shifts) can be computed by adiabatic elimination.

In circuit QED, an analogous scenario is two qubits coupled via a bus resonator: if the bus is far off-resonant from both qubits (dispersive regime), one can eliminate it to obtain a direct qubit–qubit exchange coupling of the form $J(\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+})$. This is the mechanism behind early two-qubit gates such as the iSWAP with a resonator bus, where the qubits are tuned into and out of resonance with each other while remaining detuned from the bus. Adiabatic elimination gives a simple formula for the effective coupling between the qubits: $J = \frac{g_1 g_2}{2}\left(\frac{1}{\Delta_1} + \frac{1}{\Delta_2}\right)$, where $g_{1,2}$ are the couplings of each qubit to the resonator and $\Delta_{1,2}$ their detunings from it (the overall sign depends on the sign convention chosen for $\Delta_i$). In Section 4.2.1 we will see that SW yields the same $J$.
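A quick way to test the bus-mediated coupling is to work in the single-excitation subspace, where the rotating-wave Hamiltonian is a $3\times 3$ matrix. The sketch below compares its two qubit-like eigenvalues with those of an effective $2\times 2$ model built from the Lamb-shifted qubit frequencies and $J = \frac{g_1 g_2}{2}(1/\Delta_1 + 1/\Delta_2)$. The parameter values (GHz) and function names are assumptions for illustration; the sign of $J$ does not affect the spectrum, so it is not fixed here.

```python
import numpy as np

def exact_qubit_like_levels(omega_r, omega_1, omega_2, g1, g2):
    """Exact levels in the basis {|resonator>, |qubit 1>, |qubit 2>} (one excitation)."""
    H = np.array([[omega_r, g1, g2],
                  [g1, omega_1, 0.0],
                  [g2, 0.0, omega_2]])
    evals, evecs = np.linalg.eigh(H)
    resonator_weight = np.abs(evecs[0, :]) ** 2
    keep = np.argsort(resonator_weight)[:2]      # drop the resonator-like state
    return np.sort(evals[keep])

def effective_levels(omega_r, omega_1, omega_2, g1, g2):
    """Levels of the second-order effective two-qubit model, and the coupling J."""
    d1, d2 = omega_r - omega_1, omega_r - omega_2
    J = 0.5 * g1 * g2 * (1.0 / d1 + 1.0 / d2)
    H_eff = np.array([[omega_1 - g1**2 / d1, J],
                      [J, omega_2 - g2**2 / d2]])
    return np.sort(np.linalg.eigvalsh(H_eff)), J

# Two identical qubits on resonance with each other, 2 GHz below the bus.
exact = exact_qubit_like_levels(7.0, 5.0, 5.0, 0.05, 0.05)
approx, J = effective_levels(7.0, 5.0, 5.0, 0.05, 0.05)
splitting = exact[1] - exact[0]
```

For identical qubits on resonance the splitting of the two qubit-like levels equals $2|J|$, which sets the speed of the exchange (iSWAP-type) interaction.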

In comparison, the Schrieffer–Wolff method is essentially performing the same calculation but in a Hamiltonian (energy) approach rather than eliminating time derivatives. SW often yields the same effective Hamiltonian as adiabatic elimination yields from equations of motion. Adiabatic elimination is sometimes more intuitive for designing control sequences, whereas SW is systematic and keeps track of Hermiticity and higher order terms more systematically.

In the Raman example (not worked out in detail here), SW would show an effective second-order coupling between the two lower levels with two-photon Rabi frequency $\Omega_{\text{eff}} = \frac{\Omega_1 \Omega_2}{2\Delta}$ (where $\Omega_{1,2}$ are the single-photon Rabi frequencies and $\Delta$ is the detuning from the upper level), plus Stark shifts of magnitude $\Omega_i^2/(4\Delta)$ on the lower levels. Adiabatic elimination yields the same result by solving for the steady-state amplitude of the upper level.
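The Raman estimate can be checked by evolving the full three-level system exactly. The sketch below assumes a rotating-frame lambda Hamiltonian at two-photon resonance with couplings $\Omega_i/2$ to an upper level detuned by $\Delta$, equal Rabi frequencies, and illustrative parameter values; none of these specifics come from the thesis.

```python
import numpy as np

def lambda_populations(omega_1, omega_2, delta, t):
    """Populations of (|g1>, |g2>, |e>) at time t, starting in |g1> (hbar = 1)."""
    H = np.array([[0.0, 0.0, 0.5 * omega_1],
                  [0.0, 0.0, 0.5 * omega_2],
                  [0.5 * omega_1, 0.5 * omega_2, delta]])
    evals, evecs = np.linalg.eigh(H)
    psi0 = np.array([1.0, 0.0, 0.0], dtype=complex)
    # Propagate in the eigenbasis: psi(t) = V exp(-i E t) V^dagger psi0.
    psi_t = evecs @ (np.exp(-1j * evals * t) * (evecs.conj().T @ psi0))
    return np.abs(psi_t) ** 2

omega_1 = omega_2 = 0.1                          # single-photon Rabi frequencies
delta = 2.0                                      # detuning from the upper level
omega_eff = omega_1 * omega_2 / (2.0 * delta)    # predicted two-photon Rabi frequency
populations = lambda_populations(omega_1, omega_2, delta, np.pi / omega_eff)
```

After the predicted $\pi$-pulse time $\pi/\Omega_{\text{eff}}$ nearly all population sits in $|g_2\rangle$, while the upper level stays almost empty throughout, which is the assumption behind the elimination.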

In summary, adiabatic elimination corroborates the SW results in the appropriate regime and is a good check. We used it to re-derive the dispersive JC effective Hamiltonian. Next, we will consider a third method: time-averaging, which is yet another way to derive effective Hamiltonians, especially useful for driven systems (like when we have a strong driving term oscillating in time).

## 3.3 Time-Averaging (Magnus Expansion)

Time-averaging or the **Magnus expansion** is a technique to find an effective Hamiltonian when the actual Hamiltonian has explicit time dependence that is fast oscillating. If a Hamiltonian can be written as $H(t) = H_0 + V(t)$ where $V(t)$ oscillates rapidly (e.g., $V(t) = V e^{i\omega t} + V^\dagger e^{-i\omega t}$ with large $\omega$), then one can average over a period to find an effective static Hamiltonian that governs the slow dynamics.

This is commonly used in NMR and driven systems: by moving into rotating frames or averaging high-frequency terms, one can capture phenomena like the Bloch–Siegert shift or create effective interactions (as in the concept of **Floquet engineering** in modern physics).

A specific way is the **Magnus expansion**, which provides a perturbative expansion for the logarithm of the time evolution operator when the Hamiltonian is periodic or quasi-periodic. The first-order Magnus term is the time-average of the Hamiltonian; second-order terms involve commutators of integrals of the Hamiltonian at different times, etc.

In this chapter we use only a simple rotating-wave style of averaging: we drop fast-oscillating terms, as was already done when making the RWA. The same idea, applied in an appropriate rotating frame, also yields effective Hamiltonians for two-qubit interactions.

### 3.3.1 Jaynes–Cummings via Time-Averaging

We already applied a form of time-averaging when we invoked the RWA in Section 2.1: we said that terms oscillating at $\omega_\Sigma = \omega_r + \omega_q$ average out. One could formalize this by saying the effective Hamiltonian is the original Hamiltonian minus those terms (since their average is zero). That directly gave the JC Hamiltonian from the full Rabi Hamiltonian.

One can go further: to second order, the dropped counter-rotating terms contribute an averaged Hamiltonian of order $g^2/\omega_\Sigma$. In the sign convention used here, its diagonal part is $-\frac{g^2}{\omega_\Sigma}\,\sigma_z\big(a^\dagger a + \frac{1}{2}\big)$ up to a constant, which is the Bloch–Siegert shift. This is an example of the Magnus expansion at work: the first-order average of the counter-rotating terms vanishes, and the second-order term produces a static shift.
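The size of the Bloch–Siegert correction can be checked by comparing the quantum Rabi model (counter-rotating terms kept) with the exactly solvable JC model. The sketch below does this for the zero-photon qubit transition only, where second-order perturbation theory predicts a difference of $+g^2/\omega_\Sigma$. The coupling form $g(a + a^\dagger)\sigma_x$, the photon-number cutoff, and the parameter values are assumptions made for this example.

```python
import numpy as np

def rabi_model_qubit_frequency(omega_r, omega_q, g, n_photons=30):
    """Lowest transition of H = w_r a^dag a - (w_q/2) sigma_z + g (a + a^dag) sigma_x."""
    a = np.diag(np.sqrt(np.arange(1, n_photons)), k=1)
    sx = np.array([[0.0, 1.0], [1.0, 0.0]])
    sz = np.array([[1.0, 0.0], [0.0, -1.0]])     # index 0 = ground state (sigma_z = +1)
    H = (omega_r * np.kron(a.T @ a, np.eye(2))
         - 0.5 * omega_q * np.kron(np.eye(n_photons), sz)
         + g * np.kron(a + a.T, sx))
    evals = np.linalg.eigvalsh(H)
    return evals[1] - evals[0]                   # valid while omega_q < omega_r

def jc_qubit_frequency(omega_r, omega_q, g):
    """Exact JC value: |0, e> mixes only with |1, g>, and |0, g> is unshifted."""
    delta = omega_r - omega_q
    e_excited = 0.5 * omega_r - np.sign(delta) * np.sqrt(0.25 * delta**2 + g**2)
    return e_excited + 0.5 * omega_q

omega_r, omega_q, g = 7.0, 5.0, 0.05             # hypothetical values in GHz
bloch_siegert = (rabi_model_qubit_frequency(omega_r, omega_q, g)
                 - jc_qubit_frequency(omega_r, omega_q, g))
predicted = g**2 / (omega_r + omega_q)
```

The difference is much smaller than the dispersive shift $g^2/\Delta$ because it is suppressed by $\omega_\Sigma$ rather than $\Delta$, which is why the RWA is a good starting point.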

Time-averaging might be more interesting for cases like the two-drive cross-resonance or bSWAP gates, where one applies a microwave and then averages out the fast oscillations to find an effective gate Hamiltonian. That is indeed how one often analyzes gates: treat the drive in a rotating frame and average out fast components to see the effective coupling.

Since Chapter 4 covers specific gates, we mention only one simple use here. If two qubits have a fixed, always-on coupling, driving one qubit at the frequency of the other (cross-resonance) activates an effective interaction. The effective Hamiltonian can be derived by going into a frame rotating at the drive frequency and averaging; it contains terms such as $ZX$ and $ZZ$, depending on the detuning. The cross-resonance gate is discussed in Section 4.2.3.

As a smaller example, consider a two-level system driven by a microwave tone detuned by $\omega$ from the qubit. In the rotating frame of the qubit, $H(t) = \frac{\Omega}{2}( \sigma_x \cos\omega t + \sigma_y \sin\omega t)$. On resonance ($\omega = 0$) this is the static Rabi term $\frac{\Omega}{2}\sigma_x$. Far off resonance ($|\omega| \gg \Omega$) the first-order time average vanishes, and the second-order term gives an AC Stark shift $-\frac{\Omega^2}{4\omega} \sigma_z$. This is essentially what average Hamiltonian theory does in NMR pulse design, and the result agrees with perturbation theory.
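The off-resonant case can be checked numerically by building the propagator over one drive period and reading off the stroboscopic (Floquet) Hamiltonian from $U(T) = e^{-i H_{\text{eff}} T}$. The sketch below is a minimal illustration under assumed parameter values; the midpoint stepping scheme and the function names are choices made here, not part of the original analysis.

```python
import numpy as np

SX = np.array([[0, 1], [1, 0]], dtype=complex)
SY = np.array([[0, -1j], [1j, 0]], dtype=complex)

def step_propagator(H, dt):
    """exp(-i H dt) for a Hermitian matrix H via its eigendecomposition."""
    evals, evecs = np.linalg.eigh(H)
    return evecs @ np.diag(np.exp(-1j * evals * dt)) @ evecs.conj().T

def one_period_effective_hamiltonian(rabi, omega, n_steps=2000):
    """Stroboscopic H_eff for H(t) = (rabi/2)(sx cos(omega t) + sy sin(omega t))."""
    period = 2.0 * np.pi / omega
    dt = period / n_steps
    U = np.eye(2, dtype=complex)
    for k in range(n_steps):
        t_mid = (k + 0.5) * dt                   # midpoint rule for each short step
        H_t = 0.5 * rabi * (np.cos(omega * t_mid) * SX + np.sin(omega * t_mid) * SY)
        U = step_propagator(H_t, dt) @ U
    phases, vecs = np.linalg.eig(U)
    # U = exp(-i H_eff T)  =>  quasi-energies are -arg(eigenvalue) / T.
    return vecs @ np.diag(-np.angle(phases) / period) @ np.linalg.inv(vecs)

rabi, omega = 0.1, 1.0                           # assumed values, rabi << omega
H_eff = one_period_effective_hamiltonian(rabi, omega)
stark_numeric = H_eff[0, 0].real                 # coefficient of sigma_z
stark_predicted = -rabi**2 / (4.0 * omega)
```

The remaining off-diagonal part of `H_eff` is of higher order in $\Omega/\omega$, consistent with the time-averaged picture.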

**In summary**, time-averaging is a third perspective where one averages out rapidly oscillating terms to identify slower dynamics. In the limit of large detuning or high drive frequency, it matches the results of SW (which finds effective static interactions) and adiabatic elimination. All these methods, applied correctly, should yield consistent effective Hamiltonians.

We have now seen three approaches:

* Schrieffer–Wolff: a perturbative unitary transformation approach on the Hamiltonian.
* Adiabatic Elimination: a perturbative approach on the equations of motion (treating some variables as quasi-steady).
* Time-Averaging (Magnus): a perturbative approach in the time domain for explicitly time-dependent Hamiltonians.

Each has its use cases. SW is very general and ensures the result is a proper Hermitian Hamiltonian. Adiabatic elimination is quick for simple systems and intuitive. Time-averaging is essential when dealing with driven systems or when designing control pulses and periodic modulations.

## 3.4 Ordinary Perturbation Theory vs. SW

One might wonder: why not just do straightforward non-degenerate perturbation theory on energy levels? The answer is: one can, but it becomes unwieldy for large systems and doesn’t directly give a Hamiltonian operator in the subspace. Non-degenerate perturbation theory (as in textbook quantum mechanics) would let you compute corrections to each energy eigenvalue and eigenstate order by order. In principle, if you only care about, say, the energy of the ground and first excited state of the coupled qubit–resonator system, you could do that.

For example, we could treat the coupling $g$ in JC as a perturbation and compute second-order energy corrections, labeling states as $|n, q\rangle$ with $n$ photons and qubit state $q$. Within the RWA the state $|0,0\rangle$ (no photon, qubit ground) is not coupled to any other state, so its energy stays at $E_{00} = -\frac{\omega_q}{2}$. The state $|0,1\rangle$ (no photon, qubit excited) couples to $|1,0\rangle$ (one photon, qubit ground); the energy denominator $\omega_q - \omega_r = -\Delta$ gives a shift of $-\frac{g^2}{\Delta}$, and the partner state $|1,0\rangle$ shifts by $+\frac{g^2}{\Delta}$. The qubit splitting at zero photons therefore becomes $\omega_q - \frac{g^2}{\Delta}$, and the cavity frequency with the qubit in its ground state becomes $\omega_r + \frac{g^2}{\Delta}$. Both match the $\sigma_z$ and $a^\dagger a\,\sigma_z$ terms found above.
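The textbook second-order sum $E_k^{(2)} = \sum_{m \neq k} |V_{mk}|^2 / (E_k - E_m)$ is easy to evaluate directly. The sketch below applies it to a truncated JC model and compares the result with exact diagonalization; the basis ordering, cutoff, and parameter values are assumptions made for this example.

```python
import numpy as np

def jc_parts(omega_r, omega_q, g, n_photons=6):
    """Bare energies and coupling matrix; basis index 2*n + q (q = 0 ground, 1 excited)."""
    dim = 2 * n_photons
    E0 = np.zeros(dim)
    V = np.zeros((dim, dim))
    for n in range(n_photons):
        E0[2 * n] = n * omega_r - 0.5 * omega_q
        E0[2 * n + 1] = n * omega_r + 0.5 * omega_q
        if n + 1 < n_photons:
            # RWA coupling: |n, excited> <-> |n+1, ground>
            V[2 * n + 1, 2 * (n + 1)] = V[2 * (n + 1), 2 * n + 1] = g * np.sqrt(n + 1)
    return E0, V

def second_order_shift(E0, V, k):
    """Non-degenerate second-order energy correction of bare state k."""
    coupled = [m for m in range(len(E0)) if m != k and V[m, k] != 0.0]
    return sum(V[m, k] ** 2 / (E0[k] - E0[m]) for m in coupled)

def exact_shift(E0, V, k):
    """Exact shift of the dressed state with the largest overlap on bare state k."""
    evals, evecs = np.linalg.eigh(np.diag(E0) + V)
    return evals[np.argmax(np.abs(evecs[k, :]))] - E0[k]

omega_r, omega_q, g = 7.0, 5.0, 0.05             # hypothetical values in GHz
delta = omega_r - omega_q
E0, V = jc_parts(omega_r, omega_q, g)
# Bare states |0,ground>, |0,excited>, |1,ground> have indices 0, 1, 2.
shifts_pt = [second_order_shift(E0, V, k) for k in (0, 1, 2)]
shifts_exact = [exact_shift(E0, V, k) for k in (0, 1, 2)]
```

The three perturbative shifts are $0$, $-g^2/\Delta$ and $+g^2/\Delta$, and they agree with the exact values up to terms of order $g^4/\Delta^3$.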

So regular perturbation theory on energies does yield the dispersive shifts. But it has limitations: (1) it requires non-degenerate eigenstates to start with (imagine if $\Delta$ were zero, then you'd need degenerate perturbation theory, which SW handles more gracefully in quasi-degenerate manner), and (2) to get an effective Hamiltonian operator (useful for dynamics, not just energies), one must also transform the states accordingly (which SW essentially does by giving $U = e^{-S}$ that transforms to the effective frame).

In essence, SW *is* a form of degenerate perturbation theory (quasi-degenerate if blocks have multiple states) that yields not only energies but also effective operators (observables, etc.) in the reduced subspace. It also inherently yields the effective eigenstates if needed ($|p_{\text{eff}} \rangle = e^{-S}|p_{\text{exact}} \rangle$ to a certain order).

**Conclusion of Chapter 3:** We have learned several methods to derive effective Hamiltonians. We applied these methods to simple systems (like one qubit + one resonator) to gain confidence:

* The **Schrieffer–Wolff transformation** provided a systematic way to eliminate coupling and yielded the dispersive Hamiltonian and Bloch–Siegert shift.
* **Adiabatic elimination** gave a quick intuitive derivation of the same dispersive result by assuming the qubit follows the cavity field adiabatically.
* **Time averaging (RWA/Magnus)** was implicitly used to derive the JC model and could be used to find second-order effects of fast oscillations as well.
* Ordinary perturbation theory on eigenenergies confirms the shifts.

All methods agree on key outcomes: coupling leads to shifts (Stark/Lamb shifts) and effective interactions (like qubit–qubit coupling via a resonator). In the next chapter, we will apply these insights to actual two-qubit gate schemes. We will see how specific driving or tuning strategies can turn these effective interactions on and off, enabling gate operations like iSWAP, bSWAP, and cross-resonance. Understanding the perturbative analysis will allow us to dissect those gates and identify the origin of terms like $ZZ$ interactions or conditional phases that are central to two-qubit gate performance.



# Chapter 4: Gates

## 4.1 Coherent Control

As described in Section 2.2.3 of the thesis, the state of a qubit can be measured via microwave radiation on a cavity. However, the same physical interaction can be used in reverse to **coherently control a qubit via a microwave drive** without measuring it. In particular, if we drive the cavity far off-resonantly (detuned from the cavity by a large amount), we can manipulate the qubit state without significantly populating the cavity with real photons. In the dispersive regime, a drive at the qubit’s frequency causes only a *virtual* population of the cavity (mean photon number $n \sim 0.1$ for typical parameters), enabling fast control without strong measurement back-action.

### 4.1.1 Single-Qubit Gates

Following Blais *et al.*, we model a microwave drive on the cavity at frequency $\omega_d$ by the time-dependent Hamiltonian (in the lab frame):


$$
H_d(t) \;=\; \xi(t)\big(a\,e^{-i\omega_d t} + a^\dagger e^{i\omega_d t}\big)\,,
$$


where $\xi(t)$ is the (real) drive amplitude envelope, and $a$ ($a^\dagger$) is the annihilation (creation) operator for the cavity mode. To simplify the analysis, we transform into a *rotating frame* at the drive frequency $\omega_d$. In this frame, the drive term becomes time-independent, and the cavity and qubit Hamiltonian terms pick up detunings. The Hamiltonian in the rotating frame (using $\Delta_r = \omega_r - \omega_d$ for the cavity-drive detuning and $\Delta_q = \omega_q - \omega_d$ for the qubit-drive detuning) is:


$$
H \;=\; \Delta_r\,a^\dagger a \;-\; \frac{\Delta_q}{2}\,\sigma_z \;+\; g\big(a\,\sigma_- + a^\dagger\,\sigma_+\big) \;+\; \xi(t)\big(a + a^\dagger\big)\,,
$$


where $g$ is the qubit–cavity coupling strength (Jaynes-Cummings coupling), and $\sigma_-, \sigma_+$ are the qubit lowering and raising operators, respectively.

Now, to handle the drive term $\xi(t)(a + a^\dagger)$, we perform a **displacement transformation** using the unitary $D(\alpha) = \exp\big[\alpha(t)a^\dagger - \alpha^*(t)a\big]$ (a Glauber displacement). We choose the displacement $\alpha(t)$ such that it cancels the drive term. Under this transformation the annihilation operator is displaced, $D^\dagger a\,D = a + \alpha(t)$, and similarly $D^\dagger a^\dagger D = a^\dagger + \alpha^*(t)$. Applying this to $H$ and dropping $c$-number terms (which cause only a global phase), we obtain the transformed Hamiltonian $\tilde{H} = D^\dagger H D - i\,D^\dagger \dot{D}$ as:


$$
\tilde{H} \;=\; \Delta_r\,a^\dagger a \;-\; \frac{\Delta_q}{2}\,\sigma_z \;+\; g\Big[(a + \alpha(t))\,\sigma_- + (a^\dagger + \alpha^*(t))\,\sigma_+\Big] \;+\; \xi(t)\big(a + a^\dagger\big) \;+\; \Delta_r\!\big(\alpha(t)a^\dagger + \alpha^*(t)a\big) \;-\; i\big(\dot{\alpha}(t)a^\dagger - \dot{\alpha}^*(t)a\big)\,. 
$$


We can now choose $\alpha(t)$ to eliminate the terms *linear* in the field operators $a, a^\dagger$. Setting the coefficient of $a^\dagger$ to zero (the coefficient of $a$ is its complex conjugate) yields the **equation of motion for $\alpha$**:


$$
-i\,\dot{\alpha}(t) + \Delta_r\,\alpha(t) + \xi(t) \;=\; 0\,,
$$


which can be integrated given a drive pulse $\xi(t)$. Physically, $\alpha(t)$ represents the *coherent cavity field* set up by the drive in the rotating frame. For a constant or slowly varying drive it settles to $\alpha \approx -\xi/\Delta_r$, which is real for real $\xi$.

With this choice of $\alpha(t)$, all terms linear in $a$ and $a^\dagger$ that stem from the drive are canceled. The transformed Hamiltonian becomes (writing $\Omega(t) \equiv 2g\,\alpha(t)$, which will turn out to be the qubit’s Rabi drive frequency):


$$
\tilde{H} \;=\; \Delta_r\,a^\dagger a \;-\; \frac{\Delta_q}{2}\,\sigma_z \;+\; g\big(a\,\sigma_- + a^\dagger\,\sigma_+\big) \;+\; \frac{1}{2}\Big[\Omega(t)\,\sigma_- + \Omega^*(t)\,\sigma_+\Big]\,.
$$


Notice that the last term is exactly a direct *drive on the qubit*: $\frac{1}{2}[\Omega(t)\sigma_- + \Omega^*(t)\sigma_+] = \frac{\Omega(t)}{2}\sigma_x$ (for real $\Omega(t)$), which corresponds to a classical Rabi drive about the qubit’s $\sigma_x$ (X) axis. We have thus transformed the original cavity drive into an equivalent qubit drive. In summary, by displacing the cavity field, we trade the explicit cavity drive for an effective qubit drive with Rabi frequency $\Omega(t)$, up to additional small terms.
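The equation of motion $-i\dot{\alpha} + \Delta_r\alpha + \xi(t) = 0$ is straightforward to integrate numerically. The sketch below uses a fourth-order Runge–Kutta step with a smooth, hypothetical turn-on of the drive and compares the final field with the adiabatic solution $\alpha = -\xi/\Delta_r$. The pulse shape, the (dimensionless) parameter values, and the function names are assumptions for illustration; cavity damping is not included.

```python
import numpy as np

def ramp(t, xi0=0.05, tau=100.0):
    """Smooth turn-on of the drive amplitude (hypothetical pulse shape)."""
    return xi0 * np.sin(0.5 * np.pi * min(t, tau) / tau) ** 2

def integrate_alpha(xi, delta_r, t_final, dt=0.01):
    """RK4 integration of d(alpha)/dt = -i (delta_r * alpha + xi(t)), alpha(0) = 0."""
    def rhs(t, alpha):
        return -1j * (delta_r * alpha + xi(t))

    alpha, t = 0.0 + 0.0j, 0.0
    for _ in range(int(round(t_final / dt))):
        k1 = rhs(t, alpha)
        k2 = rhs(t + 0.5 * dt, alpha + 0.5 * dt * k1)
        k3 = rhs(t + 0.5 * dt, alpha + 0.5 * dt * k2)
        k4 = rhs(t + dt, alpha + dt * k3)
        alpha += dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6.0
        t += dt
    return alpha

delta_r, g = 1.0, 0.1
alpha_final = integrate_alpha(ramp, delta_r, t_final=150.0)
alpha_steady = -ramp(150.0) / delta_r            # adiabatic solution alpha = -xi / Delta_r
omega_rabi = 2 * g * alpha_final.real            # Omega = 2 g alpha
```

When the envelope varies slowly compared with $1/\Delta_r$, the integrated field tracks $-\xi(t)/\Delta_r$ closely, so the Rabi frequency simply follows the pulse envelope.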

At this point, we incorporate the result of Chapter 3: in the absence of a drive, the **dispersive Schrieffer-Wolff (SW) transformation** gave us an effective block-diagonal Hamiltonian (i.e. no direct qubit–cavity coupling). We would like to use that result but now *include the drive term*. To do so, we apply the SW transformation (derived in Section 3.1.2) to $\tilde{H}$ while also treating the drive term as a *perturbation* (since $\Omega$ is typically much smaller than the detunings). For a real Rabi drive $\Omega(t)=\Omega^*(t)$, performing the SW transformation to second order yields an **effective Hamiltonian with the drive**:


$$
\tilde{H}_{\text{eff}} \;=\; \Big(\Delta_r + \frac{g^2}{\Delta}\,\sigma_z\Big)\,a^\dagger a \;-\; \frac{1}{2}\Big(\Delta_q - \frac{g^2}{\Delta}\Big)\,\sigma_z \;+\; \frac{\Omega(t)}{2}\,\sigma_x \;+\; \frac{g\,\Omega(t)}{2\Delta}\,\big(a + a^\dagger\big)\sigma_z\,,
$$


where $\Delta \equiv \omega_r - \omega_q$ is the qubit–cavity detuning. Note that $\Delta = \Delta_r - \Delta_q$ regardless of the drive frequency; for a drive resonant with the qubit, $\Delta_q = 0$ and $\Delta_r = \Delta$. The terms in $\tilde{H}_{\text{eff}}$ can be interpreted as follows:

* The cavity frequency is shifted by $\frac{g^2}{\Delta}\sigma_z$, meaning it depends on the qubit state ($\sigma_z = \pm 1$). This is the **dispersive shift** (cavity pull); seen from the qubit, the same term is the **ac Stark shift** of the qubit frequency per cavity photon.
* The qubit transition frequency is shifted (Lamb shift) by $-\frac{g^2}{\Delta}$ (notice the minus sign inside the parentheses), so the qubit’s effective $\sigma_z$ term is reduced. This matches the dispersive shift found earlier in Eq. (3.11) of the thesis.
* $\frac{\Omega(t)}{2}\sigma_x$ is the intended qubit drive term (Rabi oscillation).
* $\frac{g\,\Omega}{2\Delta}(a + a^\dagger)\sigma_z$ is a small extra term that couples the qubit’s $\sigma_z$ to real cavity excitations. This term arises from the combination of drive and qubit–cavity coupling. In an ideal dispersive approximation one might neglect this term (it represents a residual entanglement of qubit and cavity during the drive, sometimes called an AC Stark mixing term). If the drive is sufficiently detuned or $\Omega$ is small, $g\Omega/\Delta$ is negligible. In practice, one might include this term’s effect as a slight additional Stark shift of the qubit frequency (or use a two-drive scheme to cancel it).

Importantly, aside from that last term, $\tilde{H}_{\text{eff}}$ is **block-diagonal**: the qubit and cavity are not exchanging excitations directly, and the qubit sees an effective drive on $\sigma_x$. This block-diagonal form justifies the dispersive treatment and will be assumed in analyzing gates going forward.

It’s worth noting that Blais *et al.* derive the effect of the drive on the qubit using an alternative method: directly applying the SW transformation to the Jaynes-Cummings Hamiltonian with a drive included (instead of first displacing the cavity). In that approach, one finds the same end result: the drive induces a Rabi term on the qubit, with equivalent small corrections. Both approaches are consistent with each other.

Having established how to include a qubit drive in the effective Hamiltonian, we will **use the form of the drive term $\frac{\Omega(t)}{2}\sigma_x$ in all subsequent analyses** (as stated above). We can now proceed to two-qubit gates, where this formalism will help to identify effective interactions.

## 4.2 Two-Qubit Gates

There have been many proposals to realize **entangling two-qubit gates** on superconducting qubits, which broadly fall into two categories:

* **Tunable qubit gates:** Those in which qubits’ frequencies (or their coupling) are **tuned in time** (typically via flux bias) to turn on and off an interaction.
* **Fixed-frequency gates with microwave driving:** Those in which qubits remain at fixed frequencies with a fixed “always-on” coupling, and entanglement is generated by applying carefully chosen microwave drives.

Each approach has advantages and disadvantages. Tunable qubits (like flux-tunable transmons) allow one to *isolate* qubits when idle (far detuned, negligible interaction) and then bring them into resonance or near-resonance to enact a gate. This can make single-qubit control easier and turn off unwanted interactions during idle periods. A prime example is the **iSWAP gate** (Section 4.2.2), where two qubits are coupled via a resonator and only interact significantly when tuned into resonance with each other via flux control. However, tunable qubits require extra control lines and are susceptible to **flux noise**, potentially hurting coherence.

Fixed-frequency schemes (always-on coupling), on the other hand, allow qubits to be parked at sweet spots for maximal coherence, and use microwave pulses to selectively activate interactions. Examples include the **Cross-Resonance gate** (Section 4.2.3) and two-transmon gates like **bSWAP** and **MAP** (Section 4.3). These avoid flux noise, but one must carefully manage the always-on coupling to avoid unwanted entanglement or **ZZ crosstalk** during idle operations. The cross-resonance gate, for instance, uses a drive on one qubit to mediate an effective interaction with another.

In this section, we first derive a general **effective two-qubit Hamiltonian** for two qubits coupled via a resonator in the dispersive regime (using a second-order SW perturbation). This gives insight into the effective *ZZ coupling* and *exchange interactions* between qubits. We then discuss the iSWAP gate as an example of a tunable-qubit gate, and finally the cross-resonance gate as an example of an all-microwave gate.

### 4.2.1 Entangling Two Qubits via a Resonator

Consider two qubits (labeled 1 and 2) coupled to the same resonator mode (no direct qubit–qubit coupling). The system can be described by a two-qubit Jaynes-Cummings Hamiltonian:


$$
H \;=\; \omega_r\,a^\dagger a \;-\; \frac{\omega_{q1}}{2}\,\sigma_{z1} \;-\; \frac{\omega_{q2}}{2}\,\sigma_{z2} \;+\; g_1\big(a\,\sigma_{1-} + a^\dagger\,\sigma_{1+}\big) \;+\; g_2\big(a\,\sigma_{2-} + a^\dagger\,\sigma_{2+}\big)\,,
$$


where $\omega_{q i}$ are the qubit transition frequencies and $g_i$ are the coupling strengths of each qubit to the resonator. We assume the coupling is **small** and each qubit is far detuned from the cavity: $g_i \ll |\omega_r - \omega_{q i}|$. This is the dispersive (off-resonant) regime for each qubit with respect to the resonator.

Our goal is to derive an **effective Hamiltonian** that directly couples the two qubits (mediated by the resonator) while eliminating the qubit–cavity interaction to second order in $g$. This can be done via a **Schrieffer-Wolff transformation** similar to the single-qubit case but generalized to two qubits.

Because the uncoupled Hamiltonian $H_0 = \omega_r a^\dagger a - \frac{\omega_{q1}}{2}\sigma_{z1} - \frac{\omega_{q2}}{2}\sigma_{z2}$ has near-degenerate states (e.g. states with one excitation in qubit 1, in qubit 2, or in the resonator can be close in energy if the frequencies are close), one convenient choice is to divide the Hilbert space into blocks of **even vs. odd qubit-excitation parity**. Block $A$ is spanned by $\{|0,0,n\rangle, |1,1,n\rangle\}$ (both qubits in the same state, an even number of qubit excitations), and block $B$ is spanned by $\{|0,1,n\rangle, |1,0,n\rangle\}$ (one qubit excited, the other not: an odd number of qubit excitations). Here $|i,j,n\rangle$ denotes qubit 1 in state $i\in\{0,1\}$, qubit 2 in state $j$, and $n$ photons in the cavity. In this parity basis, the interaction terms $a\,\sigma_{i-}$ and $a^\dagger\sigma_{i+}$ connect states in different blocks (flipping one qubit in exchange for a photon).
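The block structure can be verified with explicit matrices. The sketch below builds $H_0$ and the interaction for two qubits and a truncated resonator, and checks that the qubit-parity operator $P = \sigma_{z1}\sigma_{z2}$ commutes with $H_0$ but anticommutes with the interaction, so the interaction only connects block $A$ to block $B$. It assumes the Pauli-matrix convention in which $|0\rangle$ (ground) has $\sigma_z = +1$ and $\sigma_- = |1\rangle\langle 0|$; the cutoff, parameter values, and names are illustrative.

```python
import numpy as np

def two_qubit_jc_parts(omega_r, omega_q1, omega_q2, g1, g2, n_photons=5):
    """Return (H0, H_int, P) as matrices on qubit1 x qubit2 x resonator."""
    sz = np.diag([1.0, -1.0])                    # |0> (ground) has sigma_z = +1
    sm = np.array([[0.0, 0.0], [1.0, 0.0]])      # sigma_- = |1><0| (assumed convention)
    a = np.diag(np.sqrt(np.arange(1, n_photons)), k=1)
    I2, Ia = np.eye(2), np.eye(n_photons)

    def op(q1, q2, cav):
        return np.kron(np.kron(q1, q2), cav)

    H0 = (omega_r * op(I2, I2, a.T @ a)
          - 0.5 * omega_q1 * op(sz, I2, Ia)
          - 0.5 * omega_q2 * op(I2, sz, Ia))
    H_int = (g1 * (op(sm, I2, a) + op(sm.T, I2, a.T))
             + g2 * (op(I2, sm, a) + op(I2, sm.T, a.T)))
    P = op(sz, sz, Ia)                           # +1 on block A, -1 on block B
    return H0, H_int, P

H0, H_int, P = two_qubit_jc_parts(7.0, 5.0, 5.2, 0.1, 0.08)
commutes_with_H0 = np.allclose(P @ H0, H0 @ P)
flips_parity = np.allclose(P @ H_int @ P, -H_int)
```

Because the interaction is purely block-off-diagonal with respect to $P$, its effect within a block first appears at second order, which is what the SW transformation extracts.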

We write the interaction Hamiltonian as $H_2 = g_1(a\sigma_{1-} + a^\dagger\sigma_{1+}) + g_2(a\sigma_{2-} + a^\dagger\sigma_{2+})$. We then **choose an anti-Hermitian operator $S^{(1)}$** that cancels $H_2$ to first order when we transform the Hamiltonian as $H_{\text{eff}} = e^{-S}H e^{S}$. In perturbation theory (assuming $g$ is small), we take $S = S^{(1)} + S^{(2)} + \dots$ with $S^{(1)}$ of order $g$, $S^{(2)}$ of order $g^2$, etc. The standard SW result (as given in Section 3.1.2) is that we require $[H_0, S^{(1)}] = -H_2$ to eliminate the first-order term. A suitable choice (by analogy with the single-qubit case) is:


$$
S^{(1)} \;=\; \sum_{i=1}^2 \frac{g_i}{\Delta_i}\Big(a\,\sigma_{i-} - a^\dagger\,\sigma_{i+}\Big)\,,
$$


where $\Delta_i = \omega_r - \omega_{q i}$ is the detuning of qubit $i$ from the resonator. The defining condition can be checked directly: with $[a^\dagger a, a] = -a$ and $[\sigma_z, \sigma_\pm] = \pm 2\sigma_\pm$, we get $[H_0, a\,\sigma_{i-}] = -\Delta_i\,a\,\sigma_{i-}$ and $[H_0, a^\dagger\sigma_{i+}] = +\Delta_i\,a^\dagger\sigma_{i+}$, so that $[H_0, S^{(1)}] = -H_2$. Essentially, $S^{(1)}$ has the form (coupling strength)$\times$(excitation exchange) divided by detuning, generalizing the single-qubit case to two qubits. Plugging this into the SW expansion, the effective Hamiltonian up to second order is:


$$
H_{\text{eff}} \;=\; H_0 + \frac{1}{2}[H_2, S^{(1)}] + \mathcal{O}(g^3)\,.
$$


We now compute the **commutator $[H_2, S^{(1)}]$** explicitly, keeping terms up to order $g^2$. This commutator generates the second-order shifts and the qubit–qubit coupling. Let’s break it into pieces. Write $H_2 = \sum_i H_{2,i}$ with $H_{2,i} = g_i(a\,\sigma_{i-} + a^\dagger\sigma_{i+})$, and $S^{(1)} = \sum_j S^{(1)}_j$ with $S^{(1)}_j = \frac{g_j}{\Delta_j}(a\,\sigma_{j-} - a^\dagger\sigma_{j+})$. Then:


$$
[H_2, S^{(1)}] \;=\; \sum_{i,j}[H_{2,i}, S^{(1)}_j]\,.
$$


The terms fall into two types: **(i)** $i=j$, and **(ii)** $i \neq j$. We will handle each in turn.

* **Case 1: $i = j$.** These terms involve commutators like $[g_i(a\,\sigma_{i-} + a^\dagger\sigma_{i+}),\; \frac{g_i}{\Delta_i}(a\,\sigma_{i-} - a^\dagger\sigma_{i+})]$. Expanding this commutator:

  $$
  \begin{aligned}
  [H_{2,i}, S^{(1)}_i] &= \frac{g_i^2}{\Delta_i}\Big[\,a\,\sigma_{i-} + a^\dagger\sigma_{i+}\,,\;a\,\sigma_{i-} - a^\dagger\sigma_{i+}\Big] \\
  &= \frac{g_i^2}{\Delta_i}\Big( [a\,\sigma_{i-},\,a\,\sigma_{i-}] \;-\;[a\,\sigma_{i-},\,a^\dagger\sigma_{i+}] \;+\;[a^\dagger\sigma_{i+},\,a\,\sigma_{i-}] \;-\;[a^\dagger\sigma_{i+},\,a^\dagger\sigma_{i+}] \Big)\,.
  \end{aligned}
  $$

  Now, $[a\,\sigma_{i-},\,a\,\sigma_{i-}] = 0$ and $[a^\dagger\sigma_{i+},\,a^\dagger\sigma_{i+}] = 0$ (an operator commutes with itself). The nontrivial parts are the mixed commutators. Cavity and qubit operators commute with each other, so the only non-zero ingredients are $[a, a^\dagger] = 1$ and the qubit algebra. In this chapter’s convention ($H_0$ contains $-\frac{\omega_q}{2}\sigma_z$, so the ground state $|0\rangle$ has $\sigma_z = +1$), $\sigma_- = |1\rangle\langle 0|$ creates a qubit excitation, and the Pauli operator identities (Appendix B) give $\sigma_-\sigma_+ = |1\rangle\langle 1| = \frac{1}{2}(I - \sigma_z)$ and $\sigma_+\sigma_- = |0\rangle\langle 0| = \frac{1}{2}(I + \sigma_z)$, hence $[\sigma_-, \sigma_+] = -\sigma_z$. We find (dropping the qubit index for clarity):

  * $[a\,\sigma_-,\,a^\dagger\sigma_+] = a a^\dagger\,\sigma_-\sigma_+ - a^\dagger a\,\sigma_+\sigma_-$. Because $a a^\dagger = a^\dagger a + 1$, this becomes $(a^\dagger a + 1)\sigma_-\sigma_+ - a^\dagger a\,\sigma_+\sigma_- = a^\dagger a\,[\sigma_-,\sigma_+] + \sigma_-\sigma_+ = -a^\dagger a\,\sigma_z + \frac{1}{2}(I - \sigma_z)$. Here $\sigma_-\sigma_+$ is the projector onto the qubit’s excited state $|1\rangle\langle 1|$.

  * $[a^\dagger\sigma_+,\,a\,\sigma_-] = -[a\,\sigma_-,\,a^\dagger\sigma_+]$ (since $[X,Y] = -[Y,X]$), so it is the negative of the result above.

  Putting these together:

  $$
  \begin{aligned}
  [H_{2,i}, S^{(1)}_i] &= \frac{g_i^2}{\Delta_i}\Big( -[a\,\sigma_-,\,a^\dagger\sigma_+] + [a^\dagger\sigma_+,\,a\,\sigma_-] \Big) \\
  &= -\frac{2\,g_i^2}{\Delta_i}\,[a\,\sigma_-,\,a^\dagger\sigma_+] \\
  &= \frac{2\,g_i^2}{\Delta_i}\Big(a^\dagger a\,\sigma_{zi} + \frac{1}{2}\sigma_{zi} - \frac{1}{2}I\Big)\,.
  \end{aligned}
  $$

  The $I$ (identity) term is a constant energy shift; such constant terms are dropped because they do not affect the dynamics. What remains is proportional to $a^\dagger a\,\sigma_{zi}$ and $\sigma_{zi}$. Including the $\frac{1}{2}$ prefactor in $H_{\text{eff}}$, we get a contribution $\frac{1}{2}[H_{2,i},S^{(1)}_i] = \frac{g_i^2}{\Delta_i}\,a^\dagger a\,\sigma_{zi} + \frac{g_i^2}{2\Delta_i}\sigma_{zi}$ (up to constant identity terms). This means:

  * The cavity mode $a^\dagger a$ sees a frequency shift of $\pm\frac{g_i^2}{\Delta_i}$ *conditioned on the qubit state* ($\sigma_{zi} = \pm 1$). In other words, the cavity’s frequency is pulled by each qubit (dispersive shift).
  * The qubit $i$ itself gets a **Lamb shift**. Combined with the bare term $-\frac{\omega_{qi}}{2}\sigma_{zi}$ in $H_0$, the qubit’s effective $\sigma_z$ term is $-\frac{1}{2}(\omega_{qi} - \frac{g_i^2}{\Delta_i})\sigma_{zi}$, i.e. the qubit transition frequency shifts by $-\frac{g_i^2}{\Delta_i}$.

  These results are completely analogous to the single-qubit dispersive case. Each qubit individually causes a dispersive shift on the cavity and a Lamb shift on itself.

* **Case 2: $i \neq j$.** These terms describe the interaction between two *different* qubits via the resonator. We need $[H_{2,1}, S^{(1)}_2] + [H_{2,2}, S^{(1)}_1]$ (the double sum includes both orderings). Let’s compute $[H_{2,1}, S^{(1)}_2]$:

  $$
  \begin{aligned}
  [H_{2,1}, S^{(1)}_2] &= \frac{g_1 g_2}{\Delta_2}\Big[\,a\,\sigma_{1-} + a^\dagger\sigma_{1+}\,,\;a\,\sigma_{2-} - a^\dagger\sigma_{2+}\Big] \\
  &= \frac{g_1 g_2}{\Delta_2}\Big( [a\,\sigma_{1-},\,a\,\sigma_{2-}] \;-\; [a\,\sigma_{1-},\,a^\dagger\sigma_{2+}] \;+\; [a^\dagger\sigma_{1+},\,a\,\sigma_{2-}] \;-\; [a^\dagger\sigma_{1+},\,a^\dagger\sigma_{2+}] \Big)\,.
  \end{aligned}
  $$

  Two of these commutators vanish because operators acting on different qubits commute: $[a\,\sigma_{1-}, a\,\sigma_{2-}] = a^2\,[\sigma_{1-},\sigma_{2-}] = 0$, and similarly $[a^\dagger\sigma_{1+}, a^\dagger\sigma_{2+}] = 0$. The cross terms are non-zero:

  * $[a\,\sigma_{1-},\,a^\dagger\sigma_{2+}]$: here $a$ and $a^\dagger$ do not commute ($[a, a^\dagger] = 1$), while $\sigma_{1-}$ and $\sigma_{2+}$ commute (different qubits). We get $[a, a^\dagger]\,\sigma_{1-}\sigma_{2+} = \sigma_{1-}\sigma_{2+}$. This operator takes an excitation from qubit 2 and gives it to qubit 1: $\sigma_{1-}\sigma_{2+} = |1_1 0_2\rangle\langle 0_1 1_2|$.

  * $[a^\dagger\sigma_{1+},\,a\,\sigma_{2-}]$: similarly, $[a^\dagger, a] = -1$ and the sigmas commute, giving $-\sigma_{1+}\sigma_{2-}$. Here $\sigma_{1+}\sigma_{2-}$ is the Hermitian conjugate of $\sigma_{1-}\sigma_{2+}$, i.e. it takes an excitation from qubit 1 and gives it to qubit 2 (the other direction of exchange).

  So $[H_{2,1}, S^{(1)}_2] = \frac{g_1 g_2}{\Delta_2}\big(-\sigma_{1-}\sigma_{2+} - \sigma_{1+}\sigma_{2-}\big) = -\frac{g_1 g_2}{\Delta_2}\big(\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+}\big)$. The other term follows by relabeling $1 \leftrightarrow 2$: $[H_{2,2}, S^{(1)}_1] = -\frac{g_1 g_2}{\Delta_1}\big(\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+}\big)$. Adding them:

  $$
  [H_{2,1}, S^{(1)}_2] + [H_{2,2}, S^{(1)}_1] \;=\; -g_1 g_2\Big(\frac{1}{\Delta_1} + \frac{1}{\Delta_2}\Big)\big(\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+}\big)\,.
  $$

  The symmetric sum $\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+}$ is Hermitian; it is the two-qubit *exchange coupling* operator (swapping an excitation between qubits 1 and 2).

  Now remember, $H_{\text{eff}}$ gets $\frac{1}{2}$ of this commutator. So the effective Hamiltonian obtains a term:

  $$
  \frac{1}{2}\Big([H_{2,1}, S^{(1)}_2] + [H_{2,2}, S^{(1)}_1]\Big) \;=\; -\frac{g_1 g_2}{2}\Big(\frac{1}{\Delta_1} + \frac{1}{\Delta_2}\Big)\Big(\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+}\Big)\,.
  $$

  We can simplify the prefactor by putting it over a common denominator: $\frac{1}{2}\big(\frac{1}{\Delta_1} + \frac{1}{\Delta_2}\big) = \frac{\Delta_1 + \Delta_2}{2\,\Delta_1 \Delta_2}$. Substituting $\Delta_i = \omega_r - \omega_{qi}$, we get $\Delta_1 + \Delta_2 = 2\omega_r - (\omega_{q1} + \omega_{q2})$. It is convenient to define the **effective coupling strength $J$** as:

  $$
  J \;\equiv\; -\frac{g_1 g_2}{2}\Big(\frac{1}{\Delta_1} + \frac{1}{\Delta_2}\Big) \,=\, -\frac{g_1 g_2(\Delta_1 + \Delta_2)}{2\,\Delta_1 \Delta_2}\,.
  $$
  

  In terms of the physical frequencies, this is:

  
  $$ 
  J \;=\; \frac{g_1 g_2\,(\omega_{q1} + \omega_{q2} - 2\omega_r)}{2(\omega_r - \omega_{q1})(\omega_r - \omega_{q2})}\,. 
  $$
  

  Notice that if both qubits are *below* the resonator frequency ($\omega_{qi} < \omega_r$), then each $\Delta_i$ is positive and $(\omega_{q1}+\omega_{q2}-2\omega_r)$ is negative (since $2\omega_r$ is larger than the sum of the $\omega_q$’s). Thus $J$ will be **negative** in that case. The sign of $J$ isn’t very important for the gate (a negative $J$ just means a different phase for the resulting two-qubit operation), but it’s worth noting as it relates to whether the coupling is effectively ferromagnetic or antiferromagnetic in spin language. In any case, the magnitude $|J|$ dictates the strength of the induced qubit–qubit interaction.

  So the effective two-qubit **exchange interaction term** in the Hamiltonian becomes $J(\sigma_{1+}\sigma_{2-} + \sigma_{2+}\sigma_{1-})$. In terms of qubit basis states, this operator swaps an excitation between the qubits: $J(|10\rangle\langle 01| + |01\rangle\langle 10|)$.

Summing up **both cases** (self-terms $i=j$ and exchange terms $i\neq j$) and including $H_0$, we obtain the **effective Hamiltonian to second order**:


$$
H_{\text{eff}} \;=\; \omega_r\,a^\dagger a \;+\; \sum_{i=1}^2 \frac{g_i^2}{\Delta_i}\,\sigma_{zi}\,a^\dagger a \;-\; \frac{1}{2}\sum_{i=1}^2 \Big(\omega_{qi} - \frac{g_i^2}{\Delta_i}\Big)\sigma_{zi} \;+\; J\big(\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+}\big)\,. 
$$


We have dropped constant energy terms. This result can be compared to equation (3.11) of the thesis (for two qubits) and indeed it matches, except now we have the explicit coupling term with $J$. Rewriting in a perhaps clearer way:

* The resonator’s effective frequency is $\omega_r + \chi_1 \sigma_{z1} + \chi_2 \sigma_{z2}$, where $\chi_i = g_i^2/\Delta_i$ is the dispersive shift per qubit (the shift multiplies the photon number $a^\dagger a$, so it only contributes when photons are present; in the idle state the resonator is empty).
* The qubit frequencies are effectively $\omega_{qi}^{\text{eff}} = \omega_{qi} - \frac{g_i^2}{\Delta_i}$ (Lamb-shifted downward when $\Delta_i > 0$).
* There is a direct qubit–qubit exchange coupling of strength $J$. In matrix form on the single-excitation subspace $\{|01\rangle,|10\rangle\}$, this term looks like $\begin{pmatrix}0 & J \\ J & 0\end{pmatrix}$, causing oscillations between $|01\rangle$ and $|10\rangle$.

For completeness, if we project out the resonator (assuming it remains in vacuum $n=0$ in the dynamics), we can simplify $a^\dagger a \to 0$. Then the effective Hamiltonian on the qubits reduces to:


$$
H_{\text{eff (qubits-only)}} \;=\; -\frac{\omega_{q1}'}{2}\sigma_{z1} \;-\; \frac{\omega_{q2}'}{2}\sigma_{z2} \;+\; J\big(\sigma_{1+}\sigma_{2-} + \sigma_{2+}\sigma_{1-}\big)\,,
$$


where $\omega_{q i}' = \omega_{q i} - \frac{g_i^2}{\Delta_i}$ is the Lamb-shifted qubit frequency. This Hamiltonian $H_{\text{eff}}$ shows **two effects of the coupling**: (1) a small renormalization of each qubit’s frequency, and (2) a resonant exchange interaction of strength $J$ between the qubits. We have thus derived how the cavity mediates an effective qubit–qubit coupling even when the qubits never directly interact.
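
The second-order results can be sanity-checked against exact diagonalization of the full two-qubit Jaynes-Cummings Hamiltonian. The following is a minimal sketch, not part of the original derivation: the parameter values (`wr`, `wq`, `g`, in arbitrary angular-frequency units) and the helper names are illustrative assumptions. It uses the expression for $J$ in terms of the physical frequencies and, for two resonant qubits, compares the exact splitting of the two qubit-like single-excitation levels with $2|J|$ and their mean with the Lamb-shifted frequency.

```python
import numpy as np

def destroy(n):
    """Photon annihilation operator truncated to n Fock states."""
    return np.diag(np.sqrt(np.arange(1, n)), k=1)

def two_qubit_jc(wr, wq1, wq2, g1, g2, n_cav=4):
    """Two-qubit Jaynes-Cummings Hamiltonian; tensor order is qubit1 x qubit2 x cavity."""
    I2, Ic = np.eye(2), np.eye(n_cav)
    sz = np.diag([1.0, -1.0])                 # ground state |0> has sigma_z = +1
    sm = np.array([[0.0, 0.0], [1.0, 0.0]])   # sigma_- = |1><0| (excites the qubit here)
    sp = sm.T                                 # sigma_+ = |0><1|
    a = destroy(n_cav)
    k3 = lambda A, B, C: np.kron(np.kron(A, B), C)
    H = wr * k3(I2, I2, a.T @ a)
    H += -0.5 * wq1 * k3(sz, I2, Ic) - 0.5 * wq2 * k3(I2, sz, Ic)
    H += g1 * (k3(sm, I2, a) + k3(sp, I2, a.T))
    H += g2 * (k3(I2, sm, a) + k3(I2, sp, a.T))
    return H

def effective_J(wr, wq1, wq2, g1, g2):
    """Second-order exchange coupling J written in terms of the physical frequencies."""
    return g1 * g2 * (wq1 + wq2 - 2 * wr) / (2 * (wr - wq1) * (wr - wq2))

# Illustrative (assumed) parameters: resonant qubits below the resonator.
wr, wq, g = 7.0, 5.0, 0.1
E = np.linalg.eigvalsh(two_qubit_jc(wr, wq, wq, g, g))
exc = E - E[0]                      # excitation energies above the ground state
splitting = exc[2] - exc[1]         # the two qubit-like single-excitation levels
mean_freq = 0.5 * (exc[1] + exc[2])
J = effective_J(wr, wq, wq, g, g)

print("exact splitting:", splitting, " 2|J|:", 2 * abs(J))
print("exact mean frequency:", mean_freq, " Lamb-shifted:", wq - g**2 / (wr - wq))
```

The residual difference between the exact and perturbative numbers is of higher order in $g/\Delta$, which is the expected accuracy of a second-order Schrieffer-Wolff treatment.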

### 4.2.2 The iSWAP Gate

If two qubits are *identical and resonant* ($\omega_{q1} = \omega_{q2}$) and have equal coupling $g_1 = g_2 = g$, the effective coupling $J$ derived above simplifies. In that symmetric case, $\Delta_1 = \Delta_2 = \Delta$, so


$$
J \;=\; -\frac{g^2}{\Delta}\,,
$$


and the effective Hamiltonian (assuming the resonator is kept in vacuum and focusing on qubit subspace) becomes:


$$
H_{\text{eff}} \;=\; -\frac{1}{2}\big(\omega_q - \frac{g^2}{\Delta}\big)(\sigma_{z1} + \sigma_{z2}) \;-\; \frac{g^2}{\Delta}\Big(\sigma_{1+}\sigma_{2-} + \sigma_{2+}\sigma_{1-}\Big)\,. 
$$


In the *interaction picture* that rotates away the single-qubit terms (i.e. moving to a frame rotating with $\omega_q - \frac{g^2}{\Delta}$ for each qubit), the $\sigma_z$ terms disappear and we are left with purely the exchange interaction term:


$$
H_{\text{int}} \;=\; -\frac{g^2}{\Delta}\Big(\sigma_{1+}\sigma_{2-} + \sigma_{2+}\sigma_{1-}\Big)\,. 
$$


We recognize this as the Hamiltonian for the **iSWAP** interaction. In the basis $\{|00\rangle, |01\rangle, |10\rangle, |11\rangle\}$, this Hamiltonian exchanges $|01\rangle \leftrightarrow |10\rangle$ and leaves $|00\rangle$ and $|11\rangle$ unchanged. The time-evolution operator (which acts non-trivially only on the single-excitation subspace) can be written as:


$$
U_{\text{eff}}(t) \;=\; \exp\Big\{i\frac{g^2}{\Delta}t\big(|01\rangle\langle 10| + |10\rangle\langle 01|\big)\Big\} \;=\; 
\begin{pmatrix}
1 & 0 & 0 & 0 \\
0 & \cos\!\frac{g^2 t}{\Delta} & i\sin\!\frac{g^2 t}{\Delta} & 0 \\
0 & i\sin\!\frac{g^2 t}{\Delta} & \cos\!\frac{g^2 t}{\Delta} & 0 \\
0 & 0 & 0 & 1 
\end{pmatrix}\,,
$$


where the matrix is written in the $\{|00\rangle,|01\rangle,|10\rangle,|11\rangle\}$ basis. If we set the interaction time $t$ such that $\frac{g^2}{\Delta}t = \frac{\pi}{2}$ (with $\hbar = 1$, so that $g^2/\Delta$ is an angular frequency), the cosines in the central block vanish and its off-diagonal entries become $i$. Specifically, for $t = \frac{\pi\Delta}{2g^2}$,


$$
U_{\text{eff}}\Big(\frac{\pi\Delta}{2g^2}\Big) \;=\; 
\begin{pmatrix}
1 & 0 & 0 & 0 \\
0 & 0 & i & 0 \\
0 & i & 0 & 0 \\
0 & 0 & 0 & 1 
\end{pmatrix}\,,
$$


which (up to trivial single-qubit phase rotations that can be adjusted) corresponds to the **iSWAP gate**. The iSWAP gate swaps the $|01\rangle$ and $|10\rangle$ states and adds a phase of $i$ (90 degrees) to those components, which is a canonical two-qubit entangling gate.
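
As a quick numerical check, one can exponentiate the exchange Hamiltonian $H_{\text{int}}$ above and confirm that the gate time $t = \pi\Delta/(2g^2)$ reproduces the iSWAP matrix. This is a sketch under assumed, illustrative values of `g` and `delta`; the helper `exchange_unitary` is a hypothetical name, not something defined in the thesis.

```python
import numpy as np

def exchange_unitary(J, t):
    """exp(-i H t) for H = J (|01><10| + |10><01|) in the basis |00>, |01>, |10>, |11>."""
    H = np.zeros((4, 4))
    H[1, 2] = H[2, 1] = J
    evals, evecs = np.linalg.eigh(H)
    return evecs @ np.diag(np.exp(-1j * evals * t)) @ evecs.conj().T

# Illustrative (assumed) parameters in angular-frequency units.
g, delta = 0.1, 2.0
J = -g**2 / delta                      # symmetric case: H_int = -(g^2/Delta)(exchange)
t_gate = np.pi * delta / (2 * g**2)    # chosen so that (g^2/Delta) * t = pi/2

U = exchange_unitary(J, t_gate)
iswap = np.array([[1, 0, 0, 0],
                  [0, 0, 1j, 0],
                  [0, 1j, 0, 0],
                  [0, 0, 0, 1]])
print(np.allclose(U, iswap))
```

Stopping the evolution at half this time gives a partial swap (often called $\sqrt{\text{iSWAP}}$) instead of the full exchange.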

In physical implementation, the iSWAP gate is typically realized by **tuning the qubits into resonance** ($\omega_{q1} \approx \omega_{q2}$) for a specific duration. In our derivation, we assumed they were already resonant and derived the effective coupling in the dispersive limit. However, if the qubits are far detuned initially (to avoid unwanted interaction during idle periods), one can **flux-tune** them into resonance to activate this exchange coupling $J$ and then detune them again. When the qubits are off-resonance ($\omega_{q1} \neq \omega_{q2}$), the $\sigma_{1+}\sigma_{2-}$ terms in $H_{\text{eff}}$ will acquire a rapid phase $e^{\pm i(\omega_{q1}-\omega_{q2})t}$ in the rotating frame, effectively averaging out and producing no net swap (thus interaction is "off"). This is why detuning disables the interaction, whereas tuning them into resonance (making $\omega_{q1}\approx\omega_{q2}$) allows the swap to occur coherently.

In summary, the iSWAP gate uses the exchange interaction mediated by the resonator. By timing the interaction for the appropriate duration, one can achieve a full iSWAP (or a partial swap if desired). In practice, the *speed* of the iSWAP gate is limited by the magnitude of the coupling, $|J| = g^2/|\Delta|$. To make the gate faster, one can reduce the detuning $\Delta$, but one must then take care to remain in the regime $g \ll |\Delta|$, where leakage to the resonator is still perturbative. The iSWAP gate we derived is an example of using **tunable qubits** (via flux) to enact an entangling gate.

### 4.2.3 The Cross-Resonance Gate

The **Cross-Resonance (CR) gate** is an all-microwave two-qubit entangling gate implemented on fixed-frequency qubits with an always-on coupling. Originally demonstrated on capacitively shunted flux qubits by Rigetti *et al.* and later adapted to transmons, the CR gate uses a microwave drive on one qubit (the *control* qubit) at the *resonant frequency of the other qubit* (the *target*). The effect of this drive, in presence of the always-on coupling, is to produce an effective conditional rotation of the target qubit.

To understand the CR gate, we start from the effective two-qubit Hamiltonian we derived in the previous section, but now we **omit the cavity mode entirely**, since we assume we have already diagonalized out the resonator. In other words, consider two qubits with a direct exchange coupling $J$ as derived above. The effective Hamiltonian (in the absence of any drives) is:


$$
H \;=\; -\frac{\omega_1}{2}\,\sigma_{z1} \;-\; \frac{\omega_2}{2}\,\sigma_{z2} \;+\; J\big(\sigma_{1+}\sigma_{2-} + \sigma_{2+}\sigma_{1-}\big)\,,
$$


where $\omega_i = \omega_{q i}'$ are the Lamb-shifted qubit frequencies (we drop the prime for simplicity). We assume $|\omega_1 - \omega_2| \gg |J|$ (the qubits are **far detuned** from each other). In this regime, the exchange interaction $J$ is **weak** and largely off-resonant, meaning one qubit’s excitation does not resonantly swap to the other due to the large energy mismatch. Nonetheless, $J$ still causes small shifts in the energy levels via second-order processes. If qubit 1 and qubit 2 are far detuned, one can perform a second SW transformation to *diagonalize* the exchange coupling and expose these residual effects (the same level-repulsion mechanism that, once higher transmon levels are included, produces the **ZZ coupling**).

We perform a second SW transformation with generator chosen to cancel the $J$ exchange terms. Specifically, take:


$$
S^{(1)}_{ZZ} \;=\; \frac{J}{\Delta_{12}}\Big(\sigma_{1+}\sigma_{2-} - \sigma_{2+}\sigma_{1-}\Big)\,,
$$


where $\Delta_{12} = \omega_1 - \omega_2$ is the detuning between the two qubits. This $S$ is analogous to the one we used for the resonator, now treating the exchange interaction as a perturbation. Commutator calculations (similar to before) show that to second order, the exchange term will induce a **frequency shift** on each qubit’s $\sigma_z$ of order $J^2/\Delta_{12}$. In other words, after this transformation the Hamiltonian becomes approximately:


$$
H_{\text{eff}} \;=\; -\frac{\omega_1 + \frac{J^2}{\Delta_{12}}}{2}\,\sigma_{z1} \;-\; \frac{\omega_2 - \frac{J^2}{\Delta_{12}}}{2}\,\sigma_{z2}\,,
$$


with the exchange coupling removed to this order. The two qubits’ frequencies are slightly renormalized: qubit 1 shifts by $+\frac{J^2}{\Delta_{12}}$ and qubit 2 by $-\frac{J^2}{\Delta_{12}}$. (If $\omega_1 > \omega_2$, then $\Delta_{12} > 0$ and qubit 1’s frequency shifts up: the levels $|10\rangle$ and $|01\rangle$ repel each other.) In a strictly two-level model these are purely single-qubit shifts: $|00\rangle$ and $|11\rangle$ are untouched by the exchange term, so $E_{11} + E_{00} - E_{10} - E_{01} = 0$ and no $\sigma_{z1}\sigma_{z2}$ term appears at this order. A static **ZZ coupling** (also called **residual $ZZ$ crosstalk**), meaning that the energy of $|11\rangle$ differs from the sum of the energies of $|10\rangle$ and $|01\rangle$, arises from the same kind of level repulsion once the higher transmon levels are included (Section 4.3), because $|11\rangle$ then couples to $|02\rangle$ and $|20\rangle$. Such a residual coupling is undesirable during idle operation (it causes unwanted conditional phase accumulation). Here, the weak hybridization of the qubit states by $J$ sets the stage for the cross-resonance interaction.
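
The frequency renormalization by $\pm J^2/\Delta_{12}$ can be compared with exact diagonalization of the two-qubit exchange Hamiltonian. The sketch below uses assumed, illustrative values for `w1`, `w2` and `J` (arbitrary angular-frequency units) and a hypothetical helper `two_qubit_exchange`; it follows this chapter's convention that the ground state has $\sigma_z = +1$.

```python
import numpy as np

def two_qubit_exchange(w1, w2, J):
    """H = -w1/2 sz1 - w2/2 sz2 + J (s1+ s2- + s2+ s1-), basis |00>, |01>, |10>, |11>."""
    I2 = np.eye(2)
    sz = np.diag([1.0, -1.0])                 # ground state |0> has sigma_z = +1
    sm = np.array([[0.0, 0.0], [1.0, 0.0]])   # sigma_- = |1><0|
    sp = sm.T                                 # sigma_+ = |0><1|
    H = -0.5 * w1 * np.kron(sz, I2) - 0.5 * w2 * np.kron(I2, sz)
    H += J * (np.kron(sp, sm) + np.kron(sm, sp))
    return H

# Illustrative (assumed) parameters with |w1 - w2| >> |J|.
w1, w2, J = 5.2, 5.0, 0.005
delta12 = w1 - w2

E = np.linalg.eigvalsh(two_qubit_exchange(w1, w2, J))  # sorted: ~|00>, ~|01>, ~|10>, |11>
f2_exact = E[1] - E[0]   # dressed frequency of qubit 2 (the lower-frequency qubit)
f1_exact = E[2] - E[0]   # dressed frequency of qubit 1

print("qubit 1 shift:", f1_exact - w1, " expected:", +J**2 / delta12)
print("qubit 2 shift:", f2_exact - w2, " expected:", -J**2 / delta12)
```

The agreement degrades as $|J|/|\Delta_{12}|$ grows, which marks the edge of the regime where the second SW transformation is justified.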

Now, suppose we **drive qubit 1 at the (adjusted) frequency of qubit 2** (i.e., at $\omega_2$). We apply a drive Hamiltonian on qubit 1:


$$
H_d(t) \;=\; \frac{\Omega(t)}{2}\Big(\sigma_{1+}\,e^{-i\omega_2 t} + \sigma_{1-}\,e^{i\omega_2 t}\Big)\,,
$$


where $\Omega(t)$ is the drive amplitude (Rabi frequency) on qubit 1 and we assume the drive phase is zero for simplicity. This drive is resonant with *qubit 2’s* frequency (in the shifted basis, $\omega_2$). We are in a situation reminiscent of a 2-level atom (qubit 1) being driven at the frequency of another 2-level system (qubit 2) to which it’s coupled. Intuition: The state of qubit 1 (which we’ll consider the control) will modulate how qubit 2 (target) responds to the drive on qubit 1.

To analyze this, we transform into the *interaction frame* that diagonalizes the static part (including the small $J^2/\Delta_{12}$ shifts) **and** rotates qubit 1 at $\omega_2$. After doing so, and also including the SW diagonalization from above, the effective driven Hamiltonian (in the rotating frame of $\omega_2$ on qubit 1) becomes:


$$
H^{\text{(diag frame)}}_d(t) \;=\; U_{\text{SW}} \,U_{\text{rot}}\,H_d(t)\,U_{\text{rot}}^\dagger U_{\text{SW}}^\dagger \;\approx\; \frac{\Omega(t)}{2}\Big(\sigma_{x1} \;-\; \frac{J}{\Delta_{12}}\,\sigma_{z1}\sigma_{x2}\Big)\,. 
$$


Here, $\sigma_{x1} = \sigma_{1+} + \sigma_{1-}$ is a Pauli $X$ on qubit 1 (control), and $\sigma_{z1}\sigma_{x2}$ represents a *conditional $X$ on qubit 2* (target): since $\sigma_{z1}$ is diagonal in the computational basis, it takes the value $+1$ if qubit 1 is in $|0\rangle$ and $-1$ if qubit 1 is in $|1\rangle$, thus flipping the sign of the $\sigma_{x2}$ term. The coefficient of the $\sigma_{z1}\sigma_{x2}$ term is $-\frac{J}{\Delta_{12}}\frac{\Omega(t)}{2}$ (note that it is suppressed by the ratio $J/\Delta_{12}$). The full effective Hamiltonian in this interaction frame is then:


$$
H_{\text{eff}}(t) \;=\; -\frac{\delta}{2}\,\sigma_{z1} \;+\; \frac{\Omega(t)}{2}\Big(\sigma_{x1} - \frac{J}{\Delta_{12}}\,\sigma_{z1}\sigma_{x2}\Big)\,. 
$$


Here $\delta = \omega_1 - \omega_2$ is the detuning between the qubits (so qubit 1 retains a $Z$ term of half the detuning in this rotating frame). The $\sigma_{x1}$ term is just a drive on the control qubit; it might be canceled by moving to the rotating frame of qubit 1’s actual drive frequency if needed (or by calibrating a $\pi$ pulse on qubit 1 that can be absorbed). The crucial term is $-\frac{J}{\Delta_{12}}\frac{\Omega(t)}{2}\,\sigma_{z1}\sigma_{x2}$. This is a **ZX interaction**: an $X$ rotation on qubit 2 whose direction (sign) depends on the state of qubit 1. In other words, depending on whether qubit 1 is in $|0\rangle$ or $|1\rangle$, qubit 2 is driven by $-\frac{J\Omega}{2\Delta_{12}}\,\sigma_{x2}$ or $+\frac{J\Omega}{2\Delta_{12}}\,\sigma_{x2}$. This is exactly the kind of term needed for a controlled-NOT gate (up to basis rotations): a term $\sigma_{z1}\sigma_{x2}$ generates a rotation on qubit 2 about the $x$-axis that is conditioned on qubit 1’s state.

The presence of the $-\frac{\delta}{2}\sigma_{z1}$ term means that in this frame, qubit 1 accumulates a phase if not addressed – but since qubit 1 is the control, one typically doesn’t care if it’s rotating in the interaction frame, or one can echo it out. The direct $\sigma_{x1}$ drive term indicates that the drive on qubit 1 still directly affects qubit 1 (this is sometimes called the spectator or Stark effect on the control), but this can be calibrated away or echoed as well in an experiment.

What’s important is that we have obtained a Hamiltonian of the form:


$$
H_{\text{CR}} \;\approx\; \frac{\Omega}{2}\,\sigma_{x1} \;-\; \frac{\Omega J}{2\Delta_{12}}\,\sigma_{z1}\sigma_{x2}\,. 
$$


If we choose the gate duration $t_g$ such that $\frac{\Omega J}{2\Delta_{12}}\,t_g = \frac{\pi}{4}$, qubit 2 is rotated by an angle $\pi/2$ about the $x$-axis in a sense that depends on the state of qubit 1. If we simultaneously arrange single-qubit rotations to cancel the $\sigma_{x1}$ term and any unwanted $Z$ rotations, we can implement a controlled-NOT (controlled-$X$) gate. In practice, the **CR gate** is implemented by driving continuously for a time to accumulate the right conditional rotation angle, while using calibration pulses to remove any single-qubit rotations on the control and target that also occur. The output is an entangling gate typically equivalent to a CNOT up to single-qubit basis changes. In the example above, if run for the correct time, it yields an operation where qubit 1 (in the computational basis) controls an $X$-rotation on qubit 2.
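
The claim that the ZX term yields a CNOT up to single-qubit rotations can be verified numerically. The sketch below keeps only the $\sigma_{z1}\sigma_{x2}$ part of $H_{\text{CR}}$ (the $\sigma_{x1}$ term is assumed to have been cancelled, as discussed above), uses assumed illustrative values for `omega`, `J` and `delta12`, and applies one particular choice of local corrections, a $Z$ rotation on the control and an $X$ rotation on the target; other equivalent choices exist.

```python
import numpy as np

I2 = np.eye(2)
X = np.array([[0.0, 1.0], [1.0, 0.0]])
Z = np.diag([1.0, -1.0])

def evolve(H, t):
    """exp(-i H t) for a Hermitian matrix H, via eigendecomposition."""
    w, v = np.linalg.eigh(H)
    return v @ np.diag(np.exp(-1j * w * t)) @ v.conj().T

# Illustrative (assumed) drive amplitude, exchange coupling and qubit-qubit detuning.
omega, J, delta12 = 0.05, 0.005, 0.2
nu = omega * J / (2 * delta12)        # magnitude of the ZX coefficient
t_gate = np.pi / (4 * nu)             # nu * t_gate = pi/4, i.e. a pi/2 conditional rotation

ZX = np.kron(Z, X)                    # qubit 1 (control) x qubit 2 (target)
U_zx = evolve(-nu * ZX, t_gate)       # evolution under H = -nu * ZX only

# Local corrections: exp(-i pi/4 Z) on the control, exp(-i pi/4 X) on the target.
local = evolve(np.kron(Z, I2), np.pi / 4) @ evolve(np.kron(I2, X), np.pi / 4)
U = np.exp(1j * np.pi / 4) * local @ U_zx

cnot = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]])
print(np.allclose(U, cnot))
```

Because all three generators ($Z \otimes I$, $I \otimes X$ and $Z \otimes X$) commute, the order in which the corrections are applied does not matter in this idealized setting.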

Experimental realizations of the CR gate have achieved high fidelities, although the single-pulse figure in the reference given is more modest, on the order of 80–90% ($F_{\text{CR}} \approx 81\%$ in 220 ns). Errors come from many sources: the $\sigma_{x1}$ spectator term and other off-resonant effects need cancellation, and there is decoherence and leakage to non-computational states (especially because transmons have higher levels). Nonetheless, the CR gate remains a cornerstone of fixed-frequency transmon architectures (like IBM’s devices) since it requires no tunability and only microwave control.

In summary, the cross-resonance gate is an entangling gate where driving one qubit at the other’s frequency causes a two-qubit interaction of the ZX form. This highlights how the always-on **residual coupling** between detuned qubits, combined with an off-resonant drive, can be harnessed to produce conditional operations, albeit at the cost of careful calibration to isolate the desired term.

## 4.3 Two-Qubit Gates Using Transmons (Multi-Level Effects)

In the preceding analysis of iSWAP and cross-resonance, we **treated qubits as two-level systems**. However, transmon qubits are *weakly anharmonic oscillators* with multiple energy levels ($\{|0\rangle,|1\rangle,|2\rangle,\dots\}$). The transmon’s low anharmonicity (typically only a few percent of the qubit frequency) means higher levels can become populated or virtually occupied during gates, and these levels significantly affect gate dynamics and **ZZ interactions**. Thus, to accurately analyze two-qubit gates with transmons, we must extend our perturbative treatment to **multi-level systems**.

In this section, we incorporate the first excited non-computational levels ($|2\rangle$ states of each transmon, sometimes also $|3\rangle$, etc.) into the effective Hamiltonians. We will see that new interaction channels emerge, and some gates (bSWAP, MAP) explicitly take advantage of higher levels by tuning certain level spacings into resonance.

Consider two transmons coupled to the same resonator (or directly to each other) as in Section 4.2, but now model each transmon as an anharmonic oscillator (Duffing oscillator). The Hamiltonian can be written as:


$$
H \;=\; \omega_r\,a^\dagger a \;+\; \sum_{i=1}^2 \sum_{j_i \ge 0} \Big[ \big(\omega_i - \frac{\delta_i}{2}\big) j_i \;+\; \frac{\delta_i}{2} j_i^2 \Big]\,|j_i\rangle\langle j_i| \;+\; \sum_{i=1}^2 g_i\big(a\,c_i^\dagger + a^\dagger c_i\big)\,. 
$$


Here $|j_i\rangle$ denotes the $j$-th level of transmon $i$, $\omega_i$ is the transition frequency between $|0\rangle$ and $|1\rangle$ of qubit $i$, and $\delta_i$ is its **anharmonicity** (the difference between the $|1\rangle\rightarrow|2\rangle$ transition frequency and the $|0\rangle\rightarrow|1\rangle$ frequency, which is negative for a transmon). From the Hamiltonian above, the energy of level $|j\rangle$ (relative to $|0\rangle$) is $j\omega_i + \frac{\delta_i}{2} j(j-1)$, so the level spacing is $\omega_{i} + \delta_i(j-1)$ for $|j-1\rangle \to |j\rangle$. The operator $c_i$ is the **lowering operator** for transmon $i$ ($c_i |j_i\rangle = \sqrt{j_i}\,|j_i-1\rangle$), and $c_i^\dagger$ its conjugate, analogous to an oscillator ladder operator but acting on levels with nonlinear spacings. The coupling term $a\,c_i^\dagger + a^\dagger c_i$ generalizes $a\,\sigma_{i-} + a^\dagger\sigma_{i+}$ to allow transitions beyond the first excited state.
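
To make the level structure concrete, the short sketch below evaluates the transmon energies exactly as they appear in the Hamiltonian above, $E_j = (\omega - \frac{\delta}{2})j + \frac{\delta}{2}j^2$, and builds the truncated lowering operator $c$. The numerical values of `omega` and `delta` are illustrative assumptions, and the function names are hypothetical.

```python
import numpy as np

def transmon_levels(omega, delta, n_levels=4):
    """Level energies E_j = (omega - delta/2) j + (delta/2) j^2, measured from |0>."""
    j = np.arange(n_levels)
    return (omega - delta / 2) * j + (delta / 2) * j**2

def lowering(n_levels=4):
    """Truncated lowering operator with c|j> = sqrt(j) |j-1>."""
    return np.diag(np.sqrt(np.arange(1, n_levels)), k=1)

# Illustrative (assumed) values: a negative anharmonicity of a few percent of omega.
omega, delta = 5.0, -0.3
levels = transmon_levels(omega, delta)
transitions = np.diff(levels)        # frequencies of the |j> -> |j+1> transitions

print("levels:", levels)
print("transitions:", transitions)   # omega, omega + delta, omega + 2*delta, ...

c = lowering()
ket2 = np.zeros(4)
ket2[2] = 1.0
print("c|2> =", c @ ket2)            # sqrt(2) times |1>
```

With a negative $\delta$, each successive transition lies lower in frequency than the previous one, which is why the $|1\rangle \to |2\rangle$ transition sits close to, but below, the qubit transition.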

To treat this perturbatively, we again use a Schrieffer-Wolff transformation. The structure is similar: we choose an $S^{(1)}$ to eliminate $H_2 = \sum_i g_i(a c_i^\dagger + a^\dagger c_i)$. By analogy with the two-level case, each transmon transition has its own detuning from the cavity, $\omega_r - \omega_{i,j}$, where $\omega_{i,j}$ is the $|j-1\rangle \to |j\rangle$ transition frequency. Let’s denote $\omega_{i,j} = \omega_i + (j-1)\delta_i$ for $j\ge1$ (each successive transition is shifted by $\delta_i$, i.e. lowered, since $\delta_i < 0$). Then the detuning of the $|j-1\rangle \to |j\rangle$ transition from the cavity is $\Delta_{i,j} = \omega_r - \omega_{i,j}$.

We take the ansatz for the SW generator:


$$
S^{(1)} \;=\; \sum_{i=1}^2 g_i \big(a\,\tilde{c}_i^\dagger - a^\dagger \tilde{c}_i\big)\,,
$$


where $\tilde{c}_i$ is a slightly modified lowering operator that has the appropriate detuning factors built in for each transition. In fact, one can define


$$
\tilde{c}_i \;=\; \sum_{j\ge0} \gamma_{j+1}^{(i)}\,|j_i\rangle\langle j_i+1|\,,
$$


with coefficients $\gamma_{j+1}^{(i)} = \sqrt{j+1}/(\omega_r - \omega_i - \delta_i j)$. Essentially, $\tilde{c}_i$ is like $c_i$ but each rung has been weighted by the inverse detuning of the corresponding $|j\rangle \leftrightarrow |j+1\rangle$ transition from the cavity. This ensures $[H_0, S^{(1)}] \approx -H_2$.

Carrying through the perturbation calculation (which is lengthy but can be done systematically, possibly with computer algebra as in the thesis), one finds an effective Hamiltonian to second order. Rather than derive every term, we summarize the structure:


$$
H_{\text{eff}} \;=\; \Big(\omega_r + \sum_{i,j_i} \chi_{j_i}^{(i)}\,|j_i\rangle\langle j_i|\Big) a^\dagger a \;+\; \sum_{i,j_i} \omega_{j_i}^{(i)}\,|j_i\rangle\langle j_i| \;-\; \sum_{i\neq k} \frac{g_i g_k}{2}\big(c_i \tilde{c}_k^\dagger + c_i^\dagger \tilde{c}_k\big)\,. 
$$


Let’s unpack this:

* The cavity frequency $\omega_r$ is shifted by $\chi_{j_i}^{(i)}$ depending on the occupation $j_i$ of each transmon. These $\chi_{j_i}$ are the generalized dispersive shifts for level $|j_i\rangle$ of transmon $i$ (they generalize $g^2/\Delta$ from the two-level case). For example, $\chi_1^{(i)} = g_i^2/(\omega_r - \omega_i)$ for the first level, and higher levels have their own $\chi$ given by more complex formulas (see Eq. (3.16) in the thesis for explicit form).

* $\omega_{j_i}^{(i)}$ are the *renormalized* energies of the transmon levels (including Lamb shifts from the coupling). For instance, $\omega_{0}^{(i)} = 0$ by definition, $\omega_{1}^{(i)} = \omega_i -$ some shift, $\omega_{2}^{(i)} = 2\omega_i - \delta_i -$ some shift, etc. Equation (3.17) in the thesis gives these frequencies with corrections. Essentially, each transmon level $|j\rangle$ gets a small negative shift due to $g^2/\Delta$ terms (like the two-level Lamb shift, but modified for each level).

* The last term is the **coupling between transmons**. After eliminating the cavity, the two transmons are effectively coupled by an operator $H_{\text{coupling}} = -\sum_{i\neq k}\frac{g_i g_k}{2}(c_i \tilde{c}_k^\dagger + \text{h.c.})$. In more intuitive form, one can rewrite this coupling as a sum over transitions between *pairs of levels* on the two transmons. Specifically, one finds:

  
  $$
  H_{\text{coupling}} \;=\; \sum_{j_1, j_2} J_{j_1,j_2}\,\sqrt{(j_1+1)(j_2+1)}\,\Big(|j_1,\,j_2+1\rangle\langle j_1+1,\,j_2| + \text{h.c.}\Big)\,,
  $$
  

  where $|j_1, j_2+1\rangle\langle j_1+1, j_2|$ is the operator that transitions transmon 1 from $|j_1+1\rangle$ to $|j_1\rangle$ while simultaneously transitioning transmon 2 from $|j_2\rangle$ to $|j_2+1\rangle$ (i.e. an excitation hops from one transmon to the other). The **effective coupling rate** $J_{j_1,j_2}$ for that pair of levels is given by:

  
  $$ 
  J_{j_1,j_2} \;=\; \frac{g_1 g_2\,\big(\omega_1 + \omega_2 + \delta_1 j_1 + \delta_2 j_2 - 2\omega_r\big)}{2\,(\omega_r - \omega_1 - \delta_1 j_1)\,(\omega_r - \omega_2 - \delta_2 j_2)}\,. 
  $$
  

  This formula, which can be derived from the above coupling term, matches the result by Gambetta *et al.* (Eq. (54) in Ref. 16 of the thesis). It generalizes our earlier $J$ (which corresponded to $J_{0,0}$) to cases where one or both transmons are in higher excitations.

  A key point: if the transmon anharmonicities $\delta_i$ are small and we restrict to low levels, $J_{j_1,j_2}$ varies only weakly with $j_1, j_2$. In the limit of zero anharmonicity (pure harmonic oscillators), all $J_{j_1,j_2}$ become the same constant and $c_i \approx \tilde{c}_i$, so the coupling term reduces to $J(c_1 c_2^\dagger + c_1^\dagger c_2)$, i.e. two linearly coupled harmonic oscillators (swapping any number of excitations). For real transmons with finite $\delta$, $J_{j_1,j_2}$ does vary with $j$, but not dramatically for low $j$. One can therefore often approximate $J_{j_1,j_2} \approx J$ for the lowest few levels, treating the coupling as roughly **independent of level**. Under that approximation, one replaces $\tilde{c}_i$ by $c_i$ and $J_{j_1,j_2}$ by $J$, yielding:

  
  $$ 
  H_{\text{coupling}} \;\approx\; J\big(c_1 c_2^\dagger + c_1^\dagger c_2\big)\,. 
  $$
  

  This has the same form as the two-level coupling but now acts on the full transmon Hilbert space. Indeed, if we further *restrict* to the qubit subspace $\{|0\rangle,|1\rangle\}$ of each transmon (the computational subspace), $c_i$ reduces to $\sigma_{i-}$ and the coupling $J c_1 c_2^\dagger$ becomes $J\,\sigma_{1-}\sigma_{2+}$ as before, so the two descriptions are consistent. The new insight is that outside the computational subspace the same coupling connects, for example, $|11\rangle$ to $|02\rangle$ and $|20\rangle$, moving one transmon into its $|2\rangle$ level. Such transitions are usually unwanted **leakage** processes during gates, but some gate schemes *intentionally* drive them.

  In summary, after the SW transformation we obtain an effective Hamiltonian in which the two transmons are directly coupled. If we *only care about the qubit subspace*, we can project $H_{\text{coupling}}$ onto $\{|0\rangle,|1\rangle\}$ for each transmon by keeping only $j_1=j_2=0$. Then $J_{0,0} = \frac{g_1 g_2(\omega_1+\omega_2-2\omega_r)}{2(\omega_r-\omega_1)(\omega_r-\omega_2)}$, and the coupling term becomes $J_{0,0}(|0,1\rangle\langle 1,0| + |1,0\rangle\langle 0,1|)$. Written in Pauli form, that is $J_{0,0}(\sigma_{1+}\sigma_{2-} + \sigma_{1-}\sigma_{2+})$, which matches our earlier two-level result. The two-level approximation of $H_{\text{eff}}$ thus recovers the effective Hamiltonian of Section 4.2.
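
The level-dependent rate $J_{j_1,j_2}$ is easy to evaluate numerically. The following is a minimal sketch, assuming illustrative parameter values in GHz that are not taken from the thesis; the function name `exchange_rate` is hypothetical. It also checks that $J_{0,0}$ coincides with the familiar form $\tfrac{g_1 g_2}{2}\big(\tfrac{1}{\Delta_1}+\tfrac{1}{\Delta_2}\big)$ with $\Delta_i=\omega_i-\omega_r$.

```python
def exchange_rate(j1, j2, g1, g2, w1, w2, d1, d2, wr):
    """J_{j1,j2}: exchange rate for |j1+1, j2> <-> |j1, j2+1>."""
    t1 = w1 + d1 * j1  # |j1> -> |j1+1> transition frequency of transmon 1
    t2 = w2 + d2 * j2  # |j2> -> |j2+1> transition frequency of transmon 2
    return g1 * g2 * (t1 + t2 - 2 * wr) / (2 * (wr - t1) * (wr - t2))

# Illustrative parameters in GHz (assumed, not taken from the thesis)
P = dict(g1=0.1, g2=0.1, w1=5.0, w2=5.2, d1=-0.3, d2=-0.3, wr=7.0)

J00 = exchange_rate(0, 0, **P)
J10 = exchange_rate(1, 0, **P)

# Two-level form written with detunings Delta_i = w_i - wr
D1, D2 = P["w1"] - P["wr"], P["w2"] - P["wr"]
J_two_level = 0.5 * P["g1"] * P["g2"] * (1 / D1 + 1 / D2)

print(f"J00 = {J00:.6f}, J10 = {J10:.6f}, two-level J = {J_two_level:.6f}")
```

For these assumed numbers $J_{1,0}$ differs from $J_{0,0}$ by only a few percent, which illustrates why the level-independent approximation $J_{j_1,j_2}\approx J$ is often acceptable for the lowest levels.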

The power of the multi-level picture is that now we can consider **gate schemes that use higher levels**. Two such gates are discussed: **bSWAP** (Section 4.3.1) and **MAP (Microwave-Activated Phase)** gate (Section 4.3.2). These gates deliberately exploit level resonances involving $|2\rangle$ or $|3\rangle$ states to induce specific two-qubit interactions.

### 4.3.1 The bSWAP Gate

The *bSWAP gate* (also called the Bell-Rabi gate) is a two-transmon entangling gate proposed by Poletto *et al.*. This gate explicitly uses the second excited state of the transmon (the $|2\rangle$ level) to achieve a two-photon transition between $|00\rangle$ and $|11\rangle$. In a harmonic system, a direct $|00\rangle \leftrightarrow |11\rangle$ transition is forbidden by selection rules (it would require two photons at once). But because the transmon is *anharmonic*, the two-photon process becomes allowed if certain resonance conditions are met.

The setup for bSWAP is two transmons coupled (as above) **without a resonator** (we consider the effective direct coupling Hamiltonian between transmons after eliminating the resonator). The Hamiltonian is a pair of Duffing oscillators with a coupling term:


$$
H \;=\; \Big(\omega_1 - \frac{\delta_1}{2}\Big)c_1^\dagger c_1 \;+\; \frac{\delta_1}{2}(c_1^\dagger c_1)^2 \;+\; \Big(\omega_2 - \frac{\delta_2}{2}\Big)c_2^\dagger c_2 \;+\; \frac{\delta_2}{2}(c_2^\dagger c_2)^2 \;+\; J\big(c_1^\dagger c_2 + c_1 c_2^\dagger\big)\,. 
$$


This is basically the two-transmon Hamiltonian from Eq. (4.11) but dropping small second-order frequency shifts from the cavity (assuming those are calibrated out or negligible). We have an always-on coupling $J$.

The idea behind bSWAP is to apply a microwave **drive that is near-resonant with the $|00\rangle \to |11\rangle$ two-photon transition**. To facilitate this, one sets up a specific detuning between the qubits so that **$|00\rangle$, $|11\rangle$, and one of the $|20\rangle$ or $|02\rangle$ states form a low-energy manifold** that is relatively isolated from other levels. Concretely, suppose the qubit frequencies $\omega_1$ and $\omega_2$ are chosen such that $\Delta_{12} = \omega_1 - \omega_2$ is approximately $-\delta_1$. In the Duffing Hamiltonian above, level $|2\rangle$ of transmon $i$ has energy $2\omega_i + \delta_i$, so $\delta_i$ is a signed anharmonicity (negative for a transmon, because the $|1\rangle\to|2\rangle$ transition lies below the $|0\rangle\to|1\rangle$ transition). The energy of $|20\rangle$ is then $2\omega_1 + \delta_1$, while that of $|11\rangle$ is $\omega_1 + \omega_2$. The condition $\Delta_{12} \approx -\delta_1$ is equivalent to $\omega_1 + \delta_1 \approx \omega_2$, which makes $|20\rangle$ roughly degenerate with $|11\rangle$; for negative $\delta_1$ it requires $\omega_1 > \omega_2$. Similarly, choosing $\Delta_{12} \approx \delta_2$ brings $|02\rangle$ close to $|11\rangle$.
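
The resonance condition can be checked by evaluating the Duffing level energies directly from the Hamiltonian above. This is a small sketch with assumed parameter values (in GHz); `duffing_energy` is a hypothetical helper name.

```python
def duffing_energy(n, omega, delta):
    """Energy of level n for H = (omega - delta/2) n + (delta/2) n^2."""
    return (omega - delta / 2) * n + (delta / 2) * n**2

# Assumed illustrative values in GHz; delta is signed (negative for a transmon)
delta1, delta2 = -0.30, -0.32
omega2 = 5.0
omega1 = omega2 - delta1          # enforces Delta_12 = omega1 - omega2 = -delta1

E20 = duffing_energy(2, omega1, delta1) + duffing_energy(0, omega2, delta2)
E11 = duffing_energy(1, omega1, delta1) + duffing_energy(1, omega2, delta2)
E02 = duffing_energy(0, omega1, delta1) + duffing_energy(2, omega2, delta2)

print(f"E20 = {E20:.3f}, E11 = {E11:.3f}, E02 = {E02:.3f}")
```

With $\Delta_{12} = -\delta_1$ the levels $|20\rangle$ and $|11\rangle$ coincide, while $|02\rangle$ stays several hundred MHz away for these assumed values.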

The thesis describes (Figure 4.2) that for $\Delta_{12} \to -\delta_1$, the levels $|00\rangle$, $|11\rangle$, and $|20\rangle$ are close in energy in a rotating frame. These three states then form a subspace to which the dynamics can be largely confined, provided other levels are sufficiently far off resonance. The $|20\rangle \leftrightarrow |11\rangle$ coupling arises from the always-on $J$, which moves one excitation from qubit 1 to qubit 2. Likewise, $|11\rangle \leftrightarrow |02\rangle$ would couple resonantly if $\Delta_{12} \to \delta_2$. In what follows we assume $\Delta_{12}$ is tuned to the $|20\rangle$ resonance.

Now, the **drive**: We apply microwave drives to induce transitions within this low-energy manifold. In practice, one might drive both qubits simultaneously with appropriate amplitude and phase. The thesis writes the drive as:


$$
H_d(t) \;=\; \frac{\Omega_1}{2}\big(c_1\,e^{-i(\omega_d t+\phi)} + c_1^\dagger e^{i(\omega_d t+\phi)}\big) \;+\; \frac{\Omega_2}{2}\big(c_2\,e^{-i(\omega_d t+\phi)} + c_2^\dagger e^{i(\omega_d t+\phi)}\big)\,. 
$$


So both transmons are driven at the *same frequency* $\omega_d$ and phase $\phi$, but possibly with different amplitudes $\Omega_1, \Omega_2$. They choose $\omega_d$ near $(\omega_1+\omega_2)/2$, which is roughly the average frequency of the two qubits (and near half the energy of $|11\rangle$). Specifically $\omega_d = \frac{\omega_1 + \omega_2}{2} - \delta$ with a small detuning $\delta$. In a frame rotating at $\omega_d$, $|00\rangle$ and $|11\rangle$ become nearly degenerate (since $2\omega_d \approx \omega_1 + \omega_2$) and $|20\rangle$ and $|02\rangle$ shift accordingly. The drive can then induce transitions between these states.
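
To see the near-degeneracy in the rotating frame, one can subtract $(n_1+n_2)\,\omega_d$ from each bare Duffing energy. The sketch below assumes illustrative values in GHz and sets the small detuning $\delta$ to zero; the helper name is hypothetical.

```python
def rotating_frame_energy(n1, n2, w1, w2, d1, d2, wd):
    """Bare Duffing energy of |n1 n2> minus (n1 + n2) drive quanta."""
    bare = n1 * w1 + 0.5 * d1 * n1 * (n1 - 1) + n2 * w2 + 0.5 * d2 * n2 * (n2 - 1)
    return bare - (n1 + n2) * wd

w1, w2, d1, d2 = 5.3, 5.0, -0.30, -0.32   # assumed values in GHz
wd = 0.5 * (w1 + w2)                       # small detuning set to zero here

levels = {f"{a}{b}": rotating_frame_energy(a, b, w1, w2, d1, d2, wd)
          for a, b in [(0, 0), (0, 1), (1, 0), (1, 1), (2, 0), (0, 2)]}

for name, energy in levels.items():
    print(f"|{name}>: {energy:+.3f} GHz")
```

In this frame $|00\rangle$ and $|11\rangle$ sit at the same energy, and $|01\rangle$, $|10\rangle$ are placed symmetrically at $\mp\Delta_{12}/2$.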

The analysis of bSWAP is involved; the thesis indicates they performed a full diagonalization (numerical, using Mathematica) including up to 3 excitations to derive an effective Hamiltonian and gate unitary. However, the gist is:

* In the rotating frame at $\omega_d$, find the effective Hamiltonian for the subspace $\{|00\rangle, |11\rangle, |01\rangle, |10\rangle, |20\rangle, |02\rangle, \dots\}$.
* By appropriate choice of qubit detuning, isolate the $\{|00\rangle, |11\rangle, |20\rangle\}$ manifold (for example).
* Use a second SW or numerical diagonalization to eliminate coupling to outside states (treat $J$ as perturbation to first order, ignoring second-order shifts for simplicity as they did).
* The effective Hamiltonian in the truncated 2-qubit space (after eliminating leakage) has, in general, various terms: a ZZ term, an $IZ-ZI$ asymmetry term, and importantly an $XX \pm YY$ type term that causes Rabi oscillations between $|00\rangle$ and $|11\rangle$ (the hallmark of bSWAP).

Indeed, the thesis provides a matrix (Eq. 4.12) for the effective Hamiltonian in the $\{|00\rangle,|01\rangle,|10\rangle,|11\rangle\}$ basis after all these transformations. That matrix is parametrized by $\alpha_{IZ}, \alpha_{ZI}, \alpha_{ZZ}$ (single-qubit Stark shifts and ZZ coupling) and by coupling terms labeled $\Omega_S$ and $\Omega_B$. Specifically, $\Omega_S$ drives transitions in the single-excitation subspace ($|01\rangle \leftrightarrow |10\rangle$) and $\Omega_B$ drives **Bell-state oscillations** ($|00\rangle \leftrightarrow |11\rangle$). By adjusting the small detuning $\delta$, one can make $\alpha_{IZ} + \alpha_{ZI} = 0$, so that only the antisymmetric Stark shift $\propto (IZ - ZI)$ remains. The Hamiltonian then decomposes into commuting parts:

* $H_{ZZ} = \frac{\alpha_{ZZ}}{4} ZZ$ (a pure ZZ interaction term),
* $H_{SZ} = \frac{\alpha_{IZ}}{2}(IZ - ZI) + \frac{\Omega_S}{2}(XX + YY)$, which causes $|01\rangle \leftrightarrow |10\rangle$ iSWAP-like oscillations (mediated by $\Omega_S$) with an offset $\alpha_{IZ}$ (this is essentially a conditional $\sigma_x$ rotation on the single-excitation subspace),
* $H_{B} = \frac{\Omega_B}{4}\cos(2\phi)(XX - YY) + \frac{\Omega_B}{4}\sin(2\phi)(XY + YX)$, which is an $XX\pm YY$ type coupling between $|00\rangle$ and $|11\rangle$ (the two-photon drive part).

If the drive phase $\phi$ is set such that $\cos(2\phi) = 1, \sin(2\phi)=0$, then $H_B \propto XX - YY$, which acts like an $X$ rotation in the $\{|00\rangle,|11\rangle\}$ subspace. Indeed, $XX - YY = 2(|00\rangle\langle 11| + |11\rangle\langle 00|)$ in the two-qubit basis, so $H_B = \frac{\Omega_B}{2}(|00\rangle\langle 11| + |11\rangle\langle 00|)$: the two states are coupled with matrix element $\Omega_B/2$, and the populations of $|00\rangle$ and $|11\rangle$ oscillate at angular frequency $\Omega_B$.

They further compute the evolution operator $U_{\text{eff}}(t) = e^{-iH_{\text{eff}}t}$ and find it factorizes as $U_{ZZ} U_{SZ} U_B$ since $H_{ZZ}, H_{SZ}, H_B$ commute after that tuning. They give explicit matrix forms for $U_{SZ}(t)$ (showing how it entangles $|01\rangle$ and $|10\rangle$) and $U_B(t)$. Notably,


$$
U_B(t) \;=\; \begin{pmatrix}
\cos(\frac{\Omega_B t}{2}) & 0 & 0 & -i e^{-2i\phi}\sin(\frac{\Omega_B t}{2}) \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
- i e^{2i\phi}\sin(\frac{\Omega_B t}{2}) & 0 & 0 & \cos(\frac{\Omega_B t}{2})
\end{pmatrix}\,,
$$


in the basis $\{|00\rangle,|01\rangle,|10\rangle,|11\rangle\}$. This shows Rabi oscillations between $|00\rangle$ and $|11\rangle$ with rotation angle $\Omega_B t/2$ and a phase factor $e^{\pm 2i\phi}$. If one sets $\phi = 0$ for simplicity, then at time $t = \pi/\Omega_B$ the diagonal entries $\cos(\pi/2)$ vanish and the off-diagonal entries give $|00\rangle \to -i |11\rangle$ and $|11\rangle \to -i |00\rangle$, i.e. a full swap of $|00\rangle$ and $|11\rangle$ up to a phase of $-i$. Stopping at half that time gives the corresponding $\sqrt{\text{SWAP}}$-like operation within the $\{|00\rangle,|11\rangle\}$ subspace.

In the context of entangling gates, if you start with $|00\rangle$ and apply this for half the oscillation ($t = \pi/(2\Omega_B)$), you'll get an equal superposition $\frac{1}{\sqrt{2}}(|00\rangle - i|11\rangle)$ which is a Bell state. So bSWAP can produce a Bell state directly from $|00\rangle$ in one pulse, which is quite neat.
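
The Bell-state preparation can be reproduced numerically by exponentiating $H_B$. The following is a minimal sketch using NumPy only; the rate `omega_b` and phase `phi` are assumed illustrative values, and the helper names are hypothetical. The propagator is built from an eigendecomposition rather than a dedicated matrix-exponential routine.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)

def h_bell(omega_b, phi):
    """H_B in the basis |00>, |01>, |10>, |11> (first label = qubit 1)."""
    xx_minus_yy = np.kron(X, X) - np.kron(Y, Y)
    xy_plus_yx = np.kron(X, Y) + np.kron(Y, X)
    return omega_b / 4 * (np.cos(2 * phi) * xx_minus_yy
                          + np.sin(2 * phi) * xy_plus_yx)

def propagator(h, t):
    """exp(-i h t) for a Hermitian matrix h via its eigendecomposition."""
    evals, evecs = np.linalg.eigh(h)
    return (evecs * np.exp(-1j * evals * t)) @ evecs.conj().T

omega_b = 2 * np.pi * 0.01   # assumed Bell-Rabi rate (rad/ns)
phi = 0.3                    # assumed drive phase (rad)

ket00 = np.array([1, 0, 0, 0], dtype=complex)
psi_bell = propagator(h_bell(omega_b, phi), np.pi / (2 * omega_b)) @ ket00
u_swap = propagator(h_bell(omega_b, phi), np.pi / omega_b)

print(np.round(psi_bell, 4))   # equal-weight superposition of |00> and |11>
```

At $t=\pi/(2\Omega_B)$ the state has equal weight on $|00\rangle$ and $|11\rangle$, with relative phase $-i\,e^{2i\phi}$ on $|11\rangle$; at $t=\pi/\Omega_B$ the population is fully transferred.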

They provide the explicit formula for $\Omega_B$ (Eq. 4.13) which is quite complicated:


$$
\Omega_B \;=\; \frac{-2J\big(-J \Omega_1 \Omega_2(\delta_1 + \delta_2) + \Omega_2^2 \,2\delta_2(\delta_1 + \Delta_{12}) + \Omega_1^2\, \delta_1(\delta_2 - \Delta_{12})\big)}{(\delta_2 - \Delta_{12})(\delta_1 + \Delta_{12})(\Delta_{12}^2 - 4\delta_2)}\,.
$$


They note that this matches formulas given by Gambetta *et al.* in an appendix. $\Omega_B$ depends on the drive amplitudes $\Omega_1, \Omega_2$, the coupling $J$, and the detunings. In practice one would simplify the expression by choosing symmetric conditions (for example $\Omega_1 = \Omega_2$).

The key takeaways are: the bSWAP gate achieves a direct entangling interaction by driving the system at the frequency of the $|00\rangle \leftrightarrow |11\rangle$ two-photon transition. At $t = \pi/\Omega_B$ it produces a $\pi$ rotation in the $\{|00\rangle, |11\rangle\}$ subspace (a swap of those populations), while also generating some single-excitation dynamics that can be tuned out.

### 4.3.2 The MAP (Microwave-Activated Phase) Gate

The **MAP gate** (Microwave-Activated Controlled-Phase gate) is another two-transmon gate that uses higher levels. This gate, introduced by Chow *et al.*, induces a controlled-$Z$ (CZ) gate by using a drive to exploit an avoided crossing between certain non-computational levels.

The setup is again two transmons coupled via a resonator (or effectively directly). The specific tuning for MAP is such that the non-computational levels $|12\rangle$ (qubit 1 in $|1\rangle$, qubit 2 in $|2\rangle$) and $|03\rangle$ (qubit 1 in $|0\rangle$, qubit 2 in $|3\rangle$) are **degenerate**. That is achieved by setting the qubit frequency difference equal to twice the anharmonicity of qubit 2: $\omega_1 - \omega_2 = 2\delta_2$. Under that condition, the exchange term couples $|12\rangle$ and $|03\rangle$ with matrix element $\sqrt{3}J$: moving one excitation from qubit 1 to qubit 2 contributes a factor $\sqrt{1}$ from lowering qubit 1 and a factor $\sqrt{3}$ from raising qubit 2 into its third level (the usual $\sqrt{j+1}$ factor of ladder operators). The result is an avoided crossing in which the two levels repel by $\pm\sqrt{3}J$.

In the absence of any drive, the coupling lifts the degeneracy between $|12\rangle$ and $|03\rangle$: the two levels repel, so the energy of a multi-excitation state depends on how its excitations are distributed between the qubits. As a result, a transition of one qubit has a slightly different frequency depending on the state of the other. In other words, the level repulsion induces a **ZZ energy shift** (similar in spirit to the ZZ produced by a cross-Kerr or cross-resonance interaction).

The MAP gate approach is to drive a specific transition that causes a phase accumulation on $|11\rangle$. They apply a drive (with amplitude $\Omega$, presumably on both qubits or the cavity) near the frequency of certain transitions such that effectively a **ZZ interaction** is realized in the rotating frame.

The Hamiltonian in the lab frame (for the two-transmon system) can be given as before:


$$
H \;=\; \Big(\omega_1 - \frac{\delta_1}{2}\Big)c_1^\dagger c_1 + \frac{\delta_1}{2}(c_1^\dagger c_1)^2 + \Big(\omega_2 - \frac{\delta_2}{2}\Big)c_2^\dagger c_2 + \frac{\delta_2}{2}(c_2^\dagger c_2)^2 + J(c_1^\dagger c_2 + c_1 c_2^\dagger)\,. 
$$


The resonance condition follows from the bare level energies of this Hamiltonian:


$$
E_{|12\rangle} = \omega_1 + 2\omega_2 + \delta_2, \qquad E_{|03\rangle} = 3\omega_2 + 3\delta_2,
$$

Setting $\omega_1 - \omega_2 = 2\delta_2$ gives $E_{|12\rangle} = E_{|03\rangle}$.


They diagonalize the $\{|12\rangle, |03\rangle\}$ subspace with a simple $2\times 2$ rotation. With off-diagonal coupling $\sqrt{3}J$, the diagonalization yields energies $3\omega_2 + 3\delta_2 \pm \xi$ with:


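$$
\xi = \frac{1}{2}\Big(\sqrt{12 J^2 + \Delta_{|12\rangle-|03\rangle}^2} + \Delta_{|12\rangle-|03\rangle}\Big) = \sqrt{3}J \quad\text{for}\quad \Delta_{|12\rangle-|03\rangle} = 0,
$$

where $\Delta_{|12\rangle-|03\rangle}$ denotes the detuning between $|12\rangle$ and $|03\rangle$, which vanishes on resonance.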
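
Both the $\sqrt{3}$ matrix element and the resonant splitting can be verified with truncated ladder operators. This is a minimal sketch, assuming four levels per transmon and an illustrative coupling `J`; the helper names are hypothetical.

```python
import numpy as np

def lowering(dim):
    """Truncated ladder operator: c|n> = sqrt(n)|n-1>."""
    return np.diag(np.sqrt(np.arange(1, dim)), k=1)

DIM = 4                                   # levels |0>..|3> per transmon
c = lowering(DIM)
c1 = np.kron(c, np.eye(DIM))              # acts on transmon 1
c2 = np.kron(np.eye(DIM), c)              # acts on transmon 2
exchange = c1.T @ c2 + c1 @ c2.T          # real matrices, so .T is the adjoint

def ket(n1, n2):
    v = np.zeros(DIM * DIM)
    v[n1 * DIM + n2] = 1.0
    return v

element = ket(0, 3) @ exchange @ ket(1, 2)   # <03| (c1^† c2 + c1 c2^†) |12>

def xi(J, detuning):
    """Upper eigenvalue of the 2x2 block, measured from the |03> energy."""
    return 0.5 * (np.sqrt(12 * J**2 + detuning**2) + detuning)

J = 0.005                                  # assumed coupling in GHz
block = np.array([[0.0, element * J], [element * J, 0.0]])  # resonant 2x2 block
shifts = np.linalg.eigvalsh(block)

print(f"matrix element = {element:.4f}, level shifts = {shifts}")
```

On resonance the two eigenvalues are $\pm\sqrt{3}J$, so the total splitting of the avoided crossing is $2\sqrt{3}J$.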


So $\xi = \sqrt{3}J$ on resonance. This means $|12\rangle$ and $|03\rangle$ are not eigenstates; instead, the eigenstates are their symmetric and antisymmetric combinations, shifted by $\pm \xi$ (a total splitting of $2\xi$). The net effect is that **transitions such as $|02\rangle \leftrightarrow |12\rangle$ and $|01\rangle \leftrightarrow |11\rangle$ acquire slightly different frequencies**. This difference is encoded in a parameter $\zeta$ (the ZZ shift). The thesis defines $\zeta = \omega_{11} - \omega_{01} - \omega_{10}$, i.e. the energy of $|11\rangle$ minus the sum of the energies of $|01\rangle$ and $|10\rangle$ (all measured from $|00\rangle$), which vanishes for uncoupled qubits. If $\zeta \neq 0$, $|11\rangle$ has extra energy relative to $|01\rangle$ and $|10\rangle$ combined, which is exactly a measure of ZZ coupling. They derive in the appendix (Appendix D) that:


$$
\zeta = \zeta_0 + \frac{\Omega^2}{2\delta}\,\zeta_2,
$$

with
$$
\zeta_0 = \frac{2J^2(\Delta + 2\delta_1)}{\Delta(\Delta + \delta_1)}, \qquad \zeta_2 = \frac{3J^2}{3J^2 - \delta_2},
$$

to leading orders.


Here $\zeta_0$ is the static ZZ shift (present with no drive), and $\zeta_2$ sets the additional part induced by the drive of amplitude $\Omega$ and detuning $\delta$ (presumably $\delta = \omega_d - (\omega_2 + \delta_2)$, the detuning from the $|1\rangle \leftrightarrow |2\rangle$ transition of qubit 2). The thesis states that a drive nearly resonant with the $|n,2\rangle \leftrightarrow |n,1\rangle$ transitions induces this ZZ interaction.

A nonzero $\zeta$ means the $|11\rangle$ state picks up a phase relative to the other computational states over time. The MAP gate accumulates a phase of $\pi$ on $|11\rangle$ relative to $|00\rangle,|01\rangle,|10\rangle$: if $\zeta$ is the rate of that phase difference, one aims for $\zeta \Delta t = \pi$. The experiment is of Ramsey type: qubit 2 is put in a superposition, qubit 1 is prepared in either $|0\rangle$ or $|1\rangle$, the MAP interaction is driven for a time $\Delta t$, and an extra phase appears if qubit 1 was in $|1\rangle$. After $\Delta t = 514$ ns, a phase difference of $\pi$ has accumulated between the two cases, which realizes a controlled-$Z$ gate.
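
As a quick arithmetic check, the condition $\zeta\,\Delta t = \pi$ fixes the ZZ rate implied by the quoted gate time. The snippet below only assumes that $\zeta$ is an angular frequency and that the full $\pi$ phase is accumulated at the constant rate $\zeta$.

```python
import math

gate_time = 514e-9                     # quoted MAP gate duration in seconds
zeta = math.pi / gate_time             # angular ZZ rate in rad/s
zeta_mhz = zeta / (2 * math.pi) / 1e6  # same rate as an ordinary frequency in MHz

print(f"zeta / 2pi = {zeta_mhz:.3f} MHz")
```

This gives $\zeta/2\pi$ just under 1 MHz, i.e. $1/(2\Delta t)$.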

Finally, they present the effective Hamiltonian in the computational subspace after the MAP gate as (Eq. 4.15):


$$
H_{\text{eff}} = \omega_{10}\,|10\rangle\langle 10| + \omega_{01}\,|01\rangle\langle 01| + (\omega_{10} + \omega_{01} + \zeta)\,|11\rangle\langle 11| + \text{const.}
$$

This can be rewritten (ignoring global energy, and with the convention $\sigma_z|0\rangle = +|0\rangle$) as
$$
H_{\text{eff}} = -\frac{1}{2}\Big(\omega_{10} + \frac{\zeta}{2}\Big)\sigma_{z1} - \frac{1}{2}\Big(\omega_{01} + \frac{\zeta}{2}\Big)\sigma_{z2} + \frac{\zeta}{4}\sigma_{z1}\sigma_{z2}\,,
$$

so there is a ZZ term of strength $\zeta/4$.
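
The Pauli coefficients of a diagonal two-qubit Hamiltonian can be extracted by trace projection, $c_P = \mathrm{Tr}(P H)/4$. The sketch below does this for the projector form of $H_{\text{eff}}$, with assumed illustrative frequencies in GHz and the convention $Z|0\rangle = +|0\rangle$.

```python
import numpy as np

I2 = np.eye(2)
Z = np.diag([1.0, -1.0])                  # convention: Z|0> = +|0>

w10, w01, zeta = 5.0, 5.2, 0.001          # assumed values in GHz
# Basis order |00>, |01>, |10>, |11>; the first label is qubit 1
H = np.diag([0.0, w01, w10, w10 + w01 + zeta])

paulis = {"II": np.kron(I2, I2), "ZI": np.kron(Z, I2),
          "IZ": np.kron(I2, Z), "ZZ": np.kron(Z, Z)}
coeffs = {name: np.trace(p @ H) / 4 for name, p in paulis.items()}
H_rebuilt = sum(coeffs[name] * paulis[name] for name in paulis)

for name, value in coeffs.items():
    print(f"{name}: {value:+.6f}")
```

The `ZZ` coefficient comes out as $\zeta/4$, and summing the four terms reproduces the original diagonal matrix.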


When $\zeta \Delta t = \pi$, the unitary is $U = e^{-i \frac{\pi}{4} \sigma_{z1}\sigma_{z2}}$ up to single-qubit phases. That is a CZ gate: $e^{-i\frac{\pi}{4} ZZ} = \operatorname{diag}(e^{-i\pi/4}, e^{i\pi/4}, e^{i\pi/4}, e^{-i\pi/4})$, which equals $\operatorname{diag}(1,1,1,-1)$ after single-qubit $Z$ rotations and a global phase. They indeed mention achieving a $ZZ_{\pi}$ gate (which is a CZ) with $\Delta t = 514$ ns and a fidelity of about 87.6%.
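
The equivalence to CZ can be checked explicitly. The following sketch multiplies $e^{-i\frac{\pi}{4}ZZ}$ by the local rotation $e^{+i\frac{\pi}{4}Z}$ on each qubit and compares the result with $\operatorname{diag}(1,1,1,-1)$; the choice of this particular local correction is an assumption made for the illustration.

```python
import numpy as np

zz_diag = np.array([1, -1, -1, 1])                     # eigenvalues of Z⊗Z
u_zz = np.diag(np.exp(-1j * np.pi / 4 * zz_diag))      # exp(-i π/4 ZZ)

rz = np.diag(np.exp(1j * np.pi / 4 * np.array([1, -1])))  # exp(+i π/4 Z) on one qubit
local = np.kron(rz, rz)

cz = np.diag([1, 1, 1, -1]).astype(complex)
total = local @ u_zz
global_phase = total[0, 0]

print(np.round(total / global_phase, 6).real.diagonal())
```

The product equals CZ up to the global phase $e^{i\pi/4}$, whereas $e^{-i\frac{\pi}{4}ZZ}$ alone is not proportional to CZ.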

Thus, the MAP gate creates an *effective ZZ interaction* by driving near a higher-level resonance (the $|12\rangle$/$|03\rangle$ avoided crossing). It is an example of how *driving near a specific higher-level transition can induce a conditional phase without needing tunable flux bias*. However, leakage and decoherence limit the fidelity (87.6% was reported).

In conclusion, by expanding our perturbative analysis to multi-level transmons, we can design and understand two-qubit gates that utilize higher levels. The Schrieffer-Wolff method remains extremely useful: it allows us to derive effective Hamiltonians where desired interactions (like $XX-YY$ for bSWAP or $ZZ$ for MAP) appear explicitly, and unwanted interactions can be identified and minimized by parameter choice or echoed out. The trade-off is that including higher levels introduces more opportunities for *leakage* (population leaving the computational subspace) and *dephasing* on those levels. These are active areas of optimization (pulse shaping to minimize leakage, etc.).

Both bSWAP and MAP demonstrate that while the transmon’s weak anharmonicity complicates simple two-level gate models, it also provides **extra knobs** (additional resonances) that can be exploited to create entangling interactions that are otherwise forbidden in a purely harmonic system. The challenge is achieving high fidelity, which requires precision in meeting resonance conditions and mitigating decoherence during the gate.


# Transmons coupled via fixed frequency coupler

Below is a **complete Schrieffer–Wolff (SW) derivation** of the
fourth-order **ZZ-crosstalk term**
for two fixed-frequency transmons (qubits 1, 2) dispersively
coupled to a transmon coupler $c$, whose frequency is held fixed in this section.
Everything is written in the same Duffing–oscillator notation we adopted
earlier.
For clarity we keep **only the rotating-wave (excitation-preserving)**
parts of the capacitive coupling; the counter-rotating pieces are
suppressed by the large sum frequencies $\omega_j+\omega_c$ and are neglected.



## 0 Basic definitions

| symbol | meaning |
| --- | --- |
| $a_j^{(\dagger)}$ | annihilation (creation) operator of transmon $j\in\{1,2,c\}$ |
| $\omega_j$ | $0\!\to\!1$ transition frequency of $j$ |
| $\alpha_j>0$ | anharmonicity of $j$ (so $1\!\to\!2$ transition is $\omega_j-\alpha_j$) |
| $g_{jc}$ | capacitive exchange coupling between qubit $j$ and coupler |
| $\Delta_{jc}\equiv\omega_j-\omega_c$ | detuning of qubit $j$ from the coupler |
| $\Delta_{jc}^{(2)}=\Delta_{jc}+\alpha_j$ | anharmonicity-shifted detuning |
| $n_j=a_j^\dagger a_j$ | number operator of transmon $j$ |

We retain the first three levels
$|0\rangle,|1\rangle,|2\rangle$ of every transmon.



## 1 Hamiltonian split $H=H_0+V$

### 1.1 Unperturbed part (diagonal)

$$
H_0
=\sum_{j=1,2,c}
\Bigl[\,
  \omega_j\,n_j
  -\frac{\alpha_j}{2}\,n_j(n_j-1)
\Bigr].
\tag{1}
$$

### 1.2 Small off-diagonal part (exchange only)

$$
\boxed{\;
V
=\sum_{j=1,2} g_{jc}\bigl(a_j^\dagger a_c + a_j a_c^\dagger\bigr)
\;}
\tag{2}
$$

No direct 1–2 coupling is assumed;
all interaction is mediated by the coupler.

We work in the **dispersive regime**
$|\Delta_{jc}|,|\Delta_{jc}^{(2)}|\gg g_{jc}$
so that the coupler is only *virtually* excited.



## 2 Projectors and SW block structure

Define projectors

$$
P=\sum_{n_1,n_2}|n_1,n_2,0\rangle\langle n_1,n_2,0|,\qquad
Q=1-P,
$$

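i.e. $P$ keeps the **coupler ground manifold** and
$Q$ keeps all states with $n_c\ge1$.
The interaction $V$ changes $n_c$ by $\pm1$, so it has no matrix elements
within a sector of fixed $n_c$; in particular $PVP=0$, and to the order
considered here $V$ is treated as **block off-diagonal**.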

A unitary $U=e^{S}$ with **anti-Hermitian** generator $S=-S^\dagger$
is chosen such that

$$
\tilde{H}=e^{-S}He^{S}
\quad\text{obeys}\quad
P\tilde{H}Q=0=Q\tilde{H}P.
\tag{3}
$$

We expand $S=S^{(1)}+S^{(3)}+\dots$
(only odd orders are needed because $V$ has no block-diagonal part).



## 3 First-order generator $S^{(1)}$

The SW condition to first order is

$$
[H_0,S^{(1)}]=-V.
\tag{4}
$$

Approximating every energy denominator by the $0\leftrightarrow1$ detuning
$\Delta_{jc}$ of the qubit that exchanges an excitation with the
coupler, we can write

$$
\boxed{\;
S^{(1)}
=
\sum_{j=1,2}
\frac{g_{jc}}{\Delta_{jc}}
\Bigl(
  a_j a_c^\dagger
  -a_j^\dagger a_c
\Bigr)
\;}
\tag{5}
$$

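*(Check)*
Take the product eigenstates
$|\dots,n_j=1,n_c=0\rangle$ and
$|\dots,n_j=0,n_c=1\rangle$, whose energies differ by $\Delta_{jc}$.
Between them, the commutator
$[H_0,S^{(1)}]$ multiplies each matrix element of $S^{(1)}$ by this energy difference,
which cancels the denominator $\Delta_{jc}$ and leaves
$-g_{jc}\bigl(a_j^\dagger a_c + a_j a_c^\dagger\bigr)$,
hence (4) is satisfied.
(For matrix elements involving level $|2\rangle$ the energy difference also contains $\alpha_j$; these corrections are the origin of the $\Delta^{(2)}$ denominators in §5.)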
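
Condition (4) can also be solved and verified numerically, which avoids any approximation of the denominators. The sketch below builds $H_0$ and $V$ for three-level transmons with assumed illustrative parameters (GHz), solves $(E_a-E_b)\,S_{ab}=-V_{ab}$ element by element, and checks the commutator. All helper names are hypothetical.

```python
import numpy as np

DIM = 3                                            # levels |0>, |1>, |2>
a = np.diag(np.sqrt(np.arange(1, DIM)), k=1)       # truncated lowering operator
eye = np.eye(DIM)
num = a.T @ a

def embed(op, slot):
    """Place op on slot 0 (qubit 1), 1 (qubit 2) or 2 (coupler)."""
    mats = [eye, eye, eye]
    mats[slot] = op
    return np.kron(np.kron(mats[0], mats[1]), mats[2])

def index(n1, n2, nc):
    return (n1 * DIM + n2) * DIM + nc

# Assumed illustrative parameters in GHz
omega = [5.0, 5.2, 6.5]
alpha = [0.3, 0.3, 0.2]
g = [0.05, 0.06]

H0 = sum(w * embed(num, s) - 0.5 * al * embed(num @ (num - eye), s)
         for s, (w, al) in enumerate(zip(omega, alpha)))
ops = [embed(a, s) for s in range(3)]
V = sum(g[j] * (ops[j].T @ ops[2] + ops[j] @ ops[2].T) for j in range(2))

# Solve [H0, S1] = -V element by element: (E_a - E_b) S_ab = -V_ab
E = np.diag(H0)
gap = E[:, None] - E[None, :]
S1 = np.zeros_like(V)
mask = V != 0
S1[mask] = -V[mask] / gap[mask]

P = embed(np.diag([1.0, 0.0, 0.0]), 2)             # projector onto n_c = 0
residual = np.abs(H0 @ S1 - S1 @ H0 + V).max()
element = S1[index(1, 0, 0), index(0, 0, 1)]

print(f"max |[H0,S1] + V| = {residual:.2e}, <100|S1|001> = {element:.5f}")
```

The generator obtained this way is anti-Hermitian, and its matrix element between $|1,0,0\rangle$ and $|0,0,1\rangle$ has magnitude $g_{1c}/|\Delta_{1c}|$.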



## 4 Second-order effective Hamiltonian

$$
H^{(2)}=\frac12\,[V,S^{(1)}].
\tag{6}
$$

Compute the commutator term-by-term.

### 4.1 Useful elementary commutators

Because operators on **different** transmons commute,
only the $(a_j,a_j^\dagger)$ with the same index give non-zero
results.

For each $j$

$$
\begin{aligned}
[a_j^\dagger a_c,\;a_j a_c^\dagger]
&=
a_j^\dagger a_c a_j a_c^\dagger
-
a_j a_c^\dagger a_j^\dagger a_c
\\
&=
n_j
-
n_c ,
\end{aligned}
\tag{7}
$$

using $a_c a_c^\dagger=a_c^\dagger a_c+1$
and similarly for $a_j$.

For **distinct** $j\neq k$

$$
[a_j^\dagger a_c,\;a_k a_c^\dagger]
=
a_j^\dagger a_k .
\tag{8}
$$

(Here $[a_c,a_c^\dagger]=1$ has been used, so this identity holds
without any projection.)

### 4.2 Plugging into (6)

Separate diagonal ($j=k$) and off-diagonal ($j\neq k$) parts. For $j=k$ the commutator (7) arises twice, once from each term of $S^{(1)}$, which cancels the prefactor $\tfrac12$:

$$
\begin{aligned}
H^{(2)}
&=
\sum_{j}
\frac{g_{jc}^{2}}{\Delta_{jc}}
\bigl[n_j-n_c\bigr]
\;+\;
\frac12
\sum_{j\neq k}
\frac{g_{jc}g_{kc}}{\Delta_{kc}}
\,\bigl(a_j^\dagger a_k + a_j a_k^\dagger\bigr)
\\[4pt]
&\xrightarrow{P(\,\,)\,P}
\sum_{j}
\frac{g_{jc}^{2}}{\Delta_{jc}}\;n_j
\;+\;
\Bigl(
  \underbrace{
   \frac{g_{1c}g_{2c}}{2}
   \Bigl[\frac{1}{\Delta_{1c}}+\frac{1}{\Delta_{2c}}\Bigr]
  }_{\displaystyle J_{\text{eff}}}
 \Bigr)
 \bigl(
   a_1^\dagger a_2 + a_1 a_2^\dagger
 \bigr).
\end{aligned}
\tag{9}
$$

* First term ⇒ single-qubit Lamb shifts (can be absorbed into
  renormalised qubit frequencies $\tilde\omega_{1,2}$).
* Second term ⇒ **exchange coupling**
  $H_{XY}=J_{\text{eff}}(\sigma_1^+\sigma_2^-+\sigma_1^-\sigma_2^+)$.

No ZZ appears at this order.
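
The exchange rate can be compared with an exact diagonalization of the single-excitation manifold, where anharmonicities play no role. This is a minimal sketch with assumed values in GHz and two degenerate qubits, so that the splitting between the symmetric and antisymmetric qubit states should be close to $2J_{\text{eff}}$.

```python
import numpy as np

def j_eff(g1, g2, delta1, delta2):
    """Second-order exchange rate mediated by the coupler."""
    return 0.5 * g1 * g2 * (1 / delta1 + 1 / delta2)

w_q, w_c, g = 5.0, 6.0, 0.05     # GHz, assumed illustrative values
# Basis: |1,0,0>, |0,1,0>, |0,0,1>  (qubit 1, qubit 2, coupler)
H = np.array([[w_q, 0.0, g],
              [0.0, w_q, g],
              [g,   g,   w_c]])
evals = np.linalg.eigvalsh(H)    # ascending order

J = j_eff(g, g, w_q - w_c, w_q - w_c)
# For negative detuning the symmetric state lies lowest; the antisymmetric
# state decouples from the coupler and stays at w_q.
exact_splitting = evals[0] - evals[1]

print(f"2*J_eff = {2 * J:.6f} GHz, exact splitting = {exact_splitting:.6f} GHz")
```

The two numbers agree up to corrections of relative order $(g/\Delta)^2$, as expected for a second-order result.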



## 5 Third-order generator $S^{(3)}$

Because $V$ has no block-diagonal part,
$H_1\equiv V_{\text{diag}}=0,$
so the SW recursion gives

$$
[H_0,S^{(3)}]
=
-\frac13\,\underbrace{[\,V,S^{(1)}\,]}_{W}
\quad\Longrightarrow\quad
S^{(3)}=-\frac13\,\mathcal L^{-1}\bigl(W_{\text{off}}\bigr),
\tag{10}
$$

where $\mathcal L^{-1}$ divides each matrix element by its
energy difference $E_{NM}$.
*Practical rule*:
replace every string of operators that
**changes** the coupler excitation with an energy denominator.

Using (7)–(8) one shows that **$W$ contains two kinds of terms**:

1. Block-diagonal pieces $\propto n_j$ (already used in $H^{(2)}$);
2. Block-off-diagonal pieces of the form
   $\propto a_j^\dagger a_c$ or $a_j a_c^\dagger$
   multiplied by **number operators** $n_{k}$.
   Only these *off-diagonal* pieces enter $S^{(3)}$.

Carrying out the algebra gives (one line per qubit label):

$$
\boxed{\;
S^{(3)}
=
\sum_{j\neq k}
\frac{g_{jc}^{2}g_{kc}}{3\,\Delta_{jc}^{2}}
\Bigl(
  \frac{1}{\Delta_{kc}}
  -
  \frac{1}{\Delta_{kc}^{(2)}}
\Bigr)
\Bigl(
  n_j\,a_k^\dagger a_c - n_j\,a_k a_c^\dagger
\Bigr)
\;}
\tag{11}
$$

(The denominators $\Delta^{(2)}=\Delta+\alpha$ appear because
when $n_k=1$ is raised to $n_k=2$ the qubit energy differs by $\alpha_k$.)



## 6 Fourth-order effective Hamiltonian

$$
H^{(4)}
=
\frac12\,[\,V,S^{(3)}]
\;+\;
\frac{1}{8}\,[\,[\,V,S^{(1)}],\,S^{(1)}].
\tag{12}
$$

### 6.1 The double-commutator piece

$$
K\equiv[\,[V,S^{(1)}],S^{(1)}] .
$$

Because both commutators
**lower and then raise** the coupler,
$K$ is **block diagonal** already.
Using (7)–(8) repeatedly one obtains

$$
\boxed{\;
P K P
=\;2
\sum_{j\neq k}
g_{jc}^{2}g_{kc}^{2}
\Bigl(
 \frac{1}{\Delta_{jc}^{2}}
 \Bigl[
   \frac{1}{\Delta_{kc}}
   -
   \frac{1}{\Delta_{kc}^{(2)}}
 \Bigr]
\Bigr)
\Bigl(
 n_j n_k - n_k
\Bigr)
\;}
\tag{13}
$$

After dropping single-qubit terms
($n_k$ only) we keep
$\propto n_1 n_2$, the source of ZZ.

### 6.2 The $[V,S^{(3)}]$ piece

Using the explicit $S^{(3)}$ in (11) and the same commutator rules,
one finds

$$
P\,[V,S^{(3)}]\,P
=
-2
\sum_{j\neq k}
g_{jc}^{2}g_{kc}^{2}
\Bigl(
 \frac{1}{\Delta_{jc}^{2}}
 \Bigl[
   \frac{1}{\Delta_{kc}}
   -
   \frac{1}{\Delta_{kc}^{(2)}}
 \Bigr]
\Bigr)
\Bigl(
 n_j n_k - n_k
\Bigr).
\tag{14}
$$

### 6.3 Add the two pieces

$$
\begin{aligned}
H^{(4)}
&=
\frac12 P[V,S^{(3)}]P
\;+\;
\frac18 P K P
\\[6pt]
&=
-\frac34
\sum_{j\neq k}
g_{jc}^{2}g_{kc}^{2}
\Bigl(
 \frac{1}{\Delta_{jc}^{2}}
 \Bigl[
   \frac{1}{\Delta_{kc}}
   -
   \frac{1}{\Delta_{kc}^{(2)}}
 \Bigr]
\Bigr)
\Bigl(
 n_j n_k - n_k
\Bigr)
\;\;\bigl(\text{prefactor } \tfrac12\cdot(-2)+\tfrac18\cdot 2=-\tfrac34\bigr)
\\[6pt]
&\xrightarrow{\text{keep }n_1 n_2\text{ only}}
\;
\boxed{\;
 \frac{\zeta}{2}\; \sigma_1^{z}\sigma_2^{z}
 \quad\text{with}\quad
 \zeta\equiv
 g_{1c}^{2}g_{2c}^{2}
 \bigl(\Lambda_{1}+\Lambda_{2}-\Lambda_{1}^{\prime}-\Lambda_{2}^{\prime}\bigr)
 \;}
\end{aligned}
\tag{15}
$$

where the **coefficients** are exactly

$$
\Lambda_{1}=\frac{1}{\Delta_{1c}^{2}}
           \Bigl[\frac{1}{\Delta_{2c}}-\frac{1}{\Delta_{2c}^{(2)}}\Bigr],
\quad
\Lambda_{2}=\Lambda_{1}(1\!\leftrightarrow\!2),
\quad
\Lambda_{1}^{\prime}=\Lambda_{1}\bigl(\Delta_{2c}\!\to\!\Delta_{1c}\bigr),
\quad
\Lambda_{2}^{\prime}=\Lambda_{2}\bigl(\Delta_{1c}\!\to\!\Delta_{2c}\bigr).
\tag{16}
$$



## 7 Consistency checks

1. **Harmonic limit** ($\alpha_{1,2}\!\to\!0$):
   $\Delta^{(2)}\!\to\!\Delta$;
   every bracket $\bigl[\tfrac{1}{\Delta}-\tfrac{1}{\Delta^{(2)}}\bigr]$ vanishes ⇒ $\zeta=0$, as expected for linearly coupled harmonic oscillators.

2. **Exchange/ZZ scaling**:
   $\displaystyle J_{\text{eff}}\sim g^{2}/\Delta,
   \quad\zeta\sim g^{4}/\Delta^{3}$
   so $|\zeta/J_{\text{eff}}|\sim (g/\Delta)^{2}\ll1$.

3. **ZZ-null bias** exists because the two square brackets in (16) have
   opposite sign for
   $\Delta_{1c}$ above and $\Delta_{2c}$ below zero.
   Tuning $\omega_c$ appropriately can make
   $\Lambda_{1}+\Lambda_{2}=\Lambda_{1}^{\prime}+\Lambda_{2}^{\prime}\Rightarrow\zeta=0$.

These match Zhu *et al.* (arXiv 1210.1605) but here every step—
choice of $S^{(1)}$, full commutators, block projection—is
spelled out explicitly, leaving no gap in the SW construction.
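
Independently of the SW algebra, the ZZ shift can be obtained by exact diagonalization of the three-transmon Duffing model, $\zeta_{\text{num}} = E_{11}-E_{10}-E_{01}+E_{00}$, with dressed states labeled by their largest overlap with the bare states. The sketch below uses assumed illustrative parameters (GHz) and hypothetical helper names; the normalization of $\zeta_{\text{num}}$ is the energy-shift definition and need not match the prefactor convention of (15). It checks the $g^4$ scaling and the vanishing of the shift for zero anharmonicity.

```python
import numpy as np

DIM = 3
A = np.diag(np.sqrt(np.arange(1, DIM)), k=1)   # truncated lowering operator
EYE = np.eye(DIM)

def embed(op, slot):
    """Place op on slot 0 (qubit 1), 1 (qubit 2) or 2 (coupler)."""
    mats = [EYE, EYE, EYE]
    mats[slot] = op
    return np.kron(np.kron(mats[0], mats[1]), mats[2])

def hamiltonian(omega, alpha, g):
    h = np.zeros((DIM**3, DIM**3))
    for s in range(3):
        n = embed(A.T @ A, s)
        h += omega[s] * n - 0.5 * alpha[s] * (n @ n - n)
    ac = embed(A, 2)
    for j in range(2):
        aj = embed(A, j)
        h += g[j] * (aj.T @ ac + aj @ ac.T)    # excitation-preserving exchange
    return h

def zz_shift(omega, alpha, g):
    evals, evecs = np.linalg.eigh(hamiltonian(omega, alpha, g))
    def dressed(n1, n2):
        bare = (n1 * DIM + n2) * DIM           # coupler in its ground state
        return evals[np.argmax(np.abs(evecs[bare, :]))]
    return dressed(1, 1) - dressed(1, 0) - dressed(0, 1) + dressed(0, 0)

omega = (5.0, 5.2, 6.5)                        # qubit 1, qubit 2, coupler
alpha = (0.3, 0.3, 0.2)
zeta_small = zz_shift(omega, alpha, (0.02, 0.02))
zeta_large = zz_shift(omega, alpha, (0.04, 0.04))
zeta_harmonic = zz_shift(omega, (0.0, 0.0, 0.0), (0.04, 0.04))

print(f"zeta(g=0.02) = {zeta_small:.3e}, zeta(g=0.04) = {zeta_large:.3e}")
print(f"ratio = {zeta_large / zeta_small:.2f}, harmonic limit = {zeta_harmonic:.1e}")
```

Doubling both couplings multiplies the shift by roughly 16, consistent with the fourth-order scaling in check 2, and the shift vanishes to numerical precision when all anharmonicities are set to zero.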



### **Final effective Hamiltonian to fourth order**

$$
\boxed{\;
H_{\text{eff}}
=
\tilde{\omega}_1\,n_1 + \tilde{\omega}_2\,n_2
\;+\;
J_{\text{eff}}\bigl(\sigma_1^{+}\sigma_2^{-}+\sigma_1^{-}\sigma_2^{+}\bigr)
\;+\;
\frac{\zeta}{2}\,\sigma_1^{z}\sigma_2^{z}
\;}
$$

with
$\displaystyle
 J_{\text{eff}}  =\tfrac{g_{1c}g_{2c}}{2}\bigl(\tfrac{1}{\Delta_{1c}}+\tfrac{1}{\Delta_{2c}}\bigr),$
and $\zeta$ given in (15)–(16).
All quantities follow directly from the SW algebra
up to fourth order—no shortcuts through energy-shift formulas.


# Tunable coupler

When the coupler is a flux-tunable transmon, its frequency

$$
\omega_c(\Phi)=\sqrt{\,8E_{C,c}E_{J,\Sigma}\,\bigl|\cos\tfrac{\pi\Phi}{\Phi_0}\bigr|\,}-E_{C,c}
\tag{0.1}
$$

(and therefore every detuning $\Delta_{jc}\equiv\omega_j-\omega_c(\Phi)$) depends continuously on the external flux $\Phi$.
We will show algebraically that the ZZ coefficient
$\zeta(\Phi)$ coming out of the SW series can be tuned *through zero* by choosing an appropriate $\Phi^\star$.
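
The flux dependence (0.1) is straightforward to tabulate. The following sketch assumes illustrative values $E_{C,c}=0.2$ GHz and $E_{J,\Sigma}=30$ GHz; the absolute value around the cosine is an added assumption that keeps the square root real for $|\Phi|>\Phi_0/2$, and the function name is hypothetical.

```python
import math

def coupler_frequency(flux_ratio, ec, ej_sum):
    """omega_c for flux_ratio = Phi / Phi_0 (symmetric-SQUID form of Eq. 0.1)."""
    return math.sqrt(8 * ec * ej_sum * abs(math.cos(math.pi * flux_ratio))) - ec

ec, ej_sum = 0.2, 30.0                       # GHz, assumed illustrative values
w1, w2 = 5.0, 5.2                            # qubit frequencies in GHz (assumed)

for flux_ratio in (0.0, 0.1, 0.2, 0.3):
    wc = coupler_frequency(flux_ratio, ec, ej_sum)
    print(f"Phi/Phi0 = {flux_ratio:.1f}: omega_c = {wc:.3f} GHz, "
          f"Delta_1c = {w1 - wc:+.3f}, Delta_2c = {w2 - wc:+.3f}")
```

For these numbers the coupler starts above both qubits at zero flux and moves down through the qubit frequencies as the flux increases, so both detunings change sign along the way.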


## 0 Basic set-up and notation (unchanged)

We keep the Duffing Hamiltonian, identical couplings $g_{jc}$ as before, and again retain three levels of every transmon.
All symbols in the table of §0 of the static derivation remain valid, except that **detunings are now flux-dependent**

$$
\boxed{\;
\Delta_{jc}(\Phi)=\omega_j-\omega_c(\Phi),
\quad
\Delta_{jc}^{(2)}(\Phi)=\Delta_{jc}(\Phi)+\alpha_j
\;}
\tag{0.2}
$$



## 1 Hamiltonian split $H=H_0(\Phi)+V$

Exactly as in Eqs. (1)–(2) of the static derivation, but with $\omega_c\to\omega_c(\Phi)$.  Nothing changes algebraically for $V$.



## 2 Projectors, first-order generator $S^{(1)}(\Phi)$

Because the *structure* of $H_0$ and $V$ is untouched,
all operator manipulations of §§2–3 carry over verbatim;
the only change is the replacement $\Delta_{jc}\to\Delta_{jc}(\Phi)$ throughout:

$$
\boxed{\;
S^{(1)}(\Phi)=
\sum_{j=1,2}
\frac{g_{jc}}{\Delta_{jc}(\Phi)}
\bigl(a_j^\dagger a_c-a_j a_c^\dagger\bigr)
\;}
\tag{1.1}
$$

(the first-order SW condition $[H_0,S^{(1)}]=-V$ is still satisfied).



## 3 Second-order effective Hamiltonian

Identical commutators ⇒

$$
H^{(2)}(\Phi)=
\sum_{j}\frac{g_{jc}^{2}}{2\Delta_{jc}(\Phi)}\,n_j
\;+\;
J_{\text{eff}}(\Phi)\,
\bigl(a_1^\dagger a_2+a_1 a_2^\dagger\bigr),
\tag{2.1}
$$

with

$$
\boxed{\;
J_{\text{eff}}(\Phi)=
\frac{g_{1c}g_{2c}}{2}
\Bigl[\frac{1}{\Delta_{1c}(\Phi)}+\frac{1}{\Delta_{2c}(\Phi)}\Bigr]
\;}
\tag{2.2}
$$

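(just as before, but now tunable with $\Phi$).
At a bias $\Phi=\Phi_{\!\text{dec}}$ satisfying
$\displaystyle\frac{1}{\Delta_{1c}}+\frac{1}{\Delta_{2c}}=0$,
i.e. $\Delta_{1c}=-\Delta_{2c}$ or $\omega_c(\Phi_{\!\text{dec}})=\tfrac12(\omega_1+\omega_2)$,
the **exchange coupling is nulled at this order.**
The perturbative expansion remains valid at that bias only if $|\omega_1-\omega_2|/2\gg g_{jc}$.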
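
A minimal sketch of locating the decoupling bias, assuming the symmetric-SQUID form of Eq. (0.1) and hypothetical parameters in GHz (function and variable names are illustrative, not part of the derivation). The condition $1/\Delta_{1c}+1/\Delta_{2c}=0$ places the coupler midway between the qubits; the flux is then obtained by inverting Eq. (0.1) on the branch $0\le\Phi\le\Phi_0/2$.

```python
import math


def j_eff(omega_c, omega_1, omega_2, g_1c, g_2c):
    """Second-order exchange coupling of Eq. (2.2) at coupler frequency omega_c."""
    d1 = omega_1 - omega_c
    d2 = omega_2 - omega_c
    return 0.5 * g_1c * g_2c * (1.0 / d1 + 1.0 / d2)


def flux_for_frequency(omega_c, ec_c, ej_sum):
    """Invert Eq. (0.1) for Phi/Phi_0 on the branch 0 <= Phi/Phi_0 <= 1/2."""
    cos_val = (omega_c + ec_c) ** 2 / (8.0 * ec_c * ej_sum)
    if cos_val > 1.0:
        raise ValueError("target frequency is above the coupler tuning range")
    return math.acos(cos_val) / math.pi


# Hypothetical device parameters in GHz, for illustration only.
omega_1, omega_2 = 4.8, 5.6
g_1c = g_2c = 0.03
ec_c, ej_sum = 0.2, 25.0

# 1/Delta_1c + 1/Delta_2c = 0  <=>  coupler at the midpoint of the two qubits.
omega_dec = 0.5 * (omega_1 + omega_2)
phi_dec = flux_for_frequency(omega_dec, ec_c, ej_sum)

print("omega_c at decoupling:", omega_dec)
print("Phi_dec / Phi_0      :", phi_dec)
print("J_eff at decoupling  :", j_eff(omega_dec, omega_1, omega_2, g_1c, g_2c))
```

The example parameters keep $|\Delta_{jc}|=0.4$ GHz well above $g_{jc}=0.03$ GHz at the decoupling point, so the dispersive expansion is still reasonable there.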



## 4 Third-order generator $S^{(3)}(\Phi)$

All algebra from §5 of the static derivation stands, giving

$$
\boxed{\;
S^{(3)}(\Phi)=
\sum_{j\neq k}
\frac{g_{jc}^{2}g_{kc}}{3\,\Delta_{jc}^{2}(\Phi)}
\Bigl[
  \frac{1}{\Delta_{kc}(\Phi)}-
  \frac{1}{\Delta_{kc}^{(2)}(\Phi)}
\Bigr]
\bigl(
  n_j\,a_k^\dagger a_c-n_j\,a_k a_c^\dagger
\bigr)
\;}
\tag{3.1}
$$



## 5 Fourth-order block-diagonal piece  $H^{(4)}(\Phi)$

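Using exactly the same commutator identities and collecting the terms of order $g^{4}$
(the double commutator $[[V,S^{(1)}],S^{(1)}]$ is only of order $g^{3}$ and is already consumed by the condition that fixes $S^{(3)}$),

$$
H^{(4)}(\Phi)
=
\frac12[V,S^{(3)}(\Phi)]
\;-\;
\frac1{24}\Bigl[\bigl[\,[V,S^{(1)}(\Phi)],\,S^{(1)}(\Phi)\bigr],\,S^{(1)}(\Phi)\Bigr].
$$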

Project onto $n_c=0$ and keep only the two-qubit term $n_1n_2$:

$$
\boxed{\;
\frac{\zeta(\Phi)}{2}\,\sigma_1^{z}\sigma_2^{z},
\qquad
\zeta(\Phi)=
g_{1c}^{2}g_{2c}^{2}\,
\Bigl(\Lambda_{1}+\Lambda_{2}-\Lambda_{1}^{\prime}-\Lambda_{2}^{\prime}\Bigr)_{\!(\Phi)}
\;}
\tag{4.1}
$$

where, exactly as before but with flux-dependence,

$$
\begin{aligned}
\Lambda_{1}(\Phi)&=\frac{1}{\Delta_{1c}^{2}(\Phi)}
\Bigl[\frac{1}{\Delta_{2c}(\Phi)}
      -\frac{1}{\Delta_{2c}^{(2)}(\Phi)}\Bigr],\\
\Lambda_{2}(\Phi)&=\Lambda_{1}(\Phi)\bigl(1\!\leftrightarrow\!2\bigr),\\
\Lambda_{1}^{\prime}(\Phi)&=\Lambda_{1}(\Phi)
                               \bigl(\Delta_{2c}\!\to\!\Delta_{1c}\bigr),\\
\Lambda_{2}^{\prime}(\Phi)&=\Lambda_{2}(\Phi)
                               \bigl(\Delta_{1c}\!\to\!\Delta_{2c}\bigr).
\end{aligned}
\tag{4.2}
$$
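
Equations (4.1)–(4.2) translate directly into a few lines of code. The sketch below evaluates $\zeta$ as a function of the coupler frequency; the function names and the numerical values (in GHz) are hypothetical and serve only to make the substitution rules in (4.2) explicit.

```python
def bracket(delta, alpha):
    """Square bracket of Eq. (4.2): 1/Delta - 1/(Delta + alpha)."""
    return 1.0 / delta - 1.0 / (delta + alpha)


def zeta(omega_c, omega, alpha, g):
    """ZZ coefficient of Eq. (4.1); omega, alpha, g are (qubit 1, qubit 2) pairs."""
    d1, d2 = omega[0] - omega_c, omega[1] - omega_c
    f1, f2 = 1.0 / d1**2, 1.0 / d2**2
    b1, b2 = bracket(d1, alpha[0]), bracket(d2, alpha[1])
    lam1 = f1 * b2        # Lambda_1
    lam2 = f2 * b1        # Lambda_2  = Lambda_1 with 1 <-> 2
    lam1p = f1 * b1       # Lambda_1' = Lambda_1 with Delta_2c -> Delta_1c
    lam2p = f2 * b2       # Lambda_2' = Lambda_2 with Delta_1c -> Delta_2c
    return (g[0] * g[1]) ** 2 * (lam1 + lam2 - lam1p - lam2p)


# Hypothetical parameters in GHz.
omega = (4.8, 5.6)
alpha = (-0.25, -0.25)
g = (0.03, 0.03)
for omega_c in (6.0, 6.5, 7.0):
    print(omega_c, zeta(omega_c, omega, alpha, g))
```

Written this way, the four terms visibly combine into the product $(\Delta_{1c}^{-2}-\Delta_{2c}^{-2})$ times the difference of the two brackets, which is a convenient cross-check on any implementation.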



## 6 Tuning the ZZ coefficient to zero

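Set $\zeta(\Phi^\star)=0$.
Because each $\Delta_{jc}(\Phi)=\omega_j-\omega_c(\Phi)$ is *monotone* in $\Phi$ over half a flux period, $0\le\Phi\le\Phi_0/2$
(and $\alpha_j$ is constant),
Eq. (4.1) is a *single scalar equation* for the bias $\Phi^\star$, or equivalently for the coupler frequency $\omega_c(\Phi^\star)$.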

### 6.1 Interpretation

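* Each square bracket in (4.2) equals
  $\alpha_j/\bigl[\Delta_{jc}(\Delta_{jc}+\alpha_j)\bigr]$.
  For a transmon ($\alpha_j<0$) it is **positive** only in the window
  $0<\Delta_{jc}<|\alpha_j|$ and **negative** otherwise.
* The combination in (4.1) factorizes:
  $\Lambda_{1}+\Lambda_{2}-\Lambda_{1}^{\prime}-\Lambda_{2}^{\prime}$
  is the product of $\bigl(\Delta_{1c}^{-2}-\Delta_{2c}^{-2}\bigr)$
  and the difference of the two square brackets.
  The two big groups $\Lambda_{1}+\Lambda_{2}$ and
  $\Lambda_{1}^{\prime}+\Lambda_{2}^{\prime}$ are therefore equal
  whenever either factor vanishes.
* The first factor changes sign at $\Delta_{1c}=-\Delta_{2c}$, so whenever
  the coupler can be tuned to the midpoint between the two fixed-frequency
  qubits there is a crossing point $\Phi^\star$ where the net ZZ of (4.1) cancels.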

### 6.2 Closed-form condition (analytic)

Writing

$$
f_j(\Phi)\;=\;\frac{1}{\Delta_{jc}^{2}(\Phi)}
\quad\text{and}\quad
g_j(\Phi)\;=\;
\frac{1}{\Delta_{jc}(\Phi)}-
\frac{1}{\Delta_{jc}^{(2)}(\Phi)},
$$

the condition $\zeta=0$ reads

$$
f_1\,g_2+f_2\,g_1=f_1\,g_1+f_2\,g_2.
\tag{6.1}
$$

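Collecting terms, Eq. (6.1) factorizes as

$$
\boxed{\;
\bigl[f_1(\Phi^\star)-f_2(\Phi^\star)\bigr]\,
\bigl[g_1(\Phi^\star)-g_2(\Phi^\star)\bigr]=0
\;}
\tag{6.2}
$$

so $\zeta$ vanishes on two branches.
The branch $f_1=f_2$ requires $\Delta_{1c}=-\Delta_{2c}$ when $\omega_1\neq\omega_2$, i.e. $\omega_c=\tfrac12(\omega_1+\omega_2)$.
The branch $g_1=g_2$ is a quadratic equation in $\omega_c$; for equal anharmonicities $\alpha_1=\alpha_2=\alpha$ it reduces to $\omega_c=\tfrac12(\omega_1+\omega_2+\alpha)$.
The corresponding flux $\Phi^\star$ follows by inverting (0.1) once the
device parameters $\{\omega_{1,2},\alpha_{1,2},E_{J,\Sigma},E_{C,c}\}$ are known.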
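
A minimal sketch of the numerical solution, working directly from Eqs. (4.1)–(4.2) in terms of the coupler frequency (the flux then follows from Eq. (0.1)). The helper names, the scan interval and the parameter values (GHz) are hypothetical. The scan interval is chosen to avoid the poles at $\Delta_{jc}=0$ and $\Delta_{jc}^{(2)}=0$, and plain bisection is used to keep the example dependency-free.

```python
def zz_shape(omega_c, omega, alpha):
    """Lambda_1 + Lambda_2 - Lambda_1' - Lambda_2' of Eq. (4.2)."""
    d1, d2 = omega[0] - omega_c, omega[1] - omega_c
    f1, f2 = 1.0 / d1**2, 1.0 / d2**2
    g1 = 1.0 / d1 - 1.0 / (d1 + alpha[0])   # bracket g_1 of Sec. 6.2
    g2 = 1.0 / d2 - 1.0 / (d2 + alpha[1])   # bracket g_2 of Sec. 6.2
    return f1 * g2 + f2 * g1 - f1 * g1 - f2 * g2


def bisect(func, lo, hi, tol=1e-12):
    """Plain bisection; assumes func(lo) and func(hi) differ in sign."""
    f_lo = func(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def find_zz_nulls(omega, alpha, lo, hi, steps=37):
    """Scan [lo, hi] for sign changes of the ZZ shape and refine each one.

    The interval must not contain a pole (Delta_jc = 0 or Delta_jc + alpha_j = 0).
    """
    grid = [lo + (hi - lo) * k / steps for k in range(steps + 1)]

    def shape(omega_c):
        return zz_shape(omega_c, omega, alpha)

    roots = []
    for left, right in zip(grid, grid[1:]):
        if shape(left) * shape(right) < 0.0:
            roots.append(bisect(shape, left, right))
    return roots


# Hypothetical parameters in GHz; poles lie at 4.55, 4.8, 5.35 and 5.6.
nulls = find_zz_nulls(omega=(4.8, 5.6), alpha=(-0.25, -0.25), lo=4.85, hi=5.30)
print(nulls)
```

For these example values the scan finds two nulls in the pole-free window, one at the qubit midpoint and one displaced from it by half the anharmonicity.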

### 6.3 Special symmetric case

If the qubits are symmetric
($\omega_1=\omega_2,\;\alpha_1=\alpha_2,\;g_{1c}=g_{2c}$)
then $\Delta_{1c}(\Phi)=\Delta_{2c}(\Phi)$ at *every* flux bias.
Hence $f_1=f_2$ **and** $g_1=g_2$ identically ⇒ the terms in (4.1) cancel
pairwise ⇒ $\zeta(\Phi)=0$ for all $\Phi$ at this order.
The exchange coupling, by contrast, becomes
$J_{\text{eff}}=g_{1c}^{2}/\Delta_{1c}(\Phi)$, which has no zero at finite
detuning, so the symmetric case has no zero-exchange point.



## 7 Resulting effective Hamiltonian (flux-tuned)

At an arbitrary bias $\Phi$ the two-qubit Hamiltonian is

$$
H_{\text{eff}}(\Phi)=
\tilde\omega_1(\Phi)\,n_1+\tilde\omega_2(\Phi)\,n_2
\;+\;
J_{\text{eff}}(\Phi)\,
(\sigma_1^+\sigma_2^-+\sigma_1^-\sigma_2^+)
\;+\;
\frac{\zeta(\Phi)}{2}\,\sigma_1^{z}\sigma_2^{z},
\tag{7.1}
$$

with $\tilde\omega_j(\Phi)=\omega_j+g_{jc}^{2}/(2\Delta_{jc}(\Phi))$.

* At $\Phi=\Phi_{\!\text{dec}}$: $J_{\text{eff}}=0$
  (idling qubits decoupled).
* At $\Phi=\Phi^\star$: $\zeta=0$
  (no residual ZZ crosstalk).
* In practice one biases *between* these points to trade off minimal
  ZZ against a desired static coupling, or uses *flux modulation*
  around $\Phi_{\!\text{dec}}$ to create *parametric* gates while the
  *average* ZZ remains near zero.
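
To connect Eq. (7.1) with what a spectroscopy experiment would see, the sketch below computes its four eigenenergies in closed form. The conventions $n_j=|1\rangle\langle1|$ and $\sigma^z=|0\rangle\langle0|-|1\rangle\langle1|$, as well as all names and numbers, are assumptions made for this illustration.

```python
import math


def two_qubit_energies(w1, w2, j_eff, zeta):
    """Eigenenergies of Eq. (7.1) in the two-qubit subspace.

    Assumed conventions: n_j = |1><1| and sigma_z = |0><0| - |1><1|.
    Returns (E00, E_minus, E_plus, E11); E_minus and E_plus are the two
    single-excitation eigenstates mixed by the exchange term.
    """
    e00 = 0.5 * zeta
    e11 = w1 + w2 + 0.5 * zeta
    mean = 0.5 * (w1 + w2) - 0.5 * zeta
    split = math.hypot(0.5 * (w1 - w2), j_eff)
    return e00, mean - split, mean + split, e11


def zz_shift(w1, w2, j_eff, zeta):
    """Conditional frequency shift E11 - E_plus - E_minus + E00."""
    e00, e_minus, e_plus, e11 = two_qubit_energies(w1, w2, j_eff, zeta)
    return e11 - e_plus - e_minus + e00


# Hypothetical dressed frequencies and couplings in GHz.
print(two_qubit_energies(4.8, 5.6, 0.002, 1e-4))
print(zz_shift(4.8, 5.6, 0.002, 1e-4))
```

With the normalization $\tfrac{\zeta}{2}\sigma_1^{z}\sigma_2^{z}$ used in (7.1), the conditional shift equals $2\zeta$ and does not depend on $J_{\text{eff}}$, because the exchange term only redistributes energy within the single-excitation pair.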


### **Take-away**

The SW algebra is identical to the fixed-frequency derivation;
only the detunings acquire flux dependence.
Because the combination in (4.1) factorizes into
$\bigl(\Delta_{1c}^{-2}-\Delta_{2c}^{-2}\bigr)$ times the difference of the
two anharmonicity brackets in (4.2), $\zeta$ changes sign wherever one of
these factors vanishes. Sweeping $\omega_c(\Phi)$ across such a point passes through a bias
$\Phi^\star$ where the two groups of virtual paths cancel exactly,
giving $\boxed{\zeta(\Phi^\star)=0}$.
This is precisely the operating principle behind the
“ZZ-free tunable coupler” used in many contemporary superconducting
processors.
