# Erasure Minesweeper: exploring hybrid-erasure surface code architectures for efficient quantum error correction

Jason D. Chadwick\*, Mariesa H. Teo\*, Joshua Viszlai\*, Willers Yang\*, and Frederic T. Chong

\*Department of Computer Science, University of Chicago

Chicago, IL, USA

\*Authors contributed equally; listed alphabetically

Abstract—Dual-rail erasure qubits can substantially improve the efficiency of quantum error correction, allowing lower error rates to be achieved with fewer qubits, but each erasure qubit requires  $3 \times$  more transmons to implement compared to standard qubits. In this work, we introduce a hybrid-erasure architecture for surface code error correction where a carefully chosen subset of qubits is designated as erasure qubits while the rest remain standard. Through code-capacity analysis and circuitlevel simulations, we show that a hybrid-erasure architecture can boost the performance of the surface code-much like how a game of Minesweeper becomes easier once a few squares are revealed-while using fewer resources than a full-erasure architecture. We study strategies for the allocation and placement of erasure qubits through analysis and simulations. We then use the hybrid-erasure architecture to explore the trade-offs between per-qubit cost and key logical performance metrics such as threshold and effective distance in surface code error correction. Our results show that the strategic introduction of dual-rail erasure qubits in a transmon architecture can enhance the logical performance of surface codes for a fixed transmon budget, particularly for near-term-relevant transmon counts and logical error rates.

Index Terms—quantum error correction, erasure qubits, surface code

#### I. INTRODUCTION

Quantum error correction (QEC) is expected to be necessary to achieve the low error rates of  $10^{-12}$  to  $10^{-18}$  needed to enable powerful applications of quantum computing [1]–[6].

This work is funded in part by the STAQ project under award NSF Phy-232580; in part by the US Department of Energy Office of Advanced Scientific Computing Research, Accelerated Research for Quantum Computing Program; and in part by the NSF Quantum Leap Challenge Institute for Hybrid Quantum Architectures and Networks (NSF Award 2016136), in part based upon work supported by the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers, and in part by the Army Research Office under Grant Number W911NF-23-1-0077. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation herein. This work was completed in part with resources provided by the Research Computing Center of the University of Chicago.

FTC is the Chief Scientist for Quantum Software at Infleqtion and an advisor to Quantum Circuits, Inc.



Fig. 1. Viewing a hybrid-erasure architecture as a game of Minesweeper. *Top:* In a standard surface code, we receive syndrome bits (red and blue 1s) indicating that an error string terminates nearby. The decoder must find a valid explanation (red and blue flags) based only on this information. If the decoder guesses wrong, a logical error can occur. *Bottom:* When some data qubits are *erasure qubits*, we gain extra information before decoding (black bombs and green checks on left) that help us decide which error string to predict, avoiding logical errors.

QEC offers the promise of exponential error suppression with increasing code size once physical qubit errors are below a certain *threshold*. The surface code is a leading approach to quantum error correction due to its relatively high threshold, compatibility with planar nearest-neighbor qubit connectivity, and ease of decoding [7].

While typical noise models used to simulate and evaluate QEC assume that physical one- and two-qubit errors are random, uniformly distributed Pauli operators, significant gains in QEC efficiency can be found under alternative noise models [8]–[13]. For example, under a biased noise model where

<sup>\*</sup>jchadwick@uchicago.edu

<sup>\*</sup>mhteo@uchicago.edu

<sup>\*</sup>viszlai@uchicago.edu

<sup>\*</sup>willers@uchicago.edu

the relative strengths of Pauli X and Z errors are orders of magnitude different, the XZZX surface code can drastically improve the noise threshold compared to the standard surface code [8]. Another type of non-standard error model is heralded erasure, in which the information in a physical qubit is completely lost, but the time and location of the error are known. A surface code made up of erasure qubits, in which the primary error mechanism is heralded erasure, can have a higher threshold and, crucially, twice the effective distance: a surface code patch with  $d \times d$  data qubits can correct d-1 heralded erasure errors on ideal erasure qubits, but only  $\lfloor (d-1)/2 \rfloor$  errors under a standard Pauli error model. This gives erasure qubits a significant scaling advantage over standard unbiased gubits, leading to significant interest in the proposal [14]–[17] and realization [18]–[26] of erasure qubits in various hardware modalities.

Dual-rail erasure qubits in superconducting transmons [15], [16] are a particularly promising approach to implementing erasure qubits, with several recent experimental realizations [20], [22], [26]. A dual-rail qubit is encoded in two transmons such that a relaxation error (the dominant error in individual transmons) in either transmon takes the system out of the qubit space; an *erasure check* can be performed regularly to determine whether this has happened, and if so, the qubit is reset and the information is sent to the decoder. Some dual-rail implementations require a third transmon to perform this erasure check, bringing the relative cost of a single dual-rail erasure qubit to  $3\times$  that of a standard qubit.

There is therefore a trade-off between increased complexity of each physical qubit and improved QEC efficiency. Several factors may impose limits on the number of transmons that can be used for each logical qubit in the surface code. First, fabrication defects are a considerable hurdle for large-scale transmon-based quantum computers, as the yield (proportion of fully functional chips without defects) decreases with increasing transmon count per chip. The defect rate in current industrial devices is proprietary, but is commonly estimated to be between  $10^{-3}$  and  $2 \times 10^{-2}$  [27]–[29]. Although modular approaches are expected to mitigate this problem [30], it may be desirable for each logical qubit to be fully contained on a single chip if chip-to-chip connections are restricted [31]. Second, once yield is addressed, there will still be limits to the maximum number of transmons that can be placed into a single dilution refrigerator due to heating concerns [32]–[35]; larger transmon counts will require distributed multi-fridge approaches that will have reduced inter-fridge communication capabilities [36].

In this work, we investigate a hybrid-erasure architecture for implementing surface codes in transmons, where physical qubits of the surface code are composed of a mixture of standard and dual-rail erasure qubits. As key benefits of erasure come from the additional error information, optimizing such a hybrid-erasure architecture rests on the allocation of erasure qubits—deciding how many erasure qubits to use and where to place them—so the additional information can best be exploited to reach a target logical error performance.



Fig. 2. A d=7 rotated surface code patch, consisting of  $d^2=49$  data qubits (black) and  $d^2-1=48$  ancilla qubits (white) in a degree-four planar grid. X and Z stabilizers are represented by light and dark plaquettes, indicating weight-two or weight-four parity checks. The  $\bar{X}$  and  $\bar{Z}$  logical operators are highlighted in light and dark blue lines at the boundary. In the "Erasure Minesweeper" game of Figure 1, data qubits are potential bomb tiles and ancilla qubits are tiles with numbers.

Erasure qubits supply the decoder with additional information on where the errors have occurred (and more importantly, have not occurred), allowing the decoder to more efficiently distinguish topologically distinct error chains. The objective is similar to that of Minesweeper, where tiles are revealed strategically to deduce the locations of bombs (physical errors). To this end, we first study the surface code in a code-capacity model, allowing us to determine an optimized placement strategy. Then, through more accurate circuit-level simulations, we evaluate the QEC performance of a hybrid-erasure surface code using this placement heuristic and study the trade-offs between key metrics such as effective distance, threshold, and logical error rates. We find that our placement heuristic results in a hybrid-erasure chip that achieves better logical error rate than a homogeneous all-standard or all-erasure qubit chips for certain relevant transmon-count budgets.

The remainder of the paper is structured as follows. In Section II, we introduce erasure qubits and surface code error correction. Thus far, studies of erasure for error correction have assumed a homogeneous, all-erasure layout. We extend the prior work by introducing a hybrid-erasure architecture for surface code error correction in Section III, and analyze, in a code-capacity model, the impact of design choices such as allocation and placement on the logical performance. In Section IV, we describe our circuit-level simulation framework. In Section V, we discuss the results of these circuit-level evaluations and analyze transmon costs, chip yield, and logical performance for varying system sizes using a hybrid-erasure chip versus homogeneous all-standard or all-erasure chips. We also explore alternative placement strategies in Section VI under circuit-level noise and compare simulation results to the predicted behavior from the code-capacity analysis. Finally, in Section VII, we discuss conclusions and future work.



Fig. 3. Decoding an example set of errors (a) in the surface code. (b) Syndrome information available in a standard, erasure-free surface code. (c) Decoder prediction resulting in a logical Z error. (d) Syndrome + erasure information available in a hybrid erasure architecture with two rows/columns of erasure qubits. (e) Decoder prediction resulting in no logical errors. The lack of erasure detection on the erasure qubits near the boundary critically help the decoder differentiate the correct error chain.

#### II. BACKGROUND

#### A. The surface code

A distance-d surface code (Figure 2) encodes one logical qubit in the entangled state of  $d^2$  physical qubits on a square grid. The logical states correspond to the eigenstates of a commuting group of  $d^2-1$  X-type and Z-type parity check operators. These check operators have weight 2 or 4, meaning they're supported on 2 or 4 physical qubits, and can be measured using only 2D local interactions with measurement ancillae interleaved with the data qubits. The logical observables  $\bar{X}$  and  $\bar{Z}$  of the logical qubit are encoded non-locally as the parity of physical Xs and Zs on a path crossing the lattice, the minimum lengths of which grow as we increase the size of the lattice.

A physical error can be detected by periodically measuring the parity check operators and corrected if a decoder can determine a valid set of physical errors that match the measurement outcomes. The outcome of the parity check operators is often referred to as the error syndrome. In the absence of physical errors, the syndrome is invariant; and when a chain of physical errors occurs, the checks on the ends of the error chain will be flipped, yielding a non-trivial syndrome. The decoder attempts to correct the error by finding the most likely explanation given the error syndrome. Due to the non-local encoding of the logical observables as a lattice-traversing path, an error chain can cause a logical error only if it traverses the lattice, which can occur if the decoder guesses incorrectly and applies an invalid correction. We denote the probability of a single

physical error as p and the probability of a logical error as  $p_L$ .

The relationship between p and  $p_L$  of an error-correcting code depends critically on two metrics: the threshold  $p_{th}$ , which characterizes the minimum physical error rate p required to achieve suppression of  $p_L$  with increasing code size, and the effective distance  $d_{\rm eff}$ , which characterizes how sensitive  $p_L$  is to p.

#### B. Erasure qubits

An erasure qubit is a physical qubit whose primary error mechanism is a heralded erasure, which is a depolarizing error that occurs at a known time. A surface code implemented using ideal erasure qubits is known to experience large improvements in threshold and effective distance [13], [37], [38]. In practice, the degree of this scaling advantage depends on the physical implementation of the erasure qubit [12], [13]. Here, we briefly review several prominent proposals for erasure qubit implementations, designed to convert the dominant errors in the hardware into erasure errors.

Dual rail transmon qubits, which are the focus of this work, comprise two superconducting transmons coupled together [15]. The logical computational basis states are given by

$$|\overline{0}\rangle = \frac{|01\rangle + |10\rangle}{\sqrt{2}} , \qquad |\overline{1}\rangle = \frac{|01\rangle - |10\rangle}{\sqrt{2}}$$
 (1)

In each transmon, amplitude damping errors dominate, which bring the dual rail erasure qubit to the  $|00\rangle$  state. If an erasure qubit is detected to be in this state, it can be reset, returning it to the computational basis [39]–[43]. In total, this



Fig. 4. Examples of hybrid-erasure architecture. (a)  $\mathcal{A}(5,0.6,P_r)$  where the placement  $P_r=\{1,2,5,...,22\}$  is sampled at random. (b)  $\mathcal{A}(7,0.57,P^*)$ . Since  $f_e\times d^2=27.93$ , only 27 erasures are allocated, with 24 erasures placed in 2 rows/columns, and the remaining 3 placed greedily, all as close to the center as possible.

implementation requires two transmons to make up the dualrail, and an additional one for the erasure check, bringing its total physical qubit cost to 3 per erasure qubit. We note that some proposals have suggested using a superconducting cavity for readout [15], but for simplicity here our resource estimates are quoted in terms of the three-transmon implementation.

Beyond transmons, erasure qubit implementations have been proposed and demonstrated in dual-rail superconducting cavities, where dominant photon loss errors are detected and converted into erasures [16], [20], [21], [25]. There has also been significant effort in both theory and experiment in developing erasure qubits in neutral atoms, and it is estimated that up to 98% of leakage outside the qubit subspace can be converted into erasure [14], [18]. A similar scheme has been suggested for erasure conversion in trapped ion systems [17]. We focus on dual-rail qubits in this work because they are fabricated from the same components as standard superconducting qubits, so we can make a direct cost comparison between them. However, much of our findings can also be generalized to other hybrid erasure schemes.

# III. ANALYSIS OF A HYBRID-ERASURE SURFACE CODE

We consider surface code implemented on a hybrid-erasure architecture  $A(d, f_e, P)$ , where

- $d \in \{3, 5, ...\}$  is the surface code distance,
- $f_e \in [0,1]$  upper-bounds the fraction of erasure qubits,
- $P \subset [d^2]$  specifies the subset of data qubits (indexed by  $[d^2] = \{1,...,d^2\}$ ) designated as erasures.

Note that the two edge cases,  $\mathcal{A}(d,0,0)$  and  $\mathcal{A}(d,1,[d^2])$ , correspond to a distance-d surface code on all standard qubits and all erasure qubits, respectively.

We perform analysis in the code-capacity model, in which errors can occur on data qubits but syndrome measurements are assumed to be perfect (and there is no propagation of error between qubits). Section VI-C confirms that our analysis closely matches circuit-level simulation results.

### A. Summary of results

We analyze the performance of surface codes on a hybriderasure architecture  $\mathcal{A}(d, f_e, P^*)$  in a code-capacity model,



Fig. 5. Error correction on the repetition code with heralded erasure noise. (a) We consider a distance-7 repetition code with k=3 erasure qubits, at indices  $q_2,q_3,q_6$ . (b) Suppose we encounter 3 physical X errors, triggering a detection event on  $q_3$ . (c) A physical error on  $q_2$  may trigger the detection event, but does not flip any syndromes; this could be due to the deplorizing error being projected onto the code space as a Z error or an identity, each with probability 25%. (d) There are two ways to match the syndromes. Since there's no detection even on  $q_6$ , we know the error must be the error chain not containing  $q_6$ .

and prove that with a strategic placement  $P^*$ , the effective distance achievable is at least

$$d_{\text{eff}} \ge \left\lfloor \frac{d \times (2 - \sqrt{1 - f_e}) + 1}{2} \right\rfloor - \epsilon d,$$
 (2)

where  $\epsilon d \approx -d/\log p < 1$  for small codes. Indeed, when  $\epsilon k < 1$ , Equation 2 would predict  $d_{\rm eff} \geq \lfloor \frac{d+1}{2} \rfloor$  for surface code with standard qubits  $d_{\rm eff} \geq \lfloor \frac{2d+1}{2} \rfloor = d$  for a surface code with erasure qubits, as expected.

To optimize the placement strategy  $P^*$ , we prioritize achieving the optimal effective distance and then greedily place the remaining qubits to maximize impact on logical error rate. For a given number of erasure qubits, the optimal effective distance is achieved by placing the erasure qubits in k full rows and columns, starting from the center, for the largest k possible (using  $2kd-k^2$  qubits). We will prove in Section III-D that this guarantees the effective distance given in Equation 2. The remaining erasures are placed greedily to minimize the average Manhattan distance from the lattice center. See Figure 4 for an example. The greedy heuristic follows from an error path counting argument detailed in Section III-C.

#### B. The repetition code: a toy model

We will first examine a repetition code as a toy example. Suppose our repetition code has d data qubits with physical error rate p, k of which are designated as erasures, and we are only interested in protecting against bit-flip (X-type) errors. When a heralded erasure occurs, the erasure qubit is reinitialized as a maximally mixed state, which is equivalent to applying a fully depolarizing error. After measuring the Z observables, the heralded erasure error channel applies I, X, Y, or Z with 1/4 probability each, but only X and Y contribute to bit-flip errors. A maximum likelihood decoder [44] is used to determine the error chain. We will calculate the rate of a logical bit-flip of the repetition code assuming ideal syndrome extraction and assuming that each data qubit experiences a bit-flip (or a heralded erasure error if it is an erasure qubit) independently with probability p.

First, note that there are always *two possible explanations* for any syndrome patterns, which partition the data qubits into

two disjoint subsets. See Figure 5(d) for an example. This is because, on a 1D chain, each flipped check can only be matched to the left or to the right, and fixing the direction for any one check fixes the entire matching.

Secondly, we observe that when k>0, a logical error occurs only when heralded erasure errors occur on all erasure qubits (otherwise, a maximum-likelihood decoder never makes mistakes, since the error chain cannot pass through the clean erasure qubits). This happens with probability  $p^k$ . It remains to find the logical error rate in this case.

Suppose an error E contains  $l=l_d+l_e$  qubits ( $l_d$  data qubits and  $l_e$  erasure qubits). We have the following:

$$\mathbb{P}[E] = p^{l_d} \times (1-p)^{d-k-l_d} \times (\frac{1}{2})^{l_e} \times (\frac{1}{2})^{k-l_e},$$

$$\Longrightarrow \frac{\mathbb{P}[E]}{\mathbb{P}[E^c]} = (\frac{p}{1-p})^{2l_d-d+k}.$$

A logical error occurs if  $\mathbb{P}[E] < \mathbb{P}[E^c]$ , or,  $\mathbb{P}[E]/\mathbb{P}[E^c] < 1$ , which implies:

$$(2l_d - d + k) \log \frac{p}{1 - p} < 0 \implies l_d \ge \lfloor \frac{d - k + 1}{2} \rfloor.$$

Hence, to leading order in p and assuming  $p \ll d$ , the probability of a logical error given E is

$$p_{L} = p^{k} \times \mathbb{P}\left[l_{d} \ge \lfloor \frac{d - k + 1}{2} \rfloor\right]$$

$$\le p^{k} \times \frac{d + k}{2} \times p^{\lfloor \frac{d - k + 1}{2} \rfloor}$$

$$\lesssim p^{k + \lfloor \frac{d - k + 1}{2} \rfloor}$$

$$= p^{\lfloor \frac{d + k + 1}{2} \rfloor},$$

We can verify that the edge cases k=0 and k=d give the correct effective distances  $d_{\rm eff}$ .

We can find  $p_L$  exactly by summing over E:

$$p_L = \mathbb{P}_E[\mathbb{P}[E] < \mathbb{P}[E^c]] = \sum_{l_d \in [d-k]} \sum_{l_e \in [k]} \binom{d-k}{l_d} \binom{k}{l_e}$$
$$\times p^{l_d} (1-p)^{d-k-l_d} (\frac{1}{2})^k$$
$$\times \chi[2l_d < d-k]$$

where  $\chi$  is the indication function. We have verified that the approximation remains sufficiently accurate to the exact value for larger distances.

## C. Impact of erasure placement on logical error rate

Unlike in 1D, where ways to match a syndrome pattern are always unique, there may exist exponentially many explanations for a syndrome pattern on a 2D surface code, making it challenging to generalize code-capacity level calculations. We call these the lattice traversing paths. In this section, we study the behaviors of small surface codes by enumerating all lattice traversing paths, and show that a logical failure can be approximately decomposed into logical errors along lattice traversing paths. This allows us to infer the correlation between



Fig. 6. The fraction of error strings (X or Z) that contain each physical qubit in a d=7 surface code.

an erasure qubit's placement and its impact on the logical error rate, justifies the row-and-column placement strategy, and allows us to carry out capacity-level analysis for effective distances in surface codes, as detailed in Section III-D.

A lattice traversing path is a sequence of 2D-connected nodes that starts and ends at two boundaries of a (possibly rectangular) lattice. The number of such paths across an  $n \times m$  lattice,  $P_{m,n}$ , can be calculated with a recursive formula in polynomial time, and is often known as the number of unique ways a king can cross a  $n \times m$  chessboard [45].

We can interpret logical errors on a surface code in a dynamic view as a king making its way across a chessboard. Indeed, a logical error occurs on a surface code only when there is a connected chain of physical errors connecting opposite boundaries; we can upper-bound the overall logical error using a union bound across logical errors corresponding to lattice traversing paths, and consider the effect of erasure allocation on each lattice traversing path separately. We will only include minimum-length paths in our analysis, since we are most interested in leading-order effects on the logical error rates.

Restricted to a minimum-weight lattice traversing path, logical errors on a lattice traversing path resemble those of the simple repetition code, where the impacts of erasure qubits on logical error rate and effective distance are more readily understood. This simplifying assumption allows us to quantify the influence of an erasure qubit by studying its importance on an ensemble of isolated lattice-crossing paths, suggesting a simple recipe for determining the optimal placement of an erasure qubit: if we can determine the benefit an erasure configuration P brings to each lattice crossing path, we can bound its impacts on all logical errors to leading order. While this is difficult for arbitrary placements, we point out a few observations by studying lattice traversing paths on small codes that lead to our optimized placement strategy.

We show in Figure 6 a histogram of how frequently each qubit in a distance 7 surface code is contained in a logical error. Immediately, we observe that the qubits in the middle are included in up to  $3\times$  more paths than on the edge. We derive our closest-to-center placement strategy from this observation. Secondly, a more subtle point is that, since each lattice path either must traverse all columns or all rows, filling an additional row and column with erasures necessarily guarantees that

each lattice path encounters an additional erasure qubit. The effective distance of the code will be guaranteed to increase by the same analysis given in Section III-B. It is important to note that, while in general we should place erasures where they encounter more traversing paths (in the center), we should not prioritize this over filling additional columns/rows, since, to leading order, the logical error performance is constricted by the error with the shortest fault distance.

## D. Surface code effective distance with hybrid-erasures

Finally, we can find the total logical error rate of the surface code on a hybrid- $\mathcal{A}(d,f_e,P)$  architecture by decomposing a logical error into lattice traversing paths. Let k be the largest integer such that  $2kd-k^2 \leq f_ed^2$ . Then, we can place the erasure such that at least k rows and columns are completely filled, such that any lattice traversing paths must cross at least k erasure qubits.

Let E be a set of physical errors and let the recovery operator calculated by the decoder be denoted as Dec(E). Then, a logical error occurs when  $E \oplus Dec(E) = P$  for some logical operator  $P \in \mathcal{P}_L$ . Focusing on only the minimum weight logical errors to obtain an estimation to first order in p, we have:

$$p_L = \mathbb{P}_E \left[ \bigcup_{P \in \mathcal{P}} E + Dec(E) = P \right]$$
 (3)

$$\leq \sum_{P \in \mathcal{P}} \sum_{E \in P} \mathbb{P}[E|P] \times \mathbb{P}_E[\mathbb{P}[E] < \mathbb{P}[P - E]] \tag{4}$$

where notice that regardless of P,  $\sum_{E\in P}\mathbb{P}[E|P]\times\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[E]=\mathbb{P}[$ 

We can now arrive at the approximation of Equation 2:

$$\begin{split} p_L &\lesssim 2^d \times p^{\lfloor \frac{d+k+1}{2} \rfloor} \\ \Longrightarrow \ d_{\mathrm{eff}} &\gtrsim \lfloor \frac{d+k+1}{2} \rfloor + d \log_p 2 \\ &\geq \left\lfloor \frac{d \times (2 - \sqrt{1-f_e}) + 1}{2} \right\rfloor - \epsilon d, \end{split}$$

where  $\epsilon=-d/\log_2 p$ , and the last line follows from solving the quadratic equation  $2kd-k^2\leq f_ed^2\leq 2(k+1)d-(k+1)^2$ .

#### IV. CIRCUIT-LEVEL EVALUATION METHODOLOGY

While our analysis in a code capacity model sets expectations of the logical performance for varying erasure fractions and erasure placements, performance in a real device is better predicted via a circuit-level model. This captures error propagation, time-like differentiation of errors, and ancilla errors that arise due to simulating the underlying error-prone stabilizer circuit with a specific CNOT schedule.

| Error mechanism             | Standard qubit | Dual-rail erasure qubit |
|-----------------------------|----------------|-------------------------|
| Initialization error        | p              | 2p                      |
| Readout error               | p              | $2p$                    |
| Single-qubit (H) gate error | p/10           | p                       |
| Two-qubit (CX) gate error   | p              | p                       |
| Erasure check error         | -              | p                       |
| TABLE I                     |                |                         |

SINGLE-PARAMETER NOISE MODEL USED IN EVALUATIONS

#### A. Noise model

To evaluate QEC performance, it is convenient to define a noise model that depends only on a single parameter p. This assumes that, as hardware fidelities continue to improve, the relative error rates of different error mechanisms remain the same. Table I shows the noise parameters used in our evaluations, unless otherwise stated.

The transmon qubit noise model is a standard model in which initialization, readout, and two-qubit gates have error rate p, and single-qubit gates have a lower error rate p/10. To convert to a dual-rail error model, we consider the number of transmons involved in each operation. A dual-rail initialization or readout must be performed correctly on two transmons simultaneously, so these error rates double to 2p. A dual-rail single-qubit gate is in fact a two-transmon gate, so this error rate becomes p. A dual-rail two-qubit gate only involves one transmon from each dual-rail qubit [15], so this error rate remains p. Finally, erasure checks must be performed on the dual-rail qubit. Although current experimental demonstrations of this operation are relatively error-prone [22], we assume that the error can be reduced in the future to be on par with other operations, to an error rate of p.

We assume that we can perform a two-qubit gate between a dual-rail qubit and a standard transmon qubit. We believe this is not unreasonable, as transmon control is quite flexible and there is no fundamental barrier to such an operation. We assume that such a gate would also have error rate p, and an error in the gate would simultaneously erase the dual-rail and fully depolarize the transmon.

In this work, we assume that all erasure checks are perfect, such that there is never a false positive or false negative detection event. The impact of imperfect erasure checks has been explored in other works [46], but accounting for imperfect erasure checks would significantly increase the complexity and computational cost of our simulations, so here we choose to simplify this aspect of the noise model so that we can focus on studying the interesting features of hybrid-erasure architectures without making additional assumptions on more noise parameters. Similarly, we consider the case where an erasure check is done after every time step in the circuit for simplicity, though the effects of different erasure check schedules have also been studied in previous work [39].

# B. Simulation and decoding

We perform circuit-level simulations using Stim [47]. Errors are added upon initialization, readout, single-qubit gates, two-qubit gates, and erasure checks according to the noise model





Fig. 7. Logical memory performance of surface code of varying distance and varying fraction of erasure qubits, distributed according to our optimized heuristic. *Top:* For fixed code distance d=7, increasing the fraction of erasure qubits improves  $d_{\rm eff}$ , evidenced by a steeper slope. *Bottom:* Increasing the fraction of erasure qubits also increases  $p_{\rm th}$ , the crossing point for different distances.



Fig. 8. Effective distance  $d_{\rm eff}$  of surface codes at various erasure qubit fractions. Optimized placement yields higher effective distance for intermediate erasure fractions. Lines show linear interpolation between (d+1)/2 and d.



Fig. 9. Extracted surface code thresholds for random and optimized placement for different erasure fractions. Optimized placement leads to more consistent improvements in threshold.

in Table I. One- and two-qubit gate errors on erasure qubits are modeled as heralded detections followed by a replacement of the qubit(s) with a maximally mixed state. The resulting circuit-level effect is an erasure flag along with  $\{I,X,Y,Z\}$  errors with probability  $\frac{1}{4}$  for each affected qubit.

Decoding is performed using PyMatching [48]. To account for the additional knowledge from heralded errors on erasure qubits, the decoding graph is modified for each shot prior to decoding. Each gate error on an erasure qubit corresponds to a set of circuit-level errors E correlated with a flag f. Depending on the sampled value of f, we make one of two modifications to the decoding graph. 1) If f=0, we remove the edges corresponding to the errors E from the decoding graph. 2) If f=1, we update edge weights for each error  $e\in E$  as follows:

$$w_e = \log \left[ \left( \sum_{e' \in E} p_{e'} - p_e \right) / p_e \right]$$

where  $p_e$  is the probability of error e derived from the noise model in Table I. See Figure 3 for an example.

# C. Calculating effective distance and threshold

To determine the effective distance and threshold of the code for a given experiment, we first manually identify an approximate threshold (where lines corresponding to different code distances cross in a physical vs. logical error rate plot) and consider only the points below the threshold. For the data corresponding to a particular code distance, we fit these points to the function  $p_L = A(b \cdot x)^{d_{\rm eff}}$  to find parameters A, b, and  $d_{\rm eff}$ . Then, we calculate the threshold  $p_{\rm th}$  as the average location where these fitted lines for different distances cross each other.

## D. Limitations

In this work, we make several hardware and noise model assumptions that shape our findings. Firstly, our transmon cost scaling results depend on the assumption that an erasure qubit requires three transmons. In practice, transmon cost depends on erasure qubit implementation. In particular, if a cavity is used for readout as mentioned previously, the transmon



Fig. 10. Achievable logical error rate for hybrid-erasure surface codes as a function of transmon count, for several fixed physical error rates. Hybrid architectures outperform all-standard or all-erasure architectures for certain transmon budgets.



Fig. 11. Calculating expected fabrication yield for a chip containing the number of transmons needed for each surface code instance (see Figure 10). Assuming a defect rate of  $10^{-3}$  per transmon.

cost of an erasure qubit would be 2 instead of 3. Secondly, we assume in our noise model a specific ratio of erasure qubit error rates against standard qubit error rates (Table I), which our results are sensitive to. Quantitatively, reducing the cost of the erasure qubit or the ratio of erasure qubit-standard qubit error rates would contract the regions where we see hybrid architectures outperforming architectures with no erasure qubits or all erasure qubits – however, qualitatively the analysis would remain the same, showing some parameter regions where the hybrid approach could achieve a target logical error rate with fewer transmons.

More generally, the applicability of our results is limited by our choices in the noise model. As mentioned previously, we assume for simplicity that our erasure checks are spatially and temporally perfect, which is not hardware-realistic. Similarly, we assume that all errors in our erasure qubits are heralded erasures – in reality, while heralded erasures are the dominant source of error in an erasure qubit, there is also some probability of unheralded Pauli errors. We leave further exploration of these and other changes to the noise model to future work.



Fig. 12. Scaling of costs in Figure 10 for larger systems using an all erasure architecture, a no erasure architecture, or a hybrid erasure architecture. All hybrid points sweep over and select the best erasure fraction. Shaded region represents area of possible hybrid erasure values in between analytical upper bound and approximate observed circuit-level lower bound. (Top) Minimum possible logical error rate for fixed transmon cost. Background gradient indicates resulting single chip yield for a defect rate of  $10^{-3}$ . (Bottom) Minimum possible number of transmons for a target logical error rate.

# V. EVALUATING HYBRID-ERASURE ARCHITECTURES

# A. Effective distance and threshold

We first evaluate the logical error rate of surface codes with varying fractions of erasure qubits, distributed according to our optimized heuristic. The results are shown in Figure 7 and the key metrics of interest are the effective code distance  $d_{\rm eff}$  (the magnitude of the slope of each line) and the threshold error rate  $p_{\rm th}$  (the point where lines of the same erasure fraction cross). We observe both  $d_{\rm eff}$  and  $p_{\rm th}$  increasing as the fraction of erasure qubits increases, interpolating between the standard surface code and the expected values for a full erasure qubit architecture [14], [39], [46].

In Figure 8, we plot the extracted effective distance fits for code distances 3, 5, and 7 at various erasure qubit fractions. We see a steady improvement in  $d_{\rm eff}$  as the erasure fraction increases. Additionally, we see that the optimized erasure qubit placement heuristic significantly outperforms random placement for intermediate erasure fractions, indicating the importance of erasure qubit location in hybrid architectures.



Fig. 13. Effect of spatial placement of erasure qubits in a d=7 surface code. Color indicates correlation between logical error rate and presence of an erasure qubit at each location in the  $7 \times 7$  grid of data qubits. Correlations are shown for  $\bar{Z}$  (top),  $\bar{X}$  (middle) and combined (bottom) error rates. "Checkerboard" patterns are evident in  $\bar{Z}$  and  $\bar{X}$  data but disappear for combined data.

Figure 9 shows the extracted surface code thresholds for varying erasure qubit fraction, similarly showing a steady improvement as we add more erasure qubits. Again, a random placement of erasure qubits significantly underperforms compared to the optimized heuristic.

#### B. Comparing transmon counts

Dual-rail transmon qubits are particularly relevant when considering a hybrid-erasure architecture because they are built from the same components as standard transmon qubits. This makes a transmon-based hybrid-erasure architecture particularly compelling. While a typical transmon qubit consists of a single transmon, a dual-rail qubit instead requires three transmons [12]. This is important because the yield is determined by the transmon count of a chip [27], so reducing the transmon count can improve the scalability of manufacturing. Additionally, the number of control lines scales with the number of transmons, so heating concerns are also dependent on the transmon count rather than the qubit count.

In this context, we examine the transmon cost of hybriderasure architectures in Figure 10 for several fixed physical error rates. These can be viewed as vertical slices through Figure 7, with transmon count calculated as a function of code distance and erasure fraction. For a fixed code distance and physical error rate, we see the expected increase in transmon count and improvement in logical error rate as erasure fraction increases. For certain fixed transmon budgets, e.g. from 70 to 100 or from 150 to 190 transmons, a hybrid-erasure design gives a better logical error rate than can be achieved with a full-erasure design.

For a fixed chance of any transmon being defective (the defect rate  $\epsilon$ ), we can translate transmon count n into chip yield  $y=(1-\epsilon)^n$ , where we consider a chip to be operational only if no constituent transmons are defective. This effectively flips the x-axis, as shown in Figure 11. Again, we can that hybrid-erasure schemes allow us to interpolate between the standard qubit surface code and the erasure qubit surface code, allowing us to balance yield and logical error rate, as may be necessary to enable scalable modular quantum computers.

## C. Scaling to larger transmon counts

Although Figures 10 and 11 help analyze costs for near-term system sizes, we are also interested in the impact of hybrid erasure architectures for larger systems. To estimate the performance of the hybrid erasure architecture in higher distances and lower logical error rates, we use the empirical thresholds and a projected  $d_{\rm eff}$  bounded between two estimates. As an optimistic upper bound, we may expect the erasure fraction to behave as in the repetition code described in Section III-B. We observe that the fit becomes more accurate as we increase the distance of the surface code. The analytical lower-bound is derived using Equation 2. In Figure 12 , we compare a full-erasure architecture, a no-erasure architecture, and a hybrid architecture for surface code sizes up to 1,000

transmons and down to logical error rates of  $10^{-15}$ , where the shaded regions represent the parameter regimes where a hybrid architecture may potentially outperform the standard approaches. In the top plot of Figure 12, each point indicates the best possible logical error rate for each architecture as we increase the number of transmons, where at each point we pick the optimal code distance (and erasure fraction, for the hybrid architecture) to minimize logical error rate. We find that the best strategy is often to have 100% erasure qubits, but there are ranges where the hybrid approach is potentially better. As the size of the system increases, these regions become proportionally smaller. We observe the same trend in the bottom plot of Figure 12, where for a target logical error rate we plot the fewest number of transmons needed for each architecture.

We conclude that the benefits of a hybrid architecture diminish as the size of the system increases. However, in the near-term, there are potentially many opportunities where a hybrid architecture can improve logical-level performance. Furthermore, given the low yield of large chips, it may be preferable to keep the chips small. Indeed, in a fault-tolerant system built on concatenated codes [49], [50], it may be sufficient to use smaller distance surface codes where the results of Figure 10 and Figure 12 indicate that hybrid erasure qubits are more impactful. We leave further analysis of hybrid erasure qubits in a concatenated scheme to future work as the system scale needed to support such codes is likely beyond what is achievable in the near-term.

# VI. EMPIRICAL ANALYSIS OF ERASURE PLACEMENTS

In this section, we will extend our analysis on the impact of erasure qubit placement to the circuit level and compare against the code-capacity level predictions.

#### A. Empirical effect of erasure qubit placement

Figure 13 shows logical error rate correlations for each qubit location in a d=7 surface code. Samples are generated from circuit-level simulations of randomly distributed erasure qubits of a given erasure fraction. As expected, we generally see that qubits near the center have the highest correlations with the logical error rate, since these qubits are included in the most error paths that connect opposite boundaries. Interestingly, we observe "checkerboard" patterns for independent Z and X logical error rates; we hypothesize that these are an artifact of the different CNOT orderings for Z/X stabilizers. Combining the Z and X error rates into a single logical error rate recovers the expected behavior from the code-capacity analysis, where distance from the center is the most important feature.

#### B. Other erasure configurations

Finally, we investigate the performance of erasure placement with varying orientation. We compare the gains in effective distance achieved using each variation of erasure qubit placement, simulating each configuration at d=3,5,7 with increasing proportions of erasure qubits. Our results validate the heuristic priority to increase  $k_e$ , the minimum number of



Fig. 14. Placing erasure qubits in a way that intercepts as many traversing paths as possible maximizes effective distance gains in partial erasure architectures. Focusing on logical Z error paths, we see that adding a centrally-placed column of erasures, which encounters every possible logical Z error path, has a bigger effect on the effective distance than adding a diagonal, and an even bigger effect compared to adding a row.



Fig. 15. Placing lines of erasure qubits in different orientations has a small impact on the total logical error rate if the minimum number of erasures each logical error encounters is the same.



Fig. 16. Gains in effective distance are slightly higher when lines of erasures are placed consecutively rather than alternately. Data shown is for  $\frac{d+1}{2}$  lines of erasure, placed horizontally (rows) and vertically (cols).

erasure qubits each logical error encounters. We show that the best effective distance can be obtained when the lines of erasures are placed in consecutive rows closest to the center as prescribed by the optimized placement strategy, while the orientations of the configuration does not matter.

We first study the impact of the orientation of lines of erasures, demonstrating the importance of placing erasure qubits to maximize  $k_e$ . As shown in Figure 14, we compare the cases where erasure qubits are organized in centrally placed rows, columns, and diagonals, focusing on the logical Z error rate. As we increase the number of lines of erasure qubits in each configuration, we see that the columns of erasure qubits show the greatest gains in effective distance, followed by the diagonals, and finally the rows. This is in keeping with our expectations: After adding a column, every logical Z error path crossing through the band of erasure qubits necessarily encounters an additional erasure qubit, increasing  $k_e$  by one. In contrast, it takes adding 2 (or 3 in the corners) diagonals to guarantee an increase in  $k_e$ . Rows perform worst of all because they are parallel to the logical observable–  $k_e$  will be 0 unless every row is populated with erasure qubits, as there always exists an error path that does not encounter any erasure qubits.

Although our heuristic prioritizes increasing  $k_e$ , it does not prescribe a specific configuration if multiple can achieve the same  $k_e$ . Finally, we will study the sensitivity of the logical error performance for varying orientation and centrality of lines of erasures.

First, we show that orientation of the lines do not matter as long as they achieve the same  $k_e$ . We show the  $d_{\rm eff}$  results obtained via simulation in 15 from three such configurations, where lines of erasures take different orientations, where the impact on effective distance is small.

Then, we see that it's better to place lines of earsures closest to the center. As shown in Figure 16, we see that for an equal number of lines of erasures  $(\frac{d+1}{2})$ , placing them consecutively in the center yields a higher effective distance than placing them alternately. This follows from our results in Figures 6 and 13, which show that we should expect the middle data qubits to have a higher impact on performance than those on the edge, as prescribed by the optimized placement strategy.

## C. Validation and limits of code-capacity level analysis

Although we did not expect capacity level analysis to yield accurate estimations of logical error rates, our goal was to gain insights about the performance of a hybrid erasure architecture and to capture key characteristics with simple models. With circuit-level simulation results in hand, we revisit the codecapacity analysis of Equation 2 and investigate the agreement between the simpler model and the empirical results.

First, we show in Figure 17 that the empirical effective distance  $d_{\rm eff}$  matches reasonably well between the predictions for effective blocking (consecutive columns) and placement strategies and  $P^*$  (optimized), and between ineffective blocking (consecutive rows) and  $P_r$  (random) across erasure qubit



Fig. 17. Comparing code-capacity model from Section III with simulation data from Section V for two different placement strategies (random placement  $P_r$ , optimized heuristic  $P^*$ ) and placements in consecutive rows and columns.

fractions for small distances. We have leveraged this analytical model in the projective cost studies of Section V-C, where it remains useful in parameter regimes where circuit-level simulations are computationally prohibitive. Secondly, the optimized placement strategy  $P^*$ , informed by insights at capacity level analysis, matches well between theory and circuit-level simulation. Finally, our importance metric based on lattice traversing paths can also be validated by the circuit-level correlations studied in Figure 13.

At the same time, deviations between the capacity-level analysis and circuit-level simulations highlight intriguing future directions for erasure qubits. For one, the channel-specific checkerboard pattern, only seen in the circuit-level correlation study, is likely an artifact of the syndrome extraction schedule designed to minimize hook errors. These variations suggest that a strategic placement of erasure checks may potentially help to identify and prevent error propagation, or to achieve asymmetric protection against different error channels. Secondly, rather surprisingly, the estimated effective distance from circuit-level simulation often outperforms capacity-level predictions. One possible explanation is that a decoder with circuit-level noise has more granular information about the

space-time location of a heralded erasure error, which is lost on a phenomenological model that does not explicitly model syndrome extractions. This suggests that a closer inspection of space-time decoders designed for partial-erasure architectures may reveal additional performance gains.

#### VII. DISCUSSION

Our work has explored hybrid erasure surface codes in which some fraction of physical qubits have an erasure bias and some do not. By combining theoretical code-capacity insights with detailed circuit-level simulations, we have shown that strategic placement of erasure qubits can improve both effective distance and threshold, opening up new avenues for practical error-corrected quantum computing.

We have developed a well-motivated heuristic to place a limited number of erasure qubits in a larger surface code patch, prioritizing full rows and columns of erasure qubits. Using this heuristic, our circuit-level simulation of the hybrid erasure architecture validated the code-capacity analysis, showing a steady improvement in effective distance and threshold as the fraction of erasure qubits in the chip increases. With a noise model tailored to dual-rail qubits in superconducting transmon architectures, we found that a hybrid chip can achieve better logical error rates than homogeneous all-normal or all-erasure qubit chips for certain transmon-count budgets.

While our specific circuit-level noise model and transmoncounting results are targeted towards dual-rail superconducting implementations, our theoretical analysis and circuit-level simulation results pertaining to effective distance and spatial placement are broadly applicable to all types of erasure qubits.

We envision several intriguing directions for future work. In this study, we assumed perfect erasure checks (no false positives or negatives), but the performance of erasure qubits can heavily depend on the accuracy of the checks [46]; in addition, there are several aspects of erasure qubits that can be optimized or balanced, such as the frequency of erasure checks [39]. It would also be interesting to study the performance of hybrid-erasure schemes in the presence of defects, as has been done for the standard surface code [29], [51]. Finally, we are interested in exploring how a hybrid-erasure chip performs under biased noise by adding more rows or columns of erasure qubits as an alternative to nonsquare or XZZX surface codes. Erasure qubits placed along rows or columns may be an effective way to protect one logical observable more than the other if there is a bias in physical noise.

Overall, our work demonstrates that a hybrid-erasure architecture is a viable pathway toward enhancing the logical performance of surface codes while efficiently managing hardware resources. These results lay a strong foundation for future studies to address imperfect erasure checks, biased noise, and defect tolerance, and they provide an exciting roadmap for adapting our approach to various quantum hardware platforms.

## AUTHOR CONTRIBUTIONS

The authors devised the project idea together and all helped guide its direction. W.Y. performed theoretical analysis and processed spatial correlation data. J.D.C. and J.V. developed circuit-level simulation code. J.D.C. and M.H.T. performed circuit-level simulations. J.V. performed scaling cost analysis. All authors wrote and edited the manuscript and figures.

#### **ACKNOWLEDGEMENTS**

We are grateful to Connor Hann for insightful discussions on partial-erasure architectures and for motivating us to pick up this project again after a long hiatus.

#### REFERENCES

- [1] D. Litinski, "A Game of Surface Codes: Large-Scale Quantum Computing with Lattice Surgery," *Quantum*, vol. 3, p. 128, Mar. 2019, arXiv:1808.02892 [cond-mat, physics:quant-ph]. [Online]. Available: http://arxiv.org/abs/1808.02892
- [2] C. Gidney and M. Ekerå, "How to factor 2048 bit RSA integers in 8 hours using 20 million noisy qubits," *Quantum*, vol. 5, p. 433, Apr. 2021, publisher: Verein zur Förderung des Open Access Publizierens in den Quantenwissenschaften. [Online]. Available: https://quantum-journal.org/papers/q-2021-04-15-433/
- [3] M. E. Beverland, P. Murali, M. Troyer, K. M. Svore, T. Hoefler, V. Kliuchnikov, G. H. Low, M. Soeken, A. Sundaram, and A. Vaschillo, "Assessing requirements to scale to practical quantum advantage," Nov. 2022, arXiv:2211.07629 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2211.07629
- [4] C. Chamberland and E. T. Campbell, "Universal Quantum Computing with Twist-Free and Temporally Encoded Lattice Surgery," PRX Quantum, vol. 3, no. 1, p. 010331, Feb. 2022. [Online]. Available: https://link.aps.org/doi/10.1103/PRXQuantum.3.010331
- [5] I. H. Kim, Y.-H. Liu, S. Pallister, W. Pol, S. Roberts, and E. Lee, "Fault-tolerant resource estimate for quantum chemical simulations: Case study on Li-ion battery electrolyte molecules," *Physical Review Research*, vol. 4, no. 2, p. 023019, Apr. 2022, publisher: American Physical Society. [Online]. Available: https: //link.aps.org/doi/10.1103/PhysRevResearch.4.023019
- [6] T. Leblond, C. Dean, G. Watkins, and R. Bennink, "Realistic Cost to Execute Practical Quantum Circuits using Direct Clifford+T Lattice Surgery Compilation," ACM Transactions on Quantum Computing, vol. 5, no. 4, pp. 1–28, Dec. 2024. [Online]. Available: https://dl.acm.org/doi/10.1145/3689826
- [7] A. G. Fowler, M. Mariantoni, J. M. Martinis, and A. N. Cleland, "Surface codes: Towards practical large-scale quantum computation," *Physical Review A*, vol. 86, no. 3, p. 032324, Sep. 2012. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevA.86.032324
- [8] J. P. Bonilla Ataides, D. K. Tuckett, S. D. Bartlett, S. T. Flammia, and B. J. Brown, "The XZZX surface code," *Nature Communications*, vol. 12, no. 1, p. 2172, Apr. 2021. [Online]. Available: https://www.nature.com/articles/s41467-021-22274-1
- [9] A. Dua, A. Kubica, L. Jiang, S. T. Flammia, and M. J. Gullans, "Clifford-deformed Surface Codes," *PRX Quantum*, vol. 5, no. 1, p. 010347, Mar. 2024, arXiv:2201.07802 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2201.07802
- [10] D. K. Tuckett, S. D. Bartlett, and S. T. Flammia, "Ultrahigh Error Threshold for Surface Codes with Biased Noise," *Physical Review Letters*, vol. 120, no. 5, p. 050505, Jan. 2018, arXiv:1708.08474 [quant-ph]. [Online]. Available: http://arxiv.org/abs/1708.08474
- [11] D. K. Tuckett, A. S. Darmawan, C. T. Chubb, S. Bravyi, S. D. Bartlett, and S. T. Flammia, "Tailoring surface codes for highly biased noise," *Physical Review X*, vol. 9, no. 4, p. 041031, Nov. 2019, arXiv:1812.08186 [quant-ph]. [Online]. Available: http://arxiv.org/abs/1812.08186
- [12] S. Gu, A. Retzker, and A. Kubica, "Fault-tolerant quantum architectures based on erasure qubits," Dec. 2023, arXiv:2312.14060 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2312.14060
- [13] K. Sahay, J. Jin, J. Claes, J. D. Thompson, and S. Puri, "High-Threshold Codes for Neutral-Atom Qubits with Biased Erasure Errors," *Physical Review X*, vol. 13, no. 4, p. 041013, Oct. 2023. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevX.13.041013

- [14] Y. Wu, S. Kolkowitz, S. Puri, and J. D. Thompson, "Erasure conversion for fault-tolerant quantum computing in alkaline earth Rydberg atom arrays," *Nature Communications*, vol. 13, no. 1, p. 4657, Aug. 2022. [Online]. Available: https://www.nature.com/articles/ s41467-022-32094-6
- [15] A. Kubica, A. Haim, Y. Vaknin, H. Levine, F. Brandão, and A. Retzker, "Erasure Qubits: Overcoming the T 1 Limit in Superconducting Circuits," *Physical Review X*, vol. 13, no. 4, p. 041022, Nov. 2023. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevX. 13.041022
- [16] J. D. Teoh, P. Winkel, H. K. Babla, B. J. Chapman, J. Claes, S. J. De Graaf, J. W. O. Garmon, W. D. Kalfus, Y. Lu, A. Maiti, K. Sahay, N. Thakur, T. Tsunoda, S. H. Xue, L. Frunzio, S. M. Girvin, S. Puri, and R. J. Schoelkopf, "Dual-rail encoding with superconducting cavities," *Proceedings of the National Academy of Sciences*, vol. 120, no. 41, p. e2221736120, Oct. 2023. [Online]. Available: https://pnas.org/doi/10.1073/pnas.2221736120
- [17] M. Kang, W. C. Campbell, and K. R. Brown, "Quantum Error Correction with Metastable States of Trapped Ions Using Erasure Conversion," PRX Quantum, vol. 4, no. 2, p. 020358, Jun. 2023. [Online]. Available: https://link.aps.org/doi/10.1103/PRXQuantum.4.020358
- [18] S. Ma, G. Liu, P. Peng, B. Zhang, S. Jandura, J. Claes, A. P. Burgers, G. Pupillo, S. Puri, and J. D. Thompson, "High-fidelity gates and mid-circuit erasure conversion in an atomic qubit," *Nature*, vol. 622, no. 7982, pp. 279–284, Oct. 2023. [Online]. Available: https://www.nature.com/articles/s41586-023-06438-1
- [19] P. Scholl, A. L. Shaw, R. B.-S. Tsai, R. Finkelstein, J. Choi, and M. Endres, "Erasure conversion in a high-fidelity Rydberg quantum simulator," *Nature*, vol. 622, no. 7982, pp. 273–278, Oct. 2023. [Online]. Available: https://www.nature.com/articles/s41586-023-06516-4
- [20] K. S. Chou, T. Shemma, H. McCarrick, T.-C. Chien, J. D. Teoh, P. Winkel, A. Anderson, J. Chen, J. Curtis, S. J. de Graaf, J. W. O. Garmon, B. Gudlewski, W. D. Kalfus, T. Keen, N. Khedkar, C. U. Lei, G. Liu, P. Lu, Y. Lu, A. Maiti, L. Mastalli-Kelly, N. Mehta, S. O. Mundhada, A. Narla, T. Noh, T. Tsunoda, S. H. Xue, J. O. Yuan, L. Frunzio, J. Aumentado, S. Puri, S. M. Girvin, J. Moseley, and R. J. Schoelkopf, "Demonstrating a superconducting dual-rail cavity qubit with erasure-detected logical measurements," Oct. 2023, arXiv:2307.03169 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2307.03169
- [21] A. Koottandavida, I. Tsioutsios, A. Kargioti, C. R. Smith, V. R. Joshi, W. Dai, J. D. Teoh, J. C. Curtis, L. Frunzio, R. J. Schoelkopf, and M. H. Devoret, "Erasure detection of a dual-rail qubit encoded in a double-post superconducting cavity," Nov. 2023, arXiv:2311.04423 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2311.04423
- [22] H. Levine, A. Haim, J. Hung, N. Alidoust, M. Kalaee, L. DeLorenzo, E. Wollack, P. Arrangoiz-Arriola, A. Khalajhedayati, R. Sanil, H. Moradinejad, Y. Vaknin, A. Kubica, D. Hover, S. Aghaeimeibodi, J. Alcid, C. Baek, J. Barnett, K. Bawdekar, P. Bienias, H. Carson, C. Chen, L. Chen, H. Chinkezian, E. Chisholm, A. Clifford, R. Cosmic, N. Crisosto, A. Dalzell, E. Davis, J. D'Ewart, S. Diez, N. D'Souza, P. Dumitrescu, E. Elkhouly, M. Fang, Y. Fang, S. Flammia, M. Fling, G. Garcia, M. Gharzai, A. Gorshkov, M. Gray, S. Grimberg, A. Grimsmo, C. Hann, Y. He, S. Heidel, S. Howell, M. Hunt, J. Iverson, I. Jarrige, L. Jiang, W. Jones, R. Karabalin, P. Karalekas, A. Keller, D. Lasi, M. Lee, V. Ly, G. MacCabe, N. Mahuli, G. Marcaud, M. Matheny, S. McArdle, G. McCabe, G. Merton, C. Miles, A. Milsted, A. Mishra, L. Moncelsi, M. Naghiloo, K. Noh, E. Oblepias, G. Ortuno, J. Owens, J. Pagdilao, A. Panduro, J.-P. Paquette, R. Patel, G. Peairs, D. Perello, E. Peterson, S. Ponte, H. Putterman, G. Refael, P. Reinhold, R. Resnick, O. Reyna, R. Rodriguez, J. Rose, A. Rubin, M. Runyan, C. Ryan, A. Sahmoud, T. Scaffidi, B. Shah, S. Siavoshi, P. Sivarajah, T. Skogland, C.-J. Su, L. Swenson, J. Sylvia, S. Teo, A. Tomada, G. Torlai, M. Wistrom, K. Zhang, I. Zuk, A. Clerk, F. Brandão, A. Retzker, and O. Painter, "Demonstrating a Long-Coherence Dual-Rail Erasure Qubit Using Tunable Transmons," Physical Review X, vol. 14, no. 1, p. 011051, Mar. 2024. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevX.14.011051
- [23] M. N. H. Chow, V. Buchemmavari, S. Omanakuttan, B. J. Little, S. Pandey, I. H. Deutsch, and Y.-Y. Jau, "Circuit-based erasure conversion of leakage errors in neutral atoms," May 2024, arXiv:2405.10434 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2405.10434
- [24] C. M. Holland, Y. Lu, S. J. Li, C. L. Welsh, and L. W. Cheuk,

- "Demonstration of Erasure Conversion in a Molecular Tweezer Array," Jun. 2024, arXiv:2406.02391 [cond-mat, physics:physics, physics:quant-ph]. [Online]. Available: http://arxiv.org/abs/2406.02391
- [25] S. J. de Graaf, S. H. Xue, B. J. Chapman, J. D. Teoh, T. Tsunoda, P. Winkel, J. W. O. Garmon, K. M. Chang, L. Frunzio, S. Puri, and R. J. Schoelkopf, "A mid-circuit erasure check on a dual-rail cavity qubit using the joint-photon number-splitting regime of circuit QED," Jun. 2024, arXiv:2406.14621 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2406.14621
- [26] N. Mehta, J. D. Teoh, T. Noh, A. Agrawal, R. Chamberlain, T.-C. Chien, J. C. Curtis, B. H. Elfeky, S. M. Farzaneh, B. Gudlewski, T. Keen, N. Khedkar, C. Kurter, R. Li, G. Liu, P. Lu, H. McCarrick, A. Narla, S. Satapathy, T. Shemma, R. A. Shi, D. K. Weiss, J. Aumentado, C. U. Lei, J. O. Yuan, S. O. Mundhada, S. H. M. Jr, K. S. Chou, and R. J. Schoelkopf, "Bias-preserving and error-detectable entangling operations in a superconducting dual-rail system," Mar. 2025, arXiv:2503.10935 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2503.10935
- [27] J. M. Kreikebaum, K. P. O'Brien, A. Morvan, and I. Siddiqi, "Improving wafer-scale Josephson junction resistance variation in superconducting quantum coherent circuits," *Superconductor Science and Technology*, vol. 33, no. 6, p. 06LT02, Apr. 2020, publisher: IOP Publishing. [Online]. Available: https://dx.doi.org/10.1088/1361-6668/ab8617
- [28] K. Zeissler, "Superconducting qubits at scale," *Nature Electronics*, vol. 7, no. 10, pp. 847–847, Oct. 2024, publisher: Nature Publishing Group. [Online]. Available: https://www.nature.com/articles/s41928-024-01283-0
- [29] D. M. Debroy, M. McEwen, C. Gidney, N. Shutty, and A. Zalcman, "LUCI in the Surface Code with Dropouts," Oct. 2024, arXiv:2410.14891 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2410.14891
- [30] K. N. Smith, G. S. Ravi, J. M. Baker, and F. T. Chong, "Scaling Superconducting Quantum Computers with Chiplet Architectures," in 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), Oct. 2022, pp. 1092–1109. [Online]. Available: https://ieeexplore.ieee.org/document/9923784
- [31] S. F. Lin, J. Viszlai, K. N. Smith, G. S. Ravi, C. Yuan, F. T. Chong, and B. J. Brown, "Codesign of quantum error-correcting codes and modular chiplets in the presence of defects," in *Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2*, ser. ASPLOS '24, vol. 2. New York, NY, USA: Association for Computing Machinery, Apr. 2024, pp. 216–231. [Online]. Available: https://dl.acm.org/doi/10.1145/3620665.3640362
- [32] Y. Hashimoto, S. Yorozu, T. Satoh, and T. Miyazaki, "Demonstration of chip-to-chip transmission of single-flux-quantum pulses at throughputs beyond 100 Gbps," *Applied Physics Letters*, vol. 87, no. 2, p. 022502, Jul. 2005. [Online]. Available: https://doi.org/10.1063/1.1993767
- [33] S. Krinner, S. Storz, P. Kurpiers, P. Magnard, J. Heinsoo, R. Keller, J. Lütolf, C. Eichler, and A. Wallraff, "Engineering cryogenic setups for 100-qubit scale superconducting circuit systems," *EPJ Quantum Technology*, vol. 6, no. 1, pp. 1–29, Dec. 2019, number: 1 Publisher: SpringerOpen. [Online]. Available: https://epjquantumtechnology.springeropen.com/articles/10.1140/epjqt/s40507-019-0072-0
- [34] I. Byun, J. Kim, D. Min, I. Nagaoka, K. Fukumitsu, I. Ishikawa, T. Tanimoto, M. Tanaka, K. Inoue, and J. Kim, "XQsim: modeling cross-technology control processors for 10+K qubit quantum computers," in Proceedings of the 49th Annual International Symposium on Computer Architecture, ser. ISCA '22. New York, NY, USA: Association for Computing Machinery, Jun. 2022, pp. 366–382. [Online]. Available: https://dl.acm.org/doi/10.1145/3470496.3527417
- [35] Y. Ueno, Y. Tomida, T. Tanimoto, M. Tanaka, Y. Tabuchi, K. Inoue, and H. Nakamura, "Inter-Temperature Bandwidth Reduction in Cryogenic QAOA Machines," *IEEE Comput. Archit. Lett.*, vol. 23, no. 1, pp. 6–9, Jan. 2024. [Online]. Available: https://doi.org/10.1109/LCA.2023.3322700
- [36] H. Jnane, B. Undseth, Z. Cai, S. C. Benjamin, and B. Koczor, "Multicore Quantum Computing," *Physical Review Applied*, vol. 18, no. 4, p. 044064, Oct. 2022, publisher: American Physical Society. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevApplied.18.044064
- [37] T. M. Stace, S. D. Barrett, and A. C. Doherty, "Thresholds for Topological Codes in the Presence of Loss," *Physical Review Letters*, vol. 102, no. 20, p. 200501, May 2009, publisher: American Physical Society. [Online]. Available: https://link.aps.org/doi/10.1103/ PhysRevLett.102.200501

- [38] N. Delfosse and N. H. Nickerson, "Almost-linear time decoding algorithm for topological codes," *Quantum*, vol. 5, p. 595, Dec. 2021, arXiv:1709.06218 [quant-ph]. [Online]. Available: http://arxiv.org/abs/ 1709.06218
- [39] S. Gu, Y. Vaknin, A. Retzker, and A. Kubica, "Optimizing quantum error correction protocols with erasure qubits," Aug. 2024, arXiv:2408.00829 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2408.00829
- [40] P. Magnard, P. Kurpiers, B. Royer, T. Walter, J.-C. Besse, S. Gasparinetti, M. Pechal, J. Heinsoo, S. Storz, A. Blais, and A. Wallraff, "Fast and Unconditional All-Microwave Reset of a Superconducting Qubit," *Physical Review Letters*, vol. 121, no. 6, p. 060502, Aug. 2018. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevLett.121.060502
- [41] J. Marques, H. Ali, B. Varbanov, M. Finkel, H. Veen, S. Van Der Meer, S. Valles-Sanclemente, N. Muthusubramanian, M. Beekman, N. Haider, B. Terhal, and L. DiCarlo, "All-Microwave Leakage Reduction Units for Quantum Error Correction with Superconducting Transmon Qubits," *Physical Review Letters*, vol. 130, no. 25, p. 250602, Jun. 2023. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevLett.130.250602
- [42] Y. Zhou, Z. Zhang, Z. Yin, S. Huai, X. Gu, X. Xu, J. Allcock, F. Liu, G. Xi, Q. Yu, H. Zhang, M. Zhang, H. Li, X. Song, Z. Wang, D. Zheng, S. An, Y. Zheng, and S. Zhang, "Rapid and unconditional parametric reset protocol for tunable superconducting qubits," *Nature Communications*, vol. 12, no. 1, p. 5924, Oct. 2021. [Online]. Available: https://www.nature.com/articles/s41467-021-26205-y
- [43] K. Geerlings, Z. Leghtas, I. M. Pop, S. Shankar, L. Frunzio, R. J. Schoelkopf, M. Mirrahimi, and M. H. Devoret, "Demonstrating a Driven Reset Protocol for a Superconducting Qubit," *Physical Review Letters*, vol. 110, no. 12, p. 120501, Mar. 2013. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevLett.110.120501
- [44] E. Dennis, A. Kitaev, A. Landahl, and J. Preskill, "Topological quantum memory," *Journal of Mathematical Physics*, vol. 43, no. 9, pp. 4452–4505, Sep. 2002. [Online]. Available: https://pubs.aip.org/ jmp/article/43/9/4452/230976/Topological-quantum-memory
- [45] OEIS Foundation Inc., "The On-Line Encyclopedia of Integer Sequences," 2024, sequence A038199; accessed April 14, 2025. [Online]. Available: https://oeis.org/A038199
- [46] K. Chang, S. Singh, J. Claes, K. Sahay, J. Teoh, and S. Puri, "Surface Code with Imperfect Erasure Checks," Aug. 2024, arXiv:2408.00842 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2408.00842
- [47] C. Gidney, "Stim: a fast stabilizer circuit simulator," *Quantum*, vol. 5, p. 497, Jul. 2021. [Online]. Available: https://quantum-journal.org/papers/q-2021-07-06-497/
- [48] O. Higgott, "PyMatching: A Python Package for Decoding Quantum Codes with Minimum-Weight Perfect Matching," ACM Transactions on Quantum Computing, vol. 3, no. 3, pp. 1–16, Sep. 2022. [Online]. Available: https://dl.acm.org/doi/10.1145/3505637
- [49] C. Gidney, M. Newman, P. Brooks, and C. Jones, "Yoked surface codes," Dec. 2023, arXiv:2312.04522 [quant-ph]. [Online]. Available: http://arxiv.org/abs/2312.04522
- [50] C. A. Pattison, A. Krishna, and J. Preskill, "Hierarchical memories: Simulating quantum LDPC codes with local gates," Mar. 2023, arXiv:2303.04798 [quant-ph]. [Online]. Available: http://arxiv.org/abs/ 2303.04798
- [51] J. M. Auger, H. Anwar, M. Gimeno-Segovia, T. M. Stace, and D. E. Browne, "Fault-tolerant quantum computation with nondeterministic entangling gates," *Physical Review A*, vol. 97, no. 3, p. 030301, Mar. 2018, publisher: American Physical Society. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevA.97.030301