

# How Good Can Monolayer MoS<sub>2</sub> Transistors Be?

Youngki Yoon, \* Kartik Ganapathi, \* and Sayeef Salahuddin \*

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California 94720, United States

**ABSTRACT:** Monolayer molybdenum disulfide (MoS<sub>2</sub>), unlike its bulk form, is a direct band gap semiconductor with a band gap of 1.8 eV. Recently, field-effect transistors have been demonstrated experimentally using a mechanically exfoliated MoS<sub>2</sub> monolayer, showing promising potential for next generation electronics. Here we project the ultimate performance limit of MoS<sub>2</sub> transistors by using nonequilibrium Green's function based quantum transport simulations. Our simulation results show that the strength of MoS<sub>2</sub> transistors lies in large ON–OFF current ratio (>10<sup>10</sup>), immunity to short channel effects (drain-



induced barrier lowering  $\sim 10 \text{ mV/V}$ ), and abrupt switching (subthreshold swing as low as 60 mV/decade). Our comparison of monolayer MoS<sub>2</sub> transistors to the state-of-the-art III—V materials based transistors, reveals that while MoS<sub>2</sub> transistors may not be ideal for high-performance applications due to heavier electron effective mass ( $m^* = 0.45m_0$ ) and a lower mobility, they can be an attractive alternative for low power applications thanks to the large band gap and the excellent electrostatic integrity inherent in a two-dimensional system.

KEYWORDS: Molybdenum disulfide (MoS<sub>2</sub>), layered materials, field-effect transistor, quantum transport, NEGF, device physics

Molybdenum disulfide (MoS<sub>2</sub>), a layered transition metal dichalcogenide, has several interesting electrical, mechanical, and optical properties. Apart from being widely used as a dry lubricant for automobiles due to its low friction properties, MoS<sub>2</sub> has been explored for applications in photovoltaics<sup>2</sup> and photocatalysis<sup>3,4</sup> for energy conversion. Structurally, MoS<sub>2</sub> is a stack of planes where covalently bonded S-Mo-S atoms are closely packed in a hexagonal arrangement, and the adjacent planes are held together by van der Waals interactions.<sup>5</sup> These weak interlayer interactions, in contrast to strong intralayer bonding, make synthesis of monolayers of MoS<sub>2</sub> possible by micromechanical exfoliation from bulk crystalline  ${\rm MoS_2}^{5-7}$  identical to the fabrication of graphene from graphite.8 Recent experimental studies on few layers of MoS2 using optical absorption and photoluminescence show that, while bulk MoS<sub>2</sub> is an indirect band gap semiconductor with a band gap  $(E_g)$  of 1.29 eV,<sup>9</sup> at monolayer thickness (0.65 nm),<sup>10</sup> MoS<sub>2</sub> transitions to a direct band gap semiconductor with  $E_{\sigma} = 1.8 \text{ eV}$ , <sup>5,6</sup> thereby corroborating earlier ab initio based calculations that predicted a similar band gap. 11 High thermal stability of MoS2 and the absence of dangling bonds, coupled with the presence of significant band gap in a 2-D material render monolayer MoS2 as an attractive candidate for switching applications, unlike graphene wherein the absence of band gap inhibits its use despite large reported mobilities  $(200000 \text{ cm}^2/(\text{V s})).^{12}$ 

While monolayer  $MoS_2$  has previously exhibited poor mobility  $(<10~cm^2/(V~s))^8$  limiting its potential for majority of the electronic applications, it has been recently reported that in the presence of a high- $\kappa$  environment, the mobility of monolayer  $MoS_2$  can increase by several times  $(\sim 200~cm^2/(V~s))$ , which is similar to earlier reports of such phenomenon in the case of graphene. This mobility enhancement, one of the reasons of which could be dielectric screening as predicted by theoretical calculations, opens up the possibility of monolayer  $MoS_2$ 

field-effect transistors for low standby and low operating power electronics. Recently, long channel monolayer  $MoS_2$  transistors with very good ON—OFF current ratio (>10<sup>7</sup>) and subthreshold swing (74 mV/decade) have been demonstrated experimentally. While rapid progress in fabrication of short-channel  $MoS_2$  transistors can be expected, it is of significant technological relevance to estimate the ultimate performance limit that can be achieved in such devices before an aggressive pursuit of miniaturization begins. In this Letter, we attempt to answer this question by using rigorous quantum transport simulations and view their performance in light of some of the competing non-silicon technologies to assess the viability of monolayer  $MoS_2$  transistors for future electronic applications.

The schematic of the atomistic configuration of a monolayer MoS<sub>2</sub> and that of the simulated device are shown in Figure 1. To describe electronic transport through MoS<sub>2</sub>, we perform self-consistent ballistic quantum transport simulations within the nonequilibrium Green's function (NEGF) formalism using an effective mass Hamiltonian. Transport equations are solved iteratively together with Poisson's equation until a self-consistency between charge density (calculated by analytical summation of transverse momentum modes within the first Brillouin zone) and electrostatic potential is achieved. Subsequently current is calculated as

$$\begin{split} I_{\rm D} &= \frac{e}{\hbar^2} \sqrt{\frac{m_y^* k_{\rm B} T}{2\pi^3}} \int \, \mathrm{d}E_{k_x} \bigg\{ F_{-1/2} \bigg( \frac{\mu_1 - E_{k_x}}{k_{\rm B} T} \bigg) \\ &- F_{-1/2} \bigg( \frac{\mu_2 - E_{k_x}}{k_{\rm B} T} \bigg) \bigg\} T_{\rm SD} (E_{k_x}) \end{split} \tag{1}$$

 Received:
 May 28, 2011

 Revised:
 July 15, 2011

 Published:
 July 26, 2011



Figure 1. Device structure of a monolayer  $MoS_2$  transistor. (a) Atomistic configuration of two-dimensional  $MoS_2$  monolayer, which is a 6.5 Å thick large band gap semiconductor ( $E_{\rm g}=1.8$  eV). (b) Schematic cross section of a monolayer  $MoS_2$  transistor. The channel is an  $MoS_2$  monolayer, and the source and drain contacts can be of a metal like gold (Au), which is believed to make a good contact with small Schottky barrier at the junction (ref 7). The nominal device has the following parameters: Gate length  $L_{\rm G}=15$  nm,  $HfO_2$  ( $\kappa=25$ ) gate oxide thickness  $t_{\rm ox}=2.8$  nm, gate underlap of 2 nm at each side, Schottky barrier height of  $\Phi_{\rm B}=0.1$  eV, power supply voltage of 0.5 V.

where  $F_{-1/2}(.)$  denotes the Fermi-Dirac integral of order -1/2,  $T_{\rm SD}(.)$  is the transmission coefficient from source to drain,  $\mu_1$  and  $\mu_2$  are source and drain electrochemical potentials,  $\hbar$ ,  $m_v^*$ , e,  $k_B$ , T, and  $E_k$  are reduced Planck's constant, transverse effective mass, elementary charge, Boltzmann constant, temperature, and longitudinal energy, respectively. Gate leakage current is ignored. The conduction band effective mass along the transport direction (x) is calculated to be  $0.45m_0$  (K  $\rightarrow \Gamma$ ),  $m_0$  being the free electron mass, from the dispersion relations of monolayer  $MoS_2$ . A calculation of effective mass along K  $\rightarrow$  M yields a similar value to the first order. Hence we assume the transverse effective mass to be identical to that along the transport direction. The Hamiltonian of the metallic regions at source and drain is modeled using an effective mass close to  $m_0$  $(1.01m_0)$ , <sup>15</sup> and Dirichlet boundary conditions are imposed at the contacts. We use a dielectric constant of 3.3 for MoS<sub>2</sub>. 16,17 Detailed device parameters used in this work are provided in the caption of Figure 1b. For the simulated device, source and drain work functions are assumed to be the same as that of the gate. A grid spacing as small as 0.1 nm along the x direction ensures the presence of states in both contact and channel regions in the entire energy range of interest within a single band description of effective mass Hamiltonian.

A very good Ohmic contact with low contact resistance is essential in optimizing the device performance. However, one of the issues in the fabrication of an Ohmic contact is the difficulty in finding metals with desired work function. Tunneling contacts using narrow Schottky junctions, which is a more practical way to realize Ohmic contacts, may suffer from Fermi-level pinning due to defects and interface states resulting in a significant tunneling barrier and hence an increased contact resistance. We analyze the experimental  $I_{\rm D}{-}V_{\rm BG}$  (drain current—back-gate voltage) characteristics at source-to-drain voltage  $V_{\rm D}=10$  mV reported in ref 7 in order to estimate the Schottky barrier height for the  ${\rm Au-MoS_2}$  junction, which we use for simulating our nominal device. The total resistance of the device at any gate voltage  $R_{\rm TOT}(V_{\rm BG})$ , expressed as  $V_{\rm D}/I_{\rm D}$ , is composed of three different



Figure 2. Analysis of experimental back-gated characteristics. (a) Variation of effective mobility  $\mu_{\rm eff}$  with back-gate voltage  $V_{\rm BG}$  obtained from the analysis of experimental back-gated current—voltage characteristics at  $V_{\rm D}=10$  mV reported in ref 7. Contact resistances are accounted for by using barrier height  $\Phi_{\rm B}$  at the metal—MoS $_2$  junction as a parameter. For a given  $I_{\rm D}$ , a larger Schottky barrier height decreases the channel resistance resulting in a higher mobility and vice versa. A mobility of 217 cm²/(V s) calculated in ref 7 without accounting for contact resistance is also plotted with a black dotted line. (b) Variation of mean free path  $\lambda$  with  $V_{\rm BG}$  for  $\Phi_{\rm B}=0$  and 0.1 eV.

resistance components: the intrinsic channel resistance  $R_{\rm ch}(V_{\rm BG})$  stemming from momentum breaking in the transport direction; the ballistic resistance  $R_{\rm Bal}(V_{\rm BG})$  arising due to finite electron group velocity; the specific contact resistivity  $R_{\rm c}$ . The resistance  $R_{\rm c}$  (per unit thickness), for a given Schottky barrier height of  $\Phi_{\rm B}$ , assuming only thermionic current for the sake of simplicity, can be written as  $^{18}$ 

$$R_{\rm c} = \frac{h^3}{4\pi e^2 m^* t k_{\rm B} T} \exp\left(\frac{e\Phi_{\rm B}}{k_{\rm B} T}\right) \tag{2}$$

where h,  $m^*$ , and t are Planck's constant, effective mass, and thickness of  $MoS_2$  monolayer, respectively. In order to ensure that  $2R_c$  (factor of 2 to account for resistances on both source and drain sides) is less than  $R_{TOT}$  for all values of  $V_{BG}$ ,  $\Phi_B$  has to be less than or equal to 0.1 eV. Thus, we use a barrier height of 0.1 eV for our nominal device. However, it must be noted that this value of  $\Phi_B$  represents an upper bound on the actual Schottky barrier height, as all other parasitic, gate-voltage-independent resistances have been lumped into metal—semiconductor (M-S) junction resistance.

We further analyze the  $I_{\rm D}{-}V_{\rm BG}$  characteristics, following the approach outlined in ref 19 to extract information about mobility ( $\mu_{\rm eff}$ ) and mean free path ( $\lambda$ ). Figure 2 shows the variation of  $\mu_{\rm eff}$  and  $\lambda$  with  $V_{\rm BG}$  for two different values of Schottky barrier



Figure 3. Device characteristics of monolayer MoS<sub>2</sub> transistors. (a)  $I_{\rm D}-V_{\rm G}$  characteristics at  $V_{\rm D}=0.05$  and 0.5 V on logarithmic (left axis) and linear scales (right axis). For the nominal device simulated, a maximum ON current as high as 1.6 mA/ $\mu$ m and a subthreshold swing (SS =  $\partial V_{\rm G}/\partial \log_{10}(I_{\rm D})$ ) close to 60 mV/decade are achieved. Drain-induced barrier lowering (DIBL) is as small as 10 mV/V even with very short channel length. (b)  $I_{\rm D}-V_{\rm D}$  characteristics at  $V_{\rm G}=0.2$ , 0.3, and 0.4 V. Beyond  $V_{\rm D}=0.4$  V, MoS<sub>2</sub> transistors show a clear saturation behavior with an output conductance ( $g_{\rm d}=\partial I_{\rm D}/\partial V_{\rm D}$ ) of 21, 51, and 133  $\mu$ S/ $\mu$ m at  $V_{\rm G}=0.2$ , 0.3, and 0.4 V, respectively. (c)  $I_{\rm ON}$  versus  $I_{\rm ON}/I_{\rm OFF}$  for  $V_{\rm D}=0.4$ , 0.5, and 0.6 V. With  $V_{\rm D}=0.5$  V,  $I_{\rm ON}$  is as high as 500  $\mu$ A/ $\mu$ m with 4 orders of magnitude in ON-OFF current ratio. For the same ON current,  $I_{\rm ON}/I_{\rm OFF}>10^{5}$  can be achieved with 0.6 V of drain voltage. (d) Transconductance ( $g_{\rm m}=\partial I_{\rm D}/\partial V_{\rm G}$ ) vs  $V_{\rm G}$  at  $V_{\rm D}=0.5$  V. The  $g_{\rm m}$  is as large as 4.4 mS/ $\mu$ m for the nominal device within the voltage range considered in this study. (e)  $I_{\rm D}-V_{\rm G}$  characteristics for a long channel device with  $I_{\rm ch}=500$  nm is projected (dashed line) by  $I_{\rm proj}=I_{\rm bal}\times\lambda_{\rm max}/(I_{\rm ch}+\lambda_{\rm max})$ , where  $I_{\rm bal}$  is the ballistic current adopted from our simulation results (solid line),  $I_{\rm proj}$  is the projected current for larger size of devices taking scattering into account,  $\lambda_{\rm max}$  is peak mean free path, and  $I_{\rm ch}$  is channel length. (f) Variation of  $I_{\rm ON}$  (defined at  $I_{\rm CON}=0.4$  V) with channel length. As  $I_{\rm ch}=0.4$  increases, current is decreased since carriers are exposed to greater number of scattering events. When  $I_{\rm ch}=100$  nm, projected current is one-fifth of the ballistic current. The inset shows a similar plot for peak  $g_{\rm m}$ .

heights of  $\Phi_B=0$  and 0.1 eV. The extracted peak mobility is about 220 and 350 cm²/(V s) for  $\Phi_B=0$  and 0.1 eV, respectively. The corresponding mean free paths are 15 and 22 nm. We choose the gate length to be 15 nm so that the device operates away from the diffusive limit where the transport is predominantly limited by scattering, thereby making our performance projections realistic. An underlap of 2 nm each on the source and the drain sides is introduced to reduce the fringe capacitances without significantly increasing the series resistance.

The key device characteristics of MoS<sub>2</sub> transistors are shown in Figure 3. The transfer characteristics reveal that the maximum current  $(I_{\text{max}})$  is  $\sim$  1.6 mA/ $\mu$ m. However, we note that in the case of reflectionless contacts,  $I_{\text{max}}$  can be even larger due to the nonparabolicity present in the conduction band of monolayer  $MoS_2$  wherein a satellite valley along  $K \rightarrow \Gamma$ , which is only about  $3k_{\rm B}T$  above the conduction band minima, contributes to enhanced density-of-states than what is predicted from our parabolic band approximation. The maximum-minimum current ratio  $(I_{\text{max}}/I_{\text{min}})$  can be more than 10 orders of magnitude ignoring the gate leakage (Figure 3a). Gate-induced drain leakage (GIDL), one of the main leakage mechanisms that limits  $I_{\min}$  in a conventional metal—oxide—semiconductor (MOS) geometry, is significantly less in the case of an MoS<sub>2</sub> transistor than in its Si counterpart due to the larger band gap, and hence a smaller gate voltage can, in principle, further reduce  $I_{min}$ . It must be noted, however, that a more rigorous analysis to predict the maximum achievable  $I_{\text{max}}/I_{\text{min}}$  requires a multiband Hamiltonian description (including valence band) to properly account for the effects of GIDL, which is beyond the scope of this study. In practical applications, what is more important than  $I_{
m max}/I_{
m min}$  is the ON-OFF current ratio  $(I_{\rm ON}/I_{\rm OFF})$ , where the voltage window between  $V_{
m ON}$  and  $V_{
m OFF}$  is the same as power supply voltage (i.e.,  $V_{\rm ON} - V_{\rm OFF}$  =  $V_{\rm D}$ ). Therefore,  $I_{\rm ON}/I_{\rm OFF}$  can be increased with a larger supply voltage for the same ON state current. For the simulated device structure ( $L_G = 15$  nm), drain-induced barrier lowering (DIBL) is negligibly small (10 mV/V, Figure 3a) due to excellent electrostatics of the 2-D geometry, and hence a larger  $V_{\rm D}$  can significantly increase the  $I_{\rm ON}/I_{\rm OFF}$  ratio for a given  $I_{\rm ON}$ (Figure 3c). In Figure 3b, we can also see that the output characteristics for a given  $V_{\rm G}$  show saturation beyond  $V_{\rm D}$  = 0.4 V with reasonably small output conductance ( $g_d = 21, 51$ , and 133  $\mu$ S/ $\mu$ m at  $V_G$  = 0.2, 0.3, and 0.4 V, respectively). We have also plotted the intrinsic device transconductance ( $g_{\rm m}$  =  $\partial I_{\rm D}/$  $\partial V_{\rm G}$ ) from the  $I_{\rm D} - V_{\rm G}$  data at  $V_{\rm D}$  = 0.5 V (Figure 3d). The maximum  $g_{\rm m}$  is 4.4 mS/ $\mu$ m, which is still less in comparison to the peak  $g_{\rm m}$  that can be achieved, as  $g_{\rm m}$  is monotonically increasing over the entire range of  $V_{\rm G}$  considered. This implies that at the largest gate voltage applied ( $V_{\rm G}$  = 0.7 V), additional gate voltage could still significantly enhance current and the consequent voltage drop would mainly be across the semiconductor and not across the gate oxide. Therefore, for the simulated EOT, the operation of a monolayer MoS<sub>2</sub> transistor is mainly



Figure 4. Capacitance—voltage (C-V) characteristics of monolayer MoS<sub>2</sub> transistors with  $V_{\rm D}=0$  V. (a) Gate capacitance  $(C_{\rm G}=\partial Q/\partial V_{\rm G})$  and quantum capacitance  $(C_{\rm Q}=\partial Q/\partial \psi_s)$  vs  $V_{\rm G}$ . Oxide capacitance  $(C_{\rm ox}=\kappa\varepsilon_0/t_{\rm ox})$  is shown by the dashed line.  $\partial\psi_s/\partial V_{\rm G}$  and  $\partial V_{\rm ox}/\partial V_{\rm G}$  are shown for the same  $V_{\rm G}$  range in the inset. (b) Density-of-states (DOS) vs energy (main panel) and surface plot of logarithmic local density-of-states (LDOS) (inset) at  $V_{\rm G}=0.1$  V. The  $C_{\rm Q}$  plot, shown in Figure 4a, can be understood herein by examining the average of DOS near the Fermi level (dotted line) at a given  $V_{\rm G}$ . For gate voltages up to about  $\sim$ 0.1 V, the gate controls the channel potential efficiently, as shown by a large  $\partial\psi_s/\partial V_{\rm G}$  in the inset of Figure 4a; subsequently  $C_{\rm Q}$  increases leading to a reduction in gate control. Due to the short channel length, the DOS is reminiscent of a 1-D material system, wherein the Van Hove peak at each sub-band energy is broadened to a different extent by the contact, rather than that of a 2-D system with constant DOS. (c) Conduction band  $(E_{\rm c})$  profile along the channel at  $V_{\rm D}=0.5$  V and  $V_{\rm G}=-0.2$  to 0.5 V in steps of 0.1 V. When a considerable  $V_{\rm D}$  is applied, gate control over the channel potential can still be efficient even at high gate voltages, indicating the operation close to quantum capacitance regime at the ON state.



**Figure 5.** Variation of subthreshold swing with oxide thickness. Subthreshold swing (SS) vs oxide thickness ( $t_{\rm ox}$ ) with gate length of  $L_{\rm G}=15$  and 30 nm. Subthreshold swing increases linearly with oxide thickness due to the decrease in  $C_{\rm ox}$ . For the same  $t_{\rm ox}$  the subthreshold swing of a device with longer gate length is significantly smaller due to immunity to short channel effects. The cross shows the experimental data from ref 7, implying that the gate efficiency, in practice, could be significantly improved by optimization.

dictated by its quantum capacitance and not by the oxide capacitance, the details of which will be discussed later.

The experimental characteristics in ref 7 show a maximum drive current of 2.5  $\mu$ A/ $\mu$ m for a device with a 500 nm long gate. Therefore, it is important to estimate the transfer characteristics for an optimized device with a similar gate length. Using the mean free path extracted from the experimental characteristics (Figure 2b), we calculate the corresponding  $I_{\rm D}-V_{\rm G}$  characteristics by multiplying the ballistic current from our simulations with  $\lambda_{\rm max}/(L_{\rm ch}+\lambda_{\rm max})$  (where  $\lambda_{\rm max}$  and  $L_{\rm ch}$  are the peak mean free path and channel length, respectively) as shown in Figure 3e. The maximum current obtained in this case is 69  $\mu$ A/ $\mu$ m. The difference between this value and the experimentally observed 2.5  $\mu$ A/ $\mu$ m is then likely due to the underlap series resistances, and a significant performance boost may be expected by reducing them.



Figure 6. (a) Logarithmic LDOS for  $\Phi_{\rm B}=0$  at  $V_{\rm G}=0.15$  V and  $V_{\rm D}=0.5$  V near the source. Conduction band profile is shown as a solid line. The channel electrostatic potential increases near the metal–semiconductor interface, resulting in an effective barrier of  $\sim$ 0.1 eV, due to metal-induced gap states (MIGS). The  $E_{\rm c}$  in the case of  $\Phi_{\rm B}=0.1$  eV is also plotted for comparison (dashed line). (b) DOS from the source—channel interface (x=1 Å) for  $\Phi_{\rm B}=0$  and 0.1 eV. At a given energy, the DOS is larger in the case of a smaller barrier height since carriers in metallic contact effectively see a smaller tunneling barrier.

The scaling behavior is similarly investigated by calculating  $I_{\rm ON}$  (defined at  $V_{\rm ON}$  = 0.4 V) and the peak  $g_{\rm m}$  as a function of  $L_{\rm ch}$  up to 100 nm, as shown in Figure 3f.

Capacitance—gate voltage  $(C-V_G)$  characteristics are explored by performing equilibrium simulations, i.e., with source

Table 1. Comparison of Key Device Performance Parameters of In<sub>0.7</sub>Ga<sub>0.3</sub>As Quantum-Well FET (ref 22) and Monolayer MoS<sub>2</sub> FET of Identical EOT and Channel Length

|                                                | $L_{\rm ch}~({\rm nm})$ | EOT (Å) | $I_{\rm max} \left({\rm mA}/\mu{\rm m}\right)$ | peak $g_{\rm m}$ (mS/ $\mu$ m) | $\max I_{\rm ON}/I_{\rm OFF}$ | min SS (mV/decade) |
|------------------------------------------------|-------------------------|---------|------------------------------------------------|--------------------------------|-------------------------------|--------------------|
| $In_{0.7}Ga_{0.3}As$ quantum-well FET (ref 22) | 75                      | 22      | 0.49                                           | 1.75                           | 312                           | 85                 |
| monolayer MoS <sub>2</sub> FET                 | 75                      | 22      | 0.19                                           | 0.3                            | $1.4 \times 10^{7}$           | 60                 |

and drain terminals grounded. The total gate capacitance  $C_G$  $(=\partial Q/\partial V_{\rm G})$  and the quantum capacitance  $C_{\rm Q}$   $(=\partial Q/\partial \psi_{\rm s})$  are numerically calculated from self-consistent charge Q and surface potential  $\psi_s$  obtained at each gate voltage (Figure 4a). Our numerical simulation results are in accordance with the analytical capacitance model (i.e.,  $1/C_G = 1/C_Q + 1/C_{ox}$ ) and the principle of voltage division.  $^{20}$  At low values of  $V_G$ , the total gate capacitance is very small due to negligible charge density in the device. However as  $V_G$  increases,  $C_O$ , which is a measure of the average density-of-states (DOS) at equilibrium Fermi level,<sup>20</sup> increases due to lowering of electrostatic potential in the channel (Figure 4b). The saturation in  $C_{\rm O}$  to a value smaller than the theoretically expected quantum capacitance in 2-D systems  $(C_Q^{2-D} = e^2 m^* / \pi \hbar^2)$  is a result of reduced DOS in the channel due to wave function reflections at the contacts. With increasing gate voltage, the gate efficiency  $(\partial \psi_s/\partial V_G)$  decreases (inset of Figure 4a) due to charge accumulation and screening effect. It must, however, be noted that under nonequilibrium conditions (with finite  $V_D$ ), the gate control over channel electrostatics is good even at large gate voltages  $(\partial (E_c/e)/\partial V_G = 0.78 \text{ at } V_G = 0.5$ V as shown in Figure 4c) due to lack of charge accumulation - an indication of the fact that the device operates closer to the quantum capacitance regime than to the oxide capacitance regime.

It has been reported that high- $\kappa$  dielectric plays an important role in improving the monolayer MoS2 mobility and hence the device performance.<sup>7</sup> The gate oxide can also significantly affect the switching abruptness, and therefore we examine the effect of oxide thickness on subthreshold swing (SS). As shown in Figure 5, SS increases linearly with oxide thickness, which is also predicted by the analytical subthreshold swing model in a conventional MOSFET.<sup>21</sup> At  $t_{ox}$  = 30 nm, our nominal device shows larger SS (79 mV/decade with  $L_G = 15$  nm) than the experimentally reported value for same oxide thickness (74 mV/decade with  $L_{\rm G}$  = 500 nm),<sup>7</sup> owing to short channel behavior. However, with  $L_{\rm G}$  = 30 nm, SS improves considerably due to the suppression of short channel effects (71 mV/decade for  $t_{ox}$  = 30 nm). Our simulations predict that the reported value of SS = 74 mV/decadecould be achieved at  $L_G = 23$  nm with  $t_{ox} = 30$  nm. We also analyzed the electrostatics of the exact geometry reported in ref 7 at OFF state by solving the Laplace equation and confirmed  $\partial (E_c/e)/\partial V_G$  to be equal to 1, implying that SS is expected to be 60 mV/decade. Hence we believe that there exists considerable room for optimization of gate dielectrics in MoS<sub>2</sub> transistors.

In fabricating monolayer  $MoS_2$  transistors, gold (Au) has been used to create an Ohmic contact—Schottky contact with negligible barrier height. Our simulations show that the Schottky barrier height can effectively increase (by up to 0.1 eV in the case of  $\Phi_B=0$  for intermediate values of gate voltage) as can be seen from the solid line in Figure 6a. This is due to enhanced polarization in the channel near the M–S junction owing to localized states induced by the metallic contact (known as metal-induced gap states (MIGS), which are clearly shown in the local density-of-states (LDOS) plot in Figure 6a). We note that this increase in barrier height, at identical gate voltage, is smaller for

contacts with larger Schottky barriers (dashed line in Figure 6a). This is due to the fact that a larger barrier reduces the tunneling probability for carriers, resulting in a smaller penetration of contact states into the channel, which is confirmed by the density-of-states plot at the source end for  $\Phi_{\rm B}=0$  and 0.1 eV shown in Figure 6b. However, this increase in effective barrier height vanishes as  $V_{\rm G}$  increases further.

With the above analysis of monolayer MoS<sub>2</sub> transistors, it is instructive to compare some of their key device performance parameters to those of some other nonconventional devices recently explored. Table 1 shows such a comparison with In<sub>0.7</sub>-Ga<sub>0.3</sub>As quantum-well FETs reported in ref 22. It is evident that the strength of MoS<sub>2</sub> transistors lies in their large band gap, which results in a significant  $I_{ON}/I_{OFF}$  and the excellent electrostatic integrity due to the 2-D nature of the system. However, with mobility lower than most of the III-V materials, MoS<sub>2</sub> transistors are more suited for low standby and operating power applications than for high performance where the experimental results from the former outperform the best theoretical predictions for the latter. The other most widely investigated 2-D system — a graphene transistor - has a very poor ON-OFF ratio despite a high ON current,<sup>23</sup> due to lack of band gap, making it very difficult to use them for digital applications. Fabrication of very narrow graphene nanoribbons with a finite band gap still remains a challenge and is prone to edge roughness resulting in a variation in energy gap. While there have been several reports of high-quality graphene nanoribbon transistors with large ON current, poor subthreshold swing and small ON-OFF ratio continue to remain problems. 24,25 Hence monolayer MoS<sub>2</sub> transistors, owing to their unique combination of exquisite electrostatic integrity and large band gap, can prove to be a better alternative for low power applications than several of the non-Si devices already explored.

To summarize, we have projected the ultimate scaling limit of monolayer MoS<sub>2</sub> transistors by performing self-consistent quantum transport simulations. The key features of MoS<sub>2</sub> transistors are (i) large  $g_{\rm m}$  (4.4 mS/ $\mu$ m) due to low DOS, (ii) significant  $I_{\rm max}/I_{\rm min}$  (>10<sup>10</sup>) owing to a large band gap, and (iii) excellent short channel behavior (DIBL ~10 mV/V and SS ~60 mV/decade) resulting from enhanced gate control. Along with these very good electrical characteristics, planarity of MoS<sub>2</sub> monolayer makes MoS<sub>2</sub> transistors one of the most viable candidates for future low power applications. Further, the properties of monolayer MoS<sub>2</sub> like high thermal stability, chemical inertness, transparency, flexibility, and relative inexpensiveness give MoS<sub>2</sub> transistors a unique advantage for several low cost electronic applications.

#### AUTHOR INFORMATION

# **Corresponding Author**

\*E-mail: sayeef@eecs.berkeley.edu.

## **Author Contributions**

<sup>†</sup>These authors contributed equally to this work.

### ■ ACKNOWLEDGMENT

This work was supported in part by faculty startup funding from University of California.

#### REFERENCES

- (1) Wilson, J. A.; Yoffe, A. D. Adv. Phys. 1969, 18, 193.
- (2) Gourmelon, E.; Lignier, O.; Hadouda, H.; Couturier, G.; Bernede, J. C.; Tedd, J.; Pouzed, J.; Salardenne, J. Sol. Energy Mater. Sol. Cells 1997, 46, 115.
- (3) Ho, W. K.; Yu, J. C.; Lin, J.; Yu, J. G.; Li, P. S. Langmuir 2004, 20, 5865.
- (4) Zong, X.; Yan, H.; Wu, G.; Ma, G.; Wen, F.; Wang, L.; Li, C. J. Am. Chem. Soc. 2008, 130, 7176.
- (5) Mak, K. F.; Lee, C.; Hone, J.; Shan, J.; Heinz, T. F. Phys. Rev. Lett. 2010, 105, 136805.
- (6) Splendiani, A.; Sun, L.; Zhang, Y.; Li, T.; Kim, J.; Chim, C. -Y.; Galli, G.; Wang, F. *Nano Lett.* **2010**, *10*, 1271.
- (7) Radisavljevic, B.; Radenovi, A.; Brivio, J.; Giacometti, V.; Kis, A. Nat. Nanotechnol. 2011, 6, 147.
- (8) Novoselov, K. S.; Jiang, D.; Schedin, F.; Booth, T. J.; Khotkevich, V. V.; Morozov, S. V.; Geim, A. K. *Proc. Natl. Acad. Sci. U.S.A.* **2005**, 102, 10451
- (9) Gmelin Handbook of Inorganic and Organometallic Chemistry, 8th ed.; Springer-Verlag: Berlin, 1995; Vol. B7.
  - (10) Frindt, R. F. J. Appl. Phys. 1965, 37, 1928.
  - (11) Lebègue, S.; Eriksson, E. Phys. Rev. B 2009, 79, 115409.
- (12) Bolotin, K. I.; Sikes, K. J.; Jiang, Z.; Klima, M.; Fudenberg, G.; Hone, J.; Kim, P.; Stormer, H. L. Solid State Commun. 2008, 146, 351.
  - (13) Chen, F.; Xia, J.; Ferry, D. K.; Tao, N. Nano Lett. 2009, 9, 2571.
  - (14) Jena, D.; Konar, A. Phys. Rev. Lett. 2007, 98, 136805.
  - (15) Szczyrbowski, J. J. Phys. D 1986, 19, 1257.
- (16) Molybdenum Disulfide; Kee Hing Cheung Kee Co., Ltd: Hong Kong, http://www.khck.hk/adgoogle/Molybdenum-Disulfide.htm (accessed July 12, 2011).
- (17) We use the value of dielectric constant corresponding to that of bulk  $MoS_2$ , as detailed electrical characterization data of monolayer  $MoS_2$  has not yet been reported.
- (18) Sze, S. M. *Physics of Semiconductor Devices*, 3rd ed.; Wiley-Interscience: Hoboken, NJ, 2006.
- (19) Lundstrom, M. Physics of Nanoscale MOSFETs, https://nano-hub.org/resources/5306
- (20) Datta, S. Quantum Transport: Atom to Transistor, 2nd ed.; Cambridge University Press: Cambridge and New York, 2005.
- (21) Taur, Y.; Ning, T. H Fundamentals of Modern VLSI Devices; Cambridge University Press: Cambridge and New York, 1998.
- (22) Radosavljevic, M.; Chu-Kung, B.; Corcoran, S.; Dewey, G.; Hudait, M. K.; Fastenau, J. M.; Kavalieros, J.; Liu, W. K.; Lubyshev, D.; Metz, M.; Millard, K.; Mukherjee, N.; Rachmady, W.; Shah, U.; Chau, Robert *IEDM Tech. Dig.* **2009**, 319.
  - (23) Schwierz, Frank Nat. Nanotechnol. 2010, 89, 487.
- (24) Wang, X.; Ouyang, Y.; Li, X.; Wang, H.; Guo, J.; Dai, H. Phys. Rev. Lett. 2008, 100, 206803.
- (25) Liao, L.; Bai, J.; Cheng, R.; Lin, Y.-C.; Jiang, S.; Huang, Y.; Duan, X. Nano Lett. 2010, 10, 1917.