# On the Temperature and Voltage Dependence of Short-Term Negative Bias Temperature Stress

Ph. Hehenberger\*, P.-J. Wagner\*, H. Reisinger° and T. Grasser\*

\* Institute for Microelectronics, TU Wien, A-1040 Wien, Austria

Christian Doppler Laboratory for TCAD at the Institute for Microelectronics, TU Wien, A-1040 Wien, Austria

° Infineon Technologies, D-81739 München, Germany

**Abstract**: Initial NBTI degradation is often explained by elastic hole trapping, which also considerably distorts long-term measurements. In order to clarify this issue, short-term NBT stress measurements are performed using different temperatures, stress voltages and oxide thicknesses. The data show a clear temperature activation and a superlinear voltage dependence, thereby effectively ruling out elastic hole tunneling. Rather, our data support an explanation based on a thermally activated hole capture mechanism.

#### INTRODUCTION

Degradation of transistor device parameters, such as the threshold voltage $V_{\mathrm{TH}}$ and the mobility, attracted the attention of the semiconductor industry many decades ago. When the gate is biased at elevated temperature while the remaining transistor contacts are kept grounded, negative bias temperature instability (NBTI) [1,2] is observed. The conventional explanation attributes the resulting degradation to elastic hole trapping due to tunneling carrier exchange with the substrate (initial degradation) and to the creation of interface states (long-term degradation) [3,4]. While [3,4] claim that processes on short time scales show a negligible temperature dependence, our latest results support a thermally activated tunneling mechanism [5]. To better understand the mechanisms underlying short-term NBTI degradation [6], an extensive study of the short stress time behavior in the range from $\mu{\rm s}$ to s is necessary. Unfortunately, due to noise, accurate measurements on these time scales are difficult [7, 8].
In particular, the noise in the $\mu{\rm s}$ regime makes it difficult to extract information on the smallest time constants contributing to the degradation. We therefore use a refined measurement procedure and collect a large dataset of stress measurements covering different temperatures, voltages, and oxide thicknesses.

#### I. SAMPLES USED AND STRESS CONDITIONS

pMOSFETs from a standard 90 nm CMOS process with plasma-nitrided oxide (around $6\,\%$ of nitrogen) were used. Two thin-oxide devices $(t_{\rm ox}=1.8\,{\rm nm},\,2.2\,{\rm nm})$ with geometry $W/L=10\,\mu{\rm m}/0.12\,\mu{\rm m}$ and one thicker-oxide device $(t_{\rm ox}=5\,{\rm nm})$ with $W/L=10\,\mu{\rm m}/0.24\,\mu{\rm m}$ were investigated. The devices were stressed with gate voltages $V_{\rm G,str}$ of $-1.75\,{\rm V},\,-2.00\,{\rm V},\,-2.25\,{\rm V},$ and $-2.50\,{\rm V}$ at temperatures of $25\,^{\circ}{\rm C},\,75\,^{\circ}{\rm C},\,125\,^{\circ}{\rm C},$ and $175\,^{\circ}{\rm C}.$

#### II. MEASUREMENT EQUIPMENT AND SETUP

State-of-the-art equipment does not meet the combined resolution and measurement speed requirements of NBTI assessment. Instruments either meet (and exceed) the required accuracy but are too slow to capture the fast NBTI degradation transients (e.g. parameter analyzers), or deliver the necessary time resolution but are limited by their inherently coarse amplitude resolution (e.g. digital storage oscilloscopes, DSO). Since in the latter case the amplitude resolution can be enhanced by averaging, while in the former there is no remedy for a measurement that is too slow, we use a DSO to record multiple NBTI processes and average them. Care has to be taken to meet the precondition of proper averaging, namely to record the *same* process many times. Only in this way is the measurement noise reduced while the 'hidden' deterministic process is reproduced without introducing systematic errors.
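The effect of averaging repeated recordings can be illustrated with a minimal Python sketch. It is not the measurement code used here: the signal amplitude `b`, the noise level `sigma`, and all function names are assumptions chosen for illustration, and the noise is taken to be Gaussian and uncorrelated between recordings. Under these assumptions, averaging $N$ recordings of the same deterministic process lowers the RMS noise by roughly $\sqrt{N}$.

```python
import math
import random
import statistics


def noisy_trace(times, rng, b=0.01, t0=2e-6, sigma=0.005):
    """One simulated recording: log-time signal plus Gaussian noise (assumed values)."""
    return [-b * math.log10(t / t0) + rng.gauss(0.0, sigma) for t in times]


def average_traces(traces):
    """Point-by-point mean over repeated recordings of the same process."""
    return [statistics.fmean(samples) for samples in zip(*traces)]


def rms_error(trace, times, b=0.01, t0=2e-6):
    """RMS deviation of a trace from the noise-free signal."""
    return math.sqrt(statistics.fmean(
        (y + b * math.log10(t / t0)) ** 2 for y, t in zip(trace, times)))


rng = random.Random(0)                               # fixed seed for repeatability
times = [2e-6 * 10 ** (k / 20) for k in range(61)]   # three decades, log-spaced
single = noisy_trace(times, rng)
averaged = average_traces([noisy_trace(times, rng) for _ in range(1000)])

print("RMS error, single recording:", rms_error(single, times))
print("RMS error, 1000 averages   :", rms_error(averaged, times))
```

The sketch only works because every simulated recording contains the identical deterministic signal; if the device had not fully relaxed between pulses, the average would be biased rather than merely noisy.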
In our measurements this is ensured by very short stress times and a very low duty cycle, so that full relaxation is achieved in between stresses.

The basic setup is described in [9] and uses a Hewlett-Packard 81101A pulse generator and a Tektronix TDS5034B digital storage oscilloscope. It was extended for short-term stress measurements by a fast gate-pulse mode and a differential amplifier. To obtain the required resolution of better than $10^{-4}$ in $I_{\rm D}$, the equipment was designed to deliver a settled gate stress voltage $V_{\rm G,str}$ within $\pm 1\,{\rm mV}$ in $1\,\mu{\rm s}$. For this reason, a battery with a passive voltage divider and a fast electronic switch are used. As a second measure to suppress noise, the $I_{\rm D}$ of the device under test (DUT) is compared to a reference current prior to digitization, so that only the difference has to be captured, which allows a higher resolution.

According to [9] the degradation of mobility is small for stress times below $10\,\mathrm{s}$. Also, in the technologies investigated, the impact of the gate current on the measurement results was found to be negligible. The recorded $I_\mathrm{D}$ shift is thus regarded as being due to a $V_\mathrm{TH}$ shift alone.

#### III. PULSE SETTINGS

In order to automatically perform the required averaging of the recorded $I_{\rm D}$, rectangular gate pulses were used for short-term NBTI stresses in the range of $1\,\mu{\rm s}$ to $1\,{\rm s}$. Each gate pulse was followed by a recovery phase about 100 times longer, which allowed for full recovery of the built-up degradation [10]. Consequently, we use a pulse train consisting of $N$ pulses with $t_{\rm lead}=t_{\rm trail}=5\,{\rm ns}$, a width $t_{\rm W}=t_{\rm str}$ and a period of $t_{\rm P}=100\,t_{\rm str}$. The number of pulses $N$ is limited only by the overall available measurement time $t_{\rm M}=Nt_{\rm P}$.
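The timing relations of the pulse train can be written out as a short Python sketch. The function name `pulse_train` and its parameter names are hypothetical; the relations $t_{\rm P}=100\,t_{\rm str}$ and $t_{\rm M}=Nt_{\rm P}$ are those stated in the text, and the example values $t_{\rm str}=1\,{\rm ms}$ and $N=1000$ are one of the settings used in this work.

```python
def pulse_train(t_str, n_pulses, period_factor=100):
    """Timing of a rectangular stress pulse train.

    t_str         stress pulse width t_W = t_str in seconds
    n_pulses      number of pulses N that are averaged
    period_factor ratio t_P / t_str (100 in the text)
    """
    t_p = period_factor * t_str      # period t_P
    t_rec = t_p - t_str              # recovery time between two pulses
    t_m = n_pulses * t_p             # total measurement time t_M = N * t_P
    duty = t_str / t_p               # duty cycle
    return t_p, t_rec, t_m, duty


t_p, t_rec, t_m, duty = pulse_train(t_str=1e-3, n_pulses=1000)
print(f"t_P = {t_p:g} s, recovery = {t_rec:g} s, "
      f"t_M = {t_m:g} s, duty cycle = {duty:.1%}")
```

For a fixed measurement time, a longer stress pulse directly reduces the number of pulses available for averaging, which is the trade-off the pulse settings have to balance.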
A compromise has to be found between a recovery time in between pulses ( $\approx t_{\rm P}$ ) that lets the device fully recover and a reasonably high $N$ that provides sufficient measurement accuracy through averaging. Since the oscilloscope uses a linear time scale, but NBTI stress must be assessed on a logarithmic scale spanning at least 3 to 4 decades, we had to split the stress time of $1\,\mathrm{s}$ into three sequences. The corresponding values of $t_{\mathrm{str}}$, $t_{\mathrm{P}}$, and $N$ are shown below, together with the resolution, which also equals the minimum stress time of the respective stress sequence.

| Sequence | $t_{\rm W}=t_{\rm str}$ | $t_{\rm P}$ | $N$ | Resolution |
|----------|-------------------------|-----------------|------|----------------------|
| 1 | $1\,\mathrm{ms}$ | $0.1\,\mathrm{s}$ | 1000 | $0.16\,\mu\mathrm{s}$ |
| 2 | $100\,\mathrm{ms}$ | $10\,\mathrm{s}$ | 10 | $16\,\mu\mathrm{s}$ |
| 3 | $1000\,\mathrm{ms}$ | $100\,\mathrm{s}$ | 5 | $160\,\mu\mathrm{s}$ |

In order to combine the three sequences into a single degradation curve with a maximum effective resolution from $1\,\mu{\rm s}$ to $1\,{\rm s}$, the three stress sequences are chosen to overlap for at least one decade of time. Since only differences of currents ( $I_{\rm D}$ ) are recorded, the overlap regions provide the information needed to align the sequences to a single stress characteristic. An example is displayed in Fig. 1. The offset is due to different DSO settings in each measurement sequence.

Fig. 1: Different DSO settings are responsible for the vertical offset. This has to be corrected to make the stress sequences coincide (**upper graph**). Merged stress sample using a log-fit and shifted to the reference time $t_{0,\mathrm{ref}}=2\,\mu\mathrm{s}$ (**lower graph**).

#### IV. DATA EXTRACTION ESSENTIALS

Since both the measurement equipment and the pulse generator are operated at their limits, a few points have to be carefully considered during the final data extraction.

## A. Gate Voltage Criteria

Monitoring $V_{\rm G}$ gives insight into the time evolution of the actual waveform, which has to be checked carefully [11]. As shown in Fig. 2, the pulse is affected by the transient behavior and a possible overshoot due to the non-instantaneous switching between $V_{\rm G,rel}$, which is applied in between the pulses, and $V_{\rm G,str}$. Therefore, after the transition regime, a steady-state value of $V_{\rm G,str}$ is determined and set as $V_{\rm G,ref}$ (usually taken at $t_{\rm str}/2$ ). Then an error criterion, i.e. $|V_{\rm G,str}-V_{\rm G,ref}|/|V_{\rm G,ref}| \leq \epsilon$, is employed. Since noise is apparent in all three sequences, $\epsilon$ has to be chosen large enough not to disrupt the pulse, usually in the range of $\epsilon \approx 0.3\,\%$. Starting at the time where $V_{\rm G,ref}$ is taken and moving both to earlier times (towards the beginning of the pulse) and to later times (towards the end of the pulse) sets the new borders of our accepted stress time $t_{\rm str}$.

## B. Offset

Acquisition of $25\,\mathrm{kSamples}$ yields 3 to 4 usable decades in time for each sequence. The combined sequences result in 5 to 6 decades in time, with a possibly too large deviation of $V_{\mathrm{G,str}}$ from $V_{\mathrm{G,ref}}$ during the first decade. In the remaining decades the data can be fit either by a logarithmic time dependence

$$\frac{\Delta I_{\rm D}(t_{\rm str})}{I_{\rm D0}} = \frac{I_{\rm D}(t_{\rm str}) - I_{\rm D0}}{I_{\rm D0}} = -B \log_{10}(t_{\rm str}/t_{0,\rm ref}) \tag{1}$$

with $I_{\rm D0}=I_{\rm D}(t_{0,\rm ref})$, or by a power law $-A\,(t_{\rm str}/t_{0,\rm ref})^n$ with a very small exponent $n\approx 0.04$. $I_{\rm D0}$ is obtained at stress level with a delay $t_{0,\rm ref}$ and is thus *not* equal to $I_{\rm D}(0)$ [12], which results in an offset of the relative degradation, see Fig. 2.

Fig. 2: **Top**: The inset shows the gate stress pulses. The main graph is enlarged to make the transient and the overshoot visible.
This is due to the limited switching speed of the oscilloscope when moving from $V_{\rm G,rel}$ to $V_{\rm G,str}$ and back. The error criterion employed for this reason ( $|V_{\rm G,str}-V_{\rm G,ref}|/|V_{\rm G,ref}| \leq \epsilon$ ) is displayed for $\epsilon=0.3\,\%$. The first (last) proper values of the pulse for each sequence are marked by circles (squares). The noise is apparent in all three sequences and prevents the use of excessively small values of $\epsilon$. **Bottom**: Different reference times $t_{0,\rm ref}$ result in different degradation. It can be seen that for $t_{0,\rm ref}=50\,\mu{\rm s}$ about $25\,\%$ of $\Delta I_{\rm D}/I_{\rm D0}$ is missed. On the other hand, too short $t_{0,\rm ref}$ are not reasonable and result in a spurious shift caused by a measurement signal that has not yet settled $(t_{0,\rm ref}=0.2\,\mu{\rm s},1\,\mu{\rm s})$. Compare with top figure.

## C. Final Setting of Parameters

The finally extracted data are somewhat sensitive to the values of the parameters $t_{0,\mathrm{ref}}$ and $\epsilon$. For $\epsilon$ a value of $0.3\,\%$ is used. As can be seen in Fig. 2, a $t_{0,\mathrm{ref}}$ slightly after the first accepted value should be selected to eliminate the influence of both the first noisy points and the delay time. Hence, $t_{0,\mathrm{ref}}=2\,\mu\mathrm{s}$ appears to be a reasonable compromise.

#### V. DISCUSSION

In order to understand the microscopic physics behind the short-time degradation, the temperature, voltage, and oxide-thickness dependence of the prefactor $B$ is investigated.

## A. Temperature Scaling

The temperature dependence of $\Delta I_{\rm D}/I_{\rm D0}$ is displayed in Fig. 3 for the thinnest device $(t_{\rm ox}=1.8~{\rm nm})$ with $V_{\rm G,str}=-2.25~{\rm V}$. In the range $25\,^{\circ}{\rm C}$ to $125\,^{\circ}{\rm C}$, the data can be perfectly fit by a logarithmic time dependence (differences would not be visible in the plots).
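A logarithmic fit of this kind can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the extraction routine used for the figures: the function name `fit_log_prefactor` is hypothetical, the data are synthetic, and the fit is the one-parameter least-squares solution of $\Delta I_{\rm D}/I_{\rm D0} = -B \log_{10}(t_{\rm str}/t_{0,\rm ref})$, which passes through zero at $t_{\rm str}=t_{0,\rm ref}$.

```python
import math


def fit_log_prefactor(t_str, rel_shift, t0_ref=2e-6):
    """Least-squares prefactor B for rel_shift = -B * log10(t_str / t0_ref).

    t_str     list of stress times in seconds
    rel_shift list of relative drain-current shifts at those times
    t0_ref    reference time in seconds (2 us, as chosen in the text)
    """
    x = [math.log10(t / t0_ref) for t in t_str]
    sxx = sum(xi * xi for xi in x)
    sxy = sum(xi * yi for xi, yi in zip(x, rel_shift))
    return -sxy / sxx                # slope of a line through the origin, sign flipped


# Synthetic, noise-free example with an assumed prefactor of 0.004 per decade.
times = [2e-6 * 10 ** (k / 4) for k in range(1, 21)]   # five decades
data = [-0.004 * math.log10(t / 2e-6) for t in times]
b_fit = fit_log_prefactor(times, data)
print(f"fitted B = {b_fit:.6f} per decade")
```

Forcing the line through the origin reflects the definition $I_{\rm D0}=I_{\rm D}(t_{0,\rm ref})$; with noisy data at $t_{0,\rm ref}$, a two-parameter fit with a free offset would be an alternative choice.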
A slight deviation is observed for higher temperatures for $t_{\rm str}>10~{\rm ms}$, possibly due to the onset of the mechanism responsible for the long-time power-law behavior with a larger power-law exponent $n\approx 0.12$. Apart from that, the curves for different temperatures can be scaled well to the data at $T_{\rm ref}=175\,^{\circ}{\rm C}$, as shown by the dotted lines in Fig. 3; the scaling factors are marked by arrows.

Fig. 3: **Top**: The temperature ($25\,^{\circ}{\rm C}$, $75\,^{\circ}{\rm C}$, $125\,^{\circ}{\rm C}$ and $175\,^{\circ}{\rm C}$) and voltage dependence ( $-1.75\,\mathrm{V}, -2.00\,\mathrm{V}, -2.25\,\mathrm{V}$ and $-2.50\,\mathrm{V}$ ) of the $\Delta I_\mathrm{D}/I_\mathrm{D0}$ degradation. Scaling to the dotted lines works perfectly for the latter case, while different temperatures lead to a small deviation for $t_\mathrm{str}>10\,\mathrm{ms}.$ The scaling factors are also given. **Bottom**: $\Delta I_\mathrm{D}/I_\mathrm{D0}$ for different oxide thicknesses (1.8 nm, 2.2 nm and 5.0 nm) can be scaled as well. Only the thick device is affected by noise due to the low degradation. The graph at the very bottom combines the three dependencies.

## B. Voltage Scaling

The voltage dependence is depicted for $t_{\rm ox}=1.8\,{\rm nm}$ and $T=175\,^{\circ}{\rm C}$ (Fig. 3). Scaling to $V_{\rm G,ref}=-2.50\,{\rm V}$ leads to perfect congruence. Again, the scaling factors are shown next to their corresponding values.

## C. Oxide Thickness Scaling

Due to the relatively low $\Delta I_{\rm D}/I_{\rm D0}$ degradation for $t_{\rm ox}=5.0~{\rm nm}$ resulting from the low-voltage stress conditions studied here (small $E_{\rm ox}$ ), noise seriously limits the accuracy. Nonetheless, good scalability for devices with different $t_{\rm ox}$ (1.8 nm, 2.2 nm, and 5.0 nm) can be obtained (Fig. 3).

## D. Extracted Prefactors

The prefactors $B$ of the log-fit for various $t_{\rm ox}$, $V_{\rm G,str}$, and $T$ are displayed in Fig. 4.
In agreement with previous experiments, it is observed that low $V_{\rm G,str}$ results in small temperature activation, while stress voltages exceeding the operating voltage give a notable activation energy of about $0.1\,{\rm eV}$. Note that this value is in agreement with activation energies extracted at long stress times [6]. Fitting the data to a power law $-A\left(t_{\rm str}/t_{0,\rm ref}\right)^n$ results in an exponent $n\approx 0.04$ for short-term stress, roughly a third of the often reported $n\approx 0.12$ of the long-term behavior.

Fig. 4: **Top**: Arrhenius plot of the prefactor $B$ of the log-fit, extracted from three different $t_{\rm ox}$ for different $V_{\rm G,str}$. An activation energy $E_{\rm A}$ of about 0.1 eV is obtained for $t_{\rm ox}=1.8\,\rm nm$ and $t_{\rm ox}=2.2\,\rm nm$, represented by the black solid line. Degradation for the $t_{\rm ox}=5.0\,\rm nm$ devices was too noisy because of the too low $E_{\rm ox}\sim (V_{\rm G,str}-V_{\rm TH})/t_{\rm ox}$. Scale is equal for all plots. **Bottom**: Prefactor $B$ of the log-fit plotted for different $t_{\rm ox}$ and different temperatures $T$. While $t_{\rm ox}=1.8\,\rm nm$ shows a clear temperature activation, $t_{\rm ox}=5.0\,\rm nm$ does not, due to the low electric stress field. For $t_{\rm ox}=2.2\,\rm nm$ the transition of the temperature dependence is visible at $T=175\,\rm ^{\circ}C$ between $V_{\rm G,str}=-2.00\,\rm V$ and $V_{\rm G,str}=-2.25\,\rm V$.

The lower graph of Fig. 4 shows the prefactor $B$ plotted for different $t_{\rm ox}$ and different temperatures $T$. In the devices with $t_{\rm ox}=1.8$ nm, all the stress voltages are above the operating voltage and result in a marked temperature activation. For $t_{\rm ox}=2.2$ nm the transition from no temperature activation to temperature activation is observed for $T=175\,^{\circ}{\rm C}$ between $V_{\rm G,str}=-2.00\,{\rm V}$ and $V_{\rm G,str}=-2.25\,{\rm V}$.
For the thickest oxides used in this study, $t_{\rm ox}=5.0$ nm, the applied stress fields are very small, resulting in no temperature activation. All these dependencies support a thermally activated tunneling mechanism [5] rather than elastic (and thus temperature-independent) hole tunneling [3].

#### VI. CONCLUSIONS

Ultra-fast short-time NBT stress measurements from the $\mu{\rm s}$ to s regime using different temperatures, stress voltages, and oxide thicknesses have been performed. In this initial degradation phase, the data can be fit well by a logarithmic time dependence [8, 9, 13]. Alternatively, a power law with an exponent considerably smaller ( $n \approx 0.04$ ) than generally observed during long-time stress ( $n \approx 0.12$ ) could be used. On the other hand, the extracted activation energy of about $0.1\,\mathrm{eV}$ is compatible with the values typically obtained during long-time stress [6]. Finally, the extracted temperature and voltage dependencies rule out elastic and thus temperature-independent hole tunneling, as proposed by [3, 4], as being responsible for short-time NBT degradation. Another possible explanation could involve an inelastic tunneling process [5].

#### ACKNOWLEDGMENT

The research leading to these results has received funding from the European Community's Seventh Framework Programme under grant agreement n° 216436 (project ATHENIS).

#### REFERENCES

- [1] V. Huard, M. Denais, and C. Parthasarathy, "NBTI Degradation: From Physical Mechanisms to Modelling," *Microelectronics Reliability*, vol. 46, no. 1, pp. 1–23, 2006.
- [2] D.K. Schroder and J.A. Babcock, "Negative Bias Temperature Instability: Road to Cross in Deep Submicron Silicon Semiconductor Manufacturing," *J. Appl. Phys.*, vol. 94, no. 1, pp. 1–18, Jul. 2003.
- [3] M. Denais, V. Huard, C. Parthasarathy, G. Ribes, F. Perrier, N. Revil, and A. Bravaix, "Interface Trap Generation and Hole Trapping under NBTI and PBTI in Advanced CMOS Technology with a 2-nm Gate Oxide," *T-DMR*, vol.
4, pp. 715–722, 2004.
- [4] S. Mahapatra, V.D. Maheta, A.E. Islam, and M.A. Alam, "Isolation of NBTI Stress Generated Interface Trap and Hole-Trapping Components in PNO p-MOSFETs," *T-ED*, vol. 56, no. 2, pp. 236–242, Feb. 2009.
- [5] T. Grasser, B. Kaczer, W. Gös, Th. Aichinger, Ph. Hehenberger, and M. Nelhiebel, "A Two-Stage Model for Negative Bias Temperature Instability," in *IRPS*, 2009.
- [6] T. Grasser and B. Kaczer, "Evidence that Two Tightly Coupled Mechanisms are Responsible for Negative Bias Temperature Instability in Oxynitride MOSFETs," *T-ED*, 2009, to appear.
- [7] V.D. Maheta, E.N. Kumar, S. Purawat, C. Olsen, K. Ahmed, and S. Mahapatra, "Development of an Ultrafast On-the-Fly I<sub>DLIN</sub> Technique to Study NBTI in Plasma and Thermal Oxynitride p-MOSFETs," *T-ED*, vol. 55, no. 10, pp. 2614–2622, Oct. 2008.
- [8] J.F. Zhang, Z. Ji, M.H. Chang, B. Kaczer, and G. Groeseneken, "Real Vth Instability of pMOSFETs under Practical Operation Conditions," in *IEDM*, 2007, pp. 817–820.
- [9] H. Reisinger, U. Brunner, W. Heinrigs, W. Gustin, and C. Schlünder, "A Comparison of Fast Methods for Measuring NBTI Degradation," *T-DMR*, vol. 7, no. 4, pp. 531–539, Dec. 2007.
- [10] S. Rangan, N. Mielke, and E.C.C. Yeh, "Universal Recovery Behavior of Negative Bias Temperature Instability," in *IEDM*, 2003, pp. 341–344.
- [11] B. Kaczer, T. Grasser, Ph.J. Roussel, J. Martin-Martinez, R. O'Connor, B.J. O'Sullivan, and G. Groeseneken, "Ubiquitous Relaxation in BTI Stressing New Evaluation and Insights," in *IRPS*, 2008.
- [12] C. Shen, M.-F. Li, C.E. Foo, T. Yang, D.M. Huang, A. Yap, G.S. Samudra, and Y.-C. Yeo, "Characterization and Physical Origin of Fast Vth Transient in NBTI of pMOSFETs with SiON Dielectric," in *IEDM*, 2006.
- [13] H. Reisinger, O. Blank, W. Heinrigs, W. Gustin, and C.
Schlünder, "A Comparison of Very Fast to Very Slow Components in Degradation and Recovery due to NBTI and Bulk Hole Trapping to Existing Physical Models," *T-DMR*, vol. 7, no. 1, pp. 119–129, Mar. 2007.