ABSTRACT Title of Thesis: PART SELECTION AND MANAGEMENT BASED ON RELIABILITY ASSESSMENT FOR DIE-LEVEL FAILURE MECHANISMS Lokesh Sangepu, Master of Science, 2023 Thesis Directed By: Dr. Diganta Das, Department of Mechanical Engineering Electronic part manufacturers often provide reliability values in metrics such as Mean Time Between Failures (MTBF) and its inverse, Failures in Time (FIT). These metrics assume a constant failure rate and do not account for damage accumulation or wear-out phenomena, making the part selection and management based on this information meaningless. This thesis will report on the challenges associated with manufacturers' avoidance of sharing critical part information and how insufficient information hampers decision-making for part selection. The thesis uses four die-level failure mechanisms (Electromigration, Time-Dependent Dielectric Breakdown, Hot Carrier Injection, and Negative Bias Temperature Instability) as demonstration cases. It investigates the extent to which industry-published documents can be used to obtain the data necessary to simulate these mechanisms. It will report on methods of selecting an appropriate failure model based on the part technology level and identifying the required parameters for estimating the part's time to failure. Various scattered part information sources, literature, and industry-published documents may include the input parameters of failure models. The thesis provides insights into the complexity of understanding these information sources and various methods to obtain the required parameters to estimate the time to failure distributions. The methodology considers the susceptibility of parts to die-level failure mechanisms and compares components for reliability. A simulation template that facilitates practical implementation by enabling designers, engineers, and procurement teams to make informed decisions while selecting electronic parts for specific applications is introduced. The research findings and methodology presented provide valuable insights for users to improve the reliability and performance of electronic systems through effective part selection. PART SELECTION AND MANAGEMENT BASED ON RELIABILITY ASSESSMENT FOR DIE-LEVEL FAILURE MECHANISMS By Lokesh Sangepu Thesis submitted to the Faculty of the Graduate School of the University of Maryland, College Park, in partial fulfillment of the requirements for the degree of Master’s of Science 2023 Advisory Committee: Dr. Diganta Das, Chair Professor Francis Patrick McCluskey Professor Peter Sandborn ii Acknowledgments Several different groups of people helped me develop and formulate this thesis, and I would like to express my sincere gratitude to all those who have contributed to the completion of this thesis. First and foremost, I sincerely thank my thesis advisor Dr. Diganta Das for his invaluable guidance, support, and expertise throughout my time at the Center for Advanced Life Cycle Engineering and the University of Maryland. His insightful feedback, encouragement, and unwavering commitment have been instrumental in shaping this thesis. Secondly, I would like to thank my committee members (Professor Francis Patrick McCluskey, and Professor Peter Sandborn) for their time as I defended my thesis and for providing feedback as necessary. Thirdly, I would like to thank the faculty in my research group (Professor Michael Pecht, Dr. Michael Osterman, Dr. Michael Azarian, and Dr. Robert Utter) for their teaching dedication and valuable insights shared during academic discussions. Their expertise and knowledge have greatly enriched my understanding of the subject matter. Additionally, I would like to extend my gratitude to the consortium of companies that collaborate with the Center for Advanced Life Cycle Engineering. In personal, I would like to express my appreciation to the individuals within these companies with whom I had the privilege of working. Their patience, friendly demeanor, and willingness to share their expertise have been immensely valuable. Although confidentiality obligations prevent me from disclosing their names, I hold them in the highest esteem and am sincerely grateful for their contributions to this thesis. I am indebted to the staff and personnel at the University of Maryland and particularly in CALCE for their cooperation and assistance in providing the necessary resources and facilities iii for carrying out my thesis. Their support has been instrumental in the successful completion of this thesis. I also want to acknowledge the academic community and researchers in the field for their groundbreaking work and publications, which have served as a valuable foundation for this thesis. This thesis would not have been possible without the aforementioned individual's and organizations' contributions, support, and guidance. Their collective efforts have played a significant role in successfully completing this research endeavor. Last but certainly not least, I would like to thank my family: my mother, Sharda Kumari; my father, Sivaramaiah; my sister, Shamili and my dear friends who stood by me at each and every step. It is fair to say that this thesis would not have been possible without the contributions, support, and guidance of the aforementioned individuals and organizations. Their collective efforts have played a significant role in the successful completion of this endeavor. iv Contents Acknowledgments........................................................................................................................... ii List of Figures ............................................................................................................................... vii List of Tables ................................................................................................................................... x 1. Introduction: Comprehensive Idea of Part Manufacturers Avoiding Sharing Critical Information ..................................................................................................................................... 1 1.1. Part Assessment Process................................................................................................... 3 1.2. Problem with Part Reliability Information from the Manufacturer ................................. 4 1.2.1. Abuse of AEC-Q100 Specifications ......................................................................... 5 1.2.2. Problems with Reliability Testing ............................................................................. 8 1.2.3. Refusing to Divulge Information ............................................................................ 10 1.2.4. PPAP Availability .................................................................................................... 10 1.3. Potential Recommendations Based on Observations ..................................................... 11 2. Unmasking the Limitations of Industry-Published Documents on Die-Level Failure Mechanism Models Concerning Time-to-Failure Estimation ....................................................... 13 2.1. Literature Review of Failure Mechanisms and Models ................................................. 14 2.1.1. Electromigration ..................................................................................................... 15 2.1.2. Time-Dependent Dielectric Breakdown ................................................................. 19 2.1.3. Hot Carrier Injection ............................................................................................... 26 v 2.1.4. Negative Bias Temperature Instability .................................................................... 31 2.2. Observations and Limitations......................................................................................... 37 3. Process Technology Via Part Manufacturer Documents ....................................................... 38 3.1. What is Process Technology? ......................................................................................... 39 3.2. Benefits of Obtaining Process Technology .................................................................... 40 3.3. Sources to Obtain Process Technology .......................................................................... 43 3.4. Methodology to Obtain Process Technology ................................................................. 51 4. Part Reliability Assessment and Management Methodology ................................................ 55 5. Obtaining Input Parameter Information of Failure Models, Simulation, and Results ........... 61 5.1. Parameter Information of Failure Models ...................................................................... 62 5.1.1. Model Constants ..................................................................................................... 62 5.1.2. Geometric Parameters ............................................................................................. 65 5.1.3. Application Conditions ........................................................................................... 66 5.2. Simulation and Analysis ................................................................................................. 67 5.3. Simulation Results.......................................................................................................... 72 6. Summary of Approach, Contributions, and Future Work ...................................................... 78 6.1. Approach and Contributions .......................................................................................... 78 6.2. Future Work .................................................................................................................... 80 Appendices .................................................................................................................................... 83 vi Appendix A: Time to Failure Distribution MATLAB Script of EM ......................................... 83 Appendix B: Time to Failure Distribution MATLAB Script of TDDB E Model ..................... 84 Appendix C: Time to Failure Distribution MATLAB Script of TDDB 1/E Model .................. 85 Appendix D: Time to Failure Distribution MATLAB Script of HCI Model ............................ 86 Bibliography ................................................................................................................................. 88 vii List of Figures Figure 1: Part assessment process [6]. ............................................................................................ 4 Figure 2: Example of a part manufacturer reported reliability information provided as MTBF and FIT rates [8]. ................................................................................................................................... 5 Figure 3: Die fabrication reliability tests to be performed as per AEC-Q100 [9]. .......................... 7 Figure 4: Die reliability tests information from Texas Instruments PPAP document for part number TPS54560BQDDAQ1 [14]. ............................................................................................... 7 Figure 5: Wafer/die technology description provided in the PPAP document [14]. ..................... 10 Figure 6: Availability of PPAP documentation for Diodes Inc. MOSFETs [22]. ......................... 11 Figure 7: CMOS wearout die-level failure mechanisms [28]. ...................................................... 14 Figure 8: Illustration of void and hillock due to electromigration in interconnects [31]. ............. 15 Figure 9: Failure mechanism of electromigration [35]. ................................................................ 16 Figure 10: TDDB failure mechanism process steps [44]. ............................................................. 21 Figure 11: TDDB failure mechanism explained through gate current vs stress time plot [45]. ... 22 Figure 12: Cross-section of an n-channel MOSFET illustrating the generation and injection of hot carriers into the gate-oxide (Si𝑂𝑂2) [53]. ................................................................................. 27 Figure 13: Si – SiO2 interface [35]. .............................................................................................. 28 Figure 14: Drain end of the channel producing electron-hole pairs [54]. ..................................... 28 Figure 15: NBTI failure mechanism through Si/SiO2 interface [35]. ........................................... 33 Figure 16: Partial recovery of the gate oxide in NBTI failure mechanism [35]. .......................... 33 viii Figure 17: Cross-section of pMOS device under (a) NBTI stress conditions (b) in recovery [62]. ....................................................................................................................................................... 34 Figure 18: Physical representation of node size or process technology (L) [68]. ......................... 39 Figure 19: Using Moore's Law scaling to estimate node size and, in due course, estimate the effective gate length for the transistor. .......................................................................................... 41 Figure 20: Industry white paper providing information for the 1.2 Β΅m CMOS process family [73]. ............................................................................................................................................... 42 Figure 21: Process technology obtained through the original component manufacturer (OCM) website [78]. .................................................................................................................................. 44 Figure 22: Datasheets of various part manufacturers as information sources for process technology [79], [80], [81]. ........................................................................................................... 45 Figure 23: Process technology from user guide document [82]. .................................................. 45 Figure 24: Die technology parameters provided in a PPAP document. ........................................ 46 Figure 25: Process technology/design rule for a part obtained through the qualification report [83]. ............................................................................................................................................... 46 Figure 26: Process technology vs. year of introduction [85]. ....................................................... 48 Figure 27: Documents through which part manufacturers can report process technology. .......... 50 Figure 28: Methodology to obtain part process technology. ......................................................... 52 Figure 29: Example of procedure to validate datasheet. ............................................................... 53 Figure 30: Increment of risk associated with the information source of a part process technology. ....................................................................................................................................................... 54 ix Figure 31: Process for completing stress and damage modeling for reliability prediction [88]. .. 55 Figure 32: Part selection and management developed by CALCE [89]. ...................................... 56 Figure 33: Part reliability assessment and selection methodology [22]. ...................................... 57 Figure 34: Part-life estimation methodology for a failure mechanism (TDDB). ......................... 59 Figure 35: Part-life estimation methodology extended to four die-level failure mechanisms. ..... 59 Figure 36: GUI initial window. ..................................................................................................... 67 Figure 37: GUI window for electromigration. .............................................................................. 68 Figure 38: GUI window for TDDB E model. ............................................................................... 69 Figure 39: GUI window for TDDB 1/E model. ............................................................................ 69 Figure 40: GUI window for HCI failure models. ......................................................................... 70 Figure 41: Analysis tab to view the TTF distribution plotted based on the parameters provided in the data input tab. .......................................................................................................................... 71 Figure 42: GUI datasheets window to access previously stored part-related information. .......... 72 Figure 43: CDF Vs. TTF for EM at 1% parameter deviation range of P1. ................................... 74 Figure 44: Part selection methodology considering multiple failure mechanisms. ...................... 81 x List of Tables Table 1: Input Parameters for Black's Equation [23]. ................................................................... 18 Table 2: Input Parameters for the E model [23]. ........................................................................... 24 Table 3: Input Parameters for 1/E model [23]. ............................................................................. 25 Table 4: Input Parameters for HCI time to failure model [23]. .................................................... 30 Table 5: Model parameters for NBTI failure mechanism [23]. .................................................... 36 Table 6: Example of part information sources. ............................................................................. 43 Table 7: Current state of the art of process nodes or technologies [86], [87]. .............................. 49 Table 8: Model constants of three die-level failure mechanisms and associated models [23]. .... 62 Table 9: Geometric parameters of three die-level failure mechanisms and associated models. ... 65 Table 10: Application conditions of three die-level failure mechanisms and associated models. 66 Table 11: Variation of estimated TTF at different numbers of random sampling. ........................ 73 Table 12: Results of EM as Bx Life. ............................................................................................. 74 Table 13: Results of TDDB concerning the failure model as Bx Life. .......................................... 75 Table 14: Results of HCI for N-channel and P-channel as Bx Life. ............................................. 75 Table 15: Ranking of the parts based on Bx life results obtained through MATLAB GUI. ......... 77 1 1. Introduction: Comprehensive Idea of Part Manufacturers Avoiding Sharing Critical Information The practice of part manufacturers avoiding providing relevant information is a source of concern that can have significant implications for product quality, reliability, safety, and customer satisfaction [1]. The practice describes circumstances in which manufacturers intentionally withhold or give customers insufficient information, which can impede decision- making processes. There are several reasons why part manufacturers may engage in this practice. Preserving a competitive advantage or protecting private information is one of the main drivers. Manufacturers could be reluctant to provide critical information about their manufacturing procedures, geometry dimensions, materials, or design requirements out of concern that rivals may replicate their techniques or learn information that might jeopardize their position in the market [2]. Protecting confidential information is understandable, but it becomes a problem when it prevents the customer from assessing the product thoroughly. Part manufacturers often hesitate to provide detailed reliability information because they are often unaware of how their parts will be utilized in the end product. As a result, they may refrain from sharing specific reliability data or performance characteristics that could be useful for customers. Manufacturers typically produce parts that are used in a wide range of applications across various industries. These applications can vary significantly in terms of operating conditions, environmental factors, and stress levels. Consequently, it becomes challenging for part manufacturers to predict precisely how their parts will perform in every possible scenario. By withholding reliability information, manufacturers can avoid making claims or guarantees 2 about the performance of their parts in specific applications. They may fear potential liability issues if their parts fail to meet customer expectations in unforeseen circumstances or environments. Another element that contributes to this issue is cost reduction. Gathering and providing comprehensive information takes time, effort, and resources [3]. Manufacturers may reduce these outlays by offering what is strictly necessary to comply with contracts or requirements. This approach allows them to cut costs but neglects the importance of sharing crucial details that could impact the customer's ability to evaluate the part's quality, reliability, and compatibility with their specific requirements. Manufacturers may also refrain from disseminating helpful information due to poor communication channels and a lack of standard operating procedures. Manufacturers may undervalue the importance of giving their customers detailed data when communication is not established or there is no defined framework for information exchange. In these situations, they can presume that the client would ask for relevant information or overlook the significance of proactive information exchange [4]. The consequences of manufacturers avoiding conveying useful information can be far- reaching. Customers may face challenges understanding the part performance capabilities, potential risks, or compliance with industry standards [5]. Inadequate information can hinder their ability to make informed decisions about the part's suitability for their applications or to assess its compatibility within their existing systems or processes. This can lead to delays, additional costs, and potential safety or performance issues. This chapter examines the part selection based on reliability assessment, which calls for manufacturer-provided information, and how the lack of information impacts the assessment. 3 Additionally, it offers perceptions into numerous sources of original component manufacturer (OCM) documents that only adhere to the letter of the law but do not do so in a way that effectively conveys crucial information. 1.1. Part Assessment Process The production of a part typically involves a complex supply chain comprising various companies that are directly or indirectly involved in manufacturing constituent parts and materials for the overall part. To ensure the production of a reliable product, it is essential to carefully choose parts that possess sufficient quality and the ability to satisfy the expected performance requirements for the specified life-cycle conditions [6]. Generally, a group with considerable expertise is responsible for part selection and management. This team develops part assessment criteria and establishes acceptability levels to facilitate the selection of parts. The selection of a part is based upon its adherence to the specified requirements, its cost-effective feasibility, and its timely availability to meet the schedule requirements. In the event of any issues, the parts management team assists in identifying substitute sources of parts or devising strategies to help the supplier produce a better part. Several product design teams uphold a catalog of preferred parts with consistent reliability and performance. A "preferred part" is typically characterized by maturity and a successful manufacturing, assembly, and field operation track record. A mature part may become undesired or obsolete because of new technologies, methods, markets, materials, and pricing pressures. A new part could be necessary when creating a brand- new product or upgrading an existing one. It is not always practicable for a client of components to dictate requirements because a company may develop parts for numerous distinct customers 4 with various target applications. Sometimes, it could be essential to evaluate a part beyond what the manufacturer does to choose the right one while selecting parts with the necessary qualities. Performance, quality, dependability, and ease of assembly are crucial factors in part assessment (see Figure 1) [6]. Performance is evaluated by functional assessment against the datasheet specifications. Process capability and outgoing quality metrics serve to assess quality. Results of reliability tests and part qualification are used to evaluate reliability. If a part is compatible with the machinery and procedures used in the subsequent assembly, it is acceptable from an assembly standpoint. Although all four components are necessary to complete the part assessment, we focus on the part selection based on reliability. Procurability of the part also plays vital role in performing part selection and management. Figure 1: Part assessment process [6]. 1.2. Problem with Part Reliability Information from the Manufacturer Information regarding part qualification and reliability test results is required to perform part selection based on reliability, but obtaining part reliability information is more complex. Electronic part manufacturers usually communicate part reliability information as the mean time between failures (MTBF) or failure per billion hours (FIT). However, a constant failure rate (exponential failure distribution) as a metric reliability value does not account for the damage accumulation. Additionally, the constant failure rate estimation is based on the assumptions that 5 the failures during operation are random and that there is no wear-out, two rarely true assumptions for electronics [7]. Figure 2 illustrates the problem with part reliability information provided as MTBF and FIT rates. It depicts a significant change in MTBF of approximately 20,000 years depending on the confidence level for the same part and is also the same case with the FIT rates [8]. Assuming a constant failure rate, manufacturers can test a large number of parts and use the results to estimate the failure rate for a particular part number. This methodology is used because, in many cases, no failures are observed during the testing. Because of these assumptions and the lack of accounting for application conditions, it is not recommended to use failure rate data for projecting reliability metrics. Figure 2: Example of a part manufacturer reported reliability information provided as MTBF and FIT rates [8]. 1.2.1. Abuse of AEC-Q100 Specifications As the part reliability information from the manufacturer does not provide the actual reliability estimates, one can estimate the reliability of a part by utilizing the testing data performed on the part by the manufacturer. AEC-Q100 is known for failure mechanism-based stress test qualification for packaged integrated circuits (ICs) used in automotive applications [9]. The Automotive Electronics Council (AEC) established this specification to define qualification requirements and procedures for packaged ICs in the automotive industry. An AEC-Q100- 6 qualified device means that the device has passed the specified stress tests and guarantees a certain level of quality/reliability [10]. Automotive parts that meet the qualification requirements of AEC must complete a production part approval process (PPAP) [11] [12]. This process requires documenting the design, manufacture, and status of a part to the customer. Each PPAP document contains 18 – 20 sections, including material information, qualification testing results, and design/process failure mode effect analysis (D/PFMEA) [13]. This information is useful for comparing parts and can be used to understand the expected reliability of the part better. According to AEC-Q100, the part manufacturer requires to complete device qualification by performing die fabrication reliability testing for electromigration (EM), time-dependent dielectric breakdown (TDDB), hot carrier injection (HCI), negative bias temperature instability (NBTI), and stress migration (SM) as shown in Figure 3. The manufacturer should provide information on sample size/lot, number of lots, acceptance criteria, and testing methods for these tests. Additionally, the data, test method, calculations, and internal criteria should be available to the user upon request for new technologies. However, it is not usual practice for many part manufacturers to provide such information. Figure 4 represents a PPAP document from Texas Instruments for a packaged IC used in the automotive industry [14]. The first observation when Figures 3 and 4 are compared is that the manufacturer did not follow the template providing the information as per the AEC-Q100. The acceptance criteria, notes, number of lots, test method, and additional requirements columns are neglected and not included. Additionally, the naming convention is not followed in a few other columns, which can lead to confusion or improper understanding of the data. Sample size/lot is provided as SS/lot. Furthermore, new or unknown columns, such as test specification, minimum lot quantity, and duration, are included. 7 Figure 3: Die fabrication reliability tests to be performed as per AEC-Q100 [9]. Figure 4: Die reliability tests information from Texas Instruments PPAP document for part number TPS54560BQDDAQ1 [14]. The manufacturer must provide all the information, such as sample size per lot, number of lots, acceptance criteria, and testing methods for all the failure mechanisms. However, the manufacturer has not provided such information, and even the columns in the part's PPAP document were left blank. This concludes that no data related to these die-level failure 8 mechanisms is provided. Furthermore, it is mentioned that the device testing is performed and completed as per process technology, which depicts that the tests for these failure mechanisms are not performed for every part number but for process technology. The PPAP provides test specifications for EM, TDDB, and HCI failure mechanisms but not for the NBTI and SM. The provided test specifications are of Joint Electron Device Engineering Council (JEDEC). JEDEC Solid State Technology Association developed the JESD61 [15]: Isothermal Electromigration Test Procedure, JESD35 [16]: Procedure for the Wafer-Level Testing of Thin Dielectrics, and JESD60 & 28 [17] [18]: A Procedure for Measuring P & N- Channel metal-oxide-semiconductor field-effect transistor (MOSFET) Hot-Carrier-Induced Degradation Under direct current (DC) Stress. The manufacturer provided these standards as test specifications for the respective failure mechanisms, and they offered the testing method. Still, the standards do not provide insights into stress application conditions, failure criteria, and the number of samples. In JESD35 [16], it is clearly stated that "the purpose of this document is to describe oxide test techniques for quick evaluation and control of oxide fabrication techniques. It does not specify acceptance or rejection criteria for any of the described procedures and, therefore, is not intended to be used to predict metal oxide semiconductor (MOS) Integrated Circuit failure rates." Based on this finding, the critical observation is that the manufacturer can toggle with the number of samples, failure criteria, and stress conditions and obtain the required qualification or reliability to satisfy AEC-Q100. With no data relevant to reliability testing, the possibility of securing part reliability through this approach is not achievable and advisable. 1.2.2. Problems with Reliability Testing Another approach to obtaining part reliability is testing the part under usage or accelerated stress conditions to achieve time-to-failure (TTF) data on sample size. Later, these times to 9 failure are extrapolated to usage conditions if tested under accelerated stress conditions which are then plotted and modeled through life-stress relationships such as Arrhenius and Power-Law models to obtain the reliability of the part [19]. While testing, the number of failures depends on the testing time. Depending on the length they are tested for, there can be samples that may not fail by the end of the test, also known as censor data. While running accelerated tests, one critical aspect is extrapolating the accelerated testing data to use condition failure data for the exact failure mechanism [6]. If not, the estimated reliability could be misleading for a part reliability assessment. Even though the procedure to obtain part reliability is more accurate through this method, performing part selection based on reliability estimated through this approach is far from reach. Accelerated stress testing involves subjecting the components or products to extreme conditions for an extended period. These tests typically require significant time to observe failures and collect reliable data. In a fast-paced industry or when there are time constraints for product development and release, companies may not have the luxury of dedicating the necessary time for such tests. Accelerated stress testing can be expensive, especially involving specialized equipment, facilities, and resources. The costs associated with creating the necessary testing infrastructure, acquiring or building test setups, and conducting the tests over an extended period can be prohibitive for some companies, particularly smaller ones with limited budgets. Accelerated stress tests require skilled personnel, specialized equipment, and dedicated resources. Not all companies may have the in-house expertise or resources to design, set up, and conduct these tests. The lack of available resources or expertise can make it challenging to perform such tests within the company. The TTF could vary from seconds to hundreds of years, especially for die- level failure mechanisms such as EM, TDDB, HCI, and NBTI [20]. In such conditions, even the 10 accelerated tests could not provide results as the acceleration factor (AF) will be so high that the failure mechanism to be tested will not remain the same. Also, testing must be performed for each failure mechanism for a part that requires different test setups. In addition, to complete part selection based on reliability for "n" number of parts for "m" number of failure mechanisms, the number of tests, testing setups, and the length of test till failure is so huge that companies will be out of the market by the time they finish performing the testing. Moreover, the part manufacturers do not provide the test methods, number of samples, sample size, failure criteria, and other critical information required to perform testing [14]. In conclusion, testing the samples is not a solution to achieving part selection based on reliability. 1.2.3. Refusing to Divulge Information Apart from the concerns mentioned above, there are additional aspects for which the information is labeled as proprietary, which cannot be shared or considered a business secret in the PPAP document. For the same part mentioned in the reported PPAP, the die channel length is regarded as confidential information as per Texas Instruments, which is not provided, as shown in Figure 5. Understanding the die channel length is one of the preliminary steps in performing a part reliability assessment which will be discussed in Chapter 4. Figure 5: Wafer/die technology description provided in the PPAP document [14]. 1.2.4. PPAP Availability In addition, not all part manufacturers provide the PPAP document. Of the ten parts used for the study, only 1 part manufacturer provided the PPAP document. While some manufacturers do 11 or may provide the information upon request, it depends on the scale at which the parts are purchased from the manufacturer. There will be some instances in which the manufacturer may not respond or does not want to provide the information. This was evaluated by analyzing all of the MOSFETs provided on the Diodes Inc. website [21]. Figure 6: Availability of PPAP documentation for Diodes Inc. MOSFETs [22]. Diodes Inc. has 1174 MOSFETs, 179 of which are rated as commercial (COTS), and the remaining 995 are automotive parts. Figure 6 shows the results of the analysis. Of the automotive parts, 19% do not provide a PPAP document, 48% provide a PPAP document with an additional request, and the rest have a direct link to the PPAP document [22]. To summarize, the PPAP could be a good resource of information related to a part that in itself is limited to providing the bare minimum data, and one may not find it for a few other parts at all. 1.3. Potential Recommendations Based on Observations In summary, the issue of part manufacturers avoiding conveying useful information is a challenge requiring proactive measures. Highlighting the importance of effective communication and transparency throughout the supply chain is vital. Manufacturers should recognize that 12 providing useful information enhances trust, strengthens relationships, and ultimately benefits their long-term reputation and customer satisfaction. Clear guidelines and contractual agreements must be established to ensure manufacturers fulfill their obligations to convey comprehensive and relevant information to customers. Furthermore, customers should actively engage with their suppliers and express their expectations regarding information exchange. Customers can encourage manufacturers to be more forthcoming with valuable information by clearly articulating their needs and requirements. Collaboration and open dialogue between manufacturers and customers can help build mutual understanding and enable the development of effective solutions that address concerns while respecting proprietary rights and competitive considerations. Regulatory bodies and industry associations also play a vital role in addressing this issue. Organizations such as JEDEC, International Electrotechnical Commission (IEC), and AEC can establish guidelines, standards, and best practices that promote transparent information exchange between manufacturers and customers. Regular audits and inspections can be conducted to ensure compliance and discourage the practice of avoiding conveying useful information. By fostering a culture of transparency, standardized procedures, and open communication, manufacturers can fulfill their responsibility to provide comprehensive information while protecting their legitimate interests. Performing these will ultimately contribute to better decision-making, enhanced product quality, and stronger relationships between manufacturers and customers. 13 2. Unmasking the Limitations of Industry-Published Documents on Die-Level Failure Mechanism Models Concerning Time-to- Failure Estimation The focus of this project was not on developing a new model for die-level failure mechanisms. Instead, my exploration began by delving into industry-published documents specifically addressing failure models at the die level. I aimed to thoroughly examine the available information sources and determine how the models can be utilized in modeling part selection. By carefully analyzing these documents, I hoped to gain valuable insights into the various models of failure mechanisms, such as Time-Dependent Dielectric Breakdown (TDDB), Electromigration (EM), Hot Carrier Injection (HCI), and Negative Bias Temperature Instability (NBTI). I recognized that the existing industry literature JEDEC JEP122H: Failure Mechanisms and Models for Semiconductor Devices [23], might provide valuable knowledge and insights regarding these failure mechanisms. However, my primary objective was to understand the extent to which industry-published documents discuss these mechanisms and the applicability of failure models. Specifically, I aimed to investigate whether these documents shed light on time- to-failure analysis, which is crucial in evaluating the reliability and durability of semiconductor devices at the die level. This chapter details the four die-level failure mechanisms (EM, TDDB, HCI, and NBTI) and respective failure models. By scrutinizing the available source of information, I aimed to uncover any potential gaps or limitations in the JEP122H standard of these failure mechanisms and assess the feasibility of utilizing existing models for my research and modeling part selection. 14 2.1. Literature Review of Failure Mechanisms and Models Advanced integrated circuits (ICs) are characterized by complex designs and usage of a wide variety of materials, including semiconductors, insulators, metals, and plastic molding compounds, among others [24]. The success of semiconductors has been significantly attributed to the critical role played by the scaling of device geometries, which has led to improved performance and cost reductions per device [25]. The process of scaling, which involves reducing device geometries by 0.7x for each new technology node following Moore's Law, has resulted in an increase in electric fields within materials [26]. This has brought the materials closer to their breakdown strength and has led to concerns regarding electromigration (EM) due to the rise in current densities in metallization. The acceleration of reliability issues, such as time-dependent dielectric breakdown (TDDB), hot-carrier injection (HCI), and negative-bias temperature instability (NBTI), can occur with higher electric fields, as elaborated in the sections that follow [27]. These failure mechanisms in complementary metal-oxide-semiconductor (CMOS) are illustrated in Figure 7. Figure 7: CMOS wearout die-level failure mechanisms [28]. 15 2.1.1. Electromigration Electromigration is a critical reliability concern in CMOS devices. Electromigration is when the formation of a high current density, caused by local heating, parasitic currents, and other sources, results in the flow of metallic atoms in the direction of current flow [29]. This flow of metallic atoms over time impacts electronic function over time. The flow of metallic atoms essentially causes the formation of voids in some areas of the overall lattice (potentially leading to open circuits) and the nucleation of denser metallic layers leading to the formation of hillocks (potentially leading to short circuits) [30]. Figure 8 demonstrates the electromigration process. Figure 8: Illustration of void and hillock due to electromigration in interconnects [31]. The electromigration failure mechanism, as shown in Figure 9, begins due to the momentum exchange between the current-carrying electrons and the metal lattice. For high-electron current densities, the electron wind (collisions of the electrons with the metal ions in the lattice) serves to exert force on the metal ion, which is large enough to cause the metal ion to drift [32]. Generally, this metal-ion movement is along grain boundaries in aluminum (Al)-alloys and along interfaces for copper (Cu) [33]. 16 Along with the electron wind and interconnect material, flux divergence is a critical element of the electromigration failure mechanism [34]. Flux divergence represents the net flow of material into and out of a region of interest (interconnect length). A flux divergence can result in the accumulation or depletion of metal ions in the region of interest. Microstructure differences, such as grain size differences, result in flux divergences. Electromigration-induced transport is primarily along grain boundaries in polycrystalline Al-alloy conductors. Grain boundaries in an interconnect are usually not uniform which is illustrated as two regions of uniform grain structure in Figure 9 for simplicity [35]. Electron flow from left to right can serve to produce a void in the metal due to the flux divergence at the microstructure gradient. Electron flow from right to left can produce an accumulation of metal due to the flux divergence at the microstructure gradient. The void or accumulation (hillock) primarily initiates at triple points of grain boundaries. The dominant electromigration-transport mechanism for Cu can be along the Cu barrier interfaces. Figure 9: Failure mechanism of electromigration [35]. 17 Failures caused by electromigration in CMOS include open circuits and shorts [36]. Open circuits occur when the metal line breaks due to voids, leading to a discontinuity in the electrical path. Shorts occur when adjacent metal lines come into contact due to metal accumulation or the formation of hillocks. The metal accumulated can break through the dielectric separating the adjacent interconnects resulting in unintended electrical connections. An increase in resistance can cause performance degradation in the IC resulting in slower signal transmission, increased power consumption, and reduced overall circuit functionality. To detect and analyze electromigration failures, various techniques are employed [37]. One common method is electrical testing, where the resistance of metal lines is monitored over time. An increase in resistance can indicate the thinning or breakage of metal lines. Additionally, techniques such as current stress tests are used to evaluate the reliability of CMOS devices under accelerated aging conditions. Failure analysis techniques, such as scanning electron microscopy (SEM), transmission electron microscopy (TEM), and focused ion beam (FIB), are utilized to analyze the failed metal lines and identify the root cause of the failure. SEM and TEM allow for visualizing voids, metal thinning, and other structural changes [38]. FIB is used for cross- sectional analysis, enabling precise imaging, and probing of the failed region. Design tools and techniques are employed to protect against electromigration failures in CMOS devices. Proper material selection is crucial, with copper and copper alloys being preferred over aluminum due to their better resistance to electromigration [39]. Design rules are optimized to ensure appropriate line widths and thicknesses, minimizing the current density and reducing the susceptibility to electromigration. Selecting dielectric materials with low diffusion rates, such as low-k dielectrics, and using barrier layers and passivation materials to prevent metal diffusion into the surrounding dielectric materials improves the reliability of interconnects 18 [40]. Stress engineering techniques, such as introducing compressive or tensile stress in the metal lines, can enhance their resistance to electromigration-induced failures. The PPAP provided JEDEC JESD61A: Isothermal Electromigration Test Procedure as test specification for electromigration qualification [41]. This document provided an accelerated current stress test, and the failure criterion is defined based on the maximum percent resistance increment of the test line. It is provided that the test is terminated in general when the maximum percent resistance increment is about 2%. Black's equation estimates the time to failure based on the interconnect dimensions, absolute temperature, physical material properties, electron current density, and activation energy for EM to occur and an experimentally determined exponent that varies depending on the EM failure mode [23]. This model assumes that empirical values attained during accelerated testing can be applied to normal use conditions to provide a model of TTF. The model is shown in equation (1), and all the model inputs are listed in Table 1. For aluminum, the test current densities should be on or above the order of 10 mA/ΞΌm2 (1 x 106 A/cm2), the conductor should be at least 500 ΞΌm in length and preferably 1 mm long. In contrast to Al, for Cu test structure, lengths are commonly between 200 and 400 ΞΌm for stress current densities on the order of 20 mA/ΞΌm2. The failure criterion is a fractional or percentage resistance increase (e.g., Ξ”R/R x 100 = 20% is commonly used). 𝑇𝑇𝑇𝑇𝑇𝑇 = 𝐴𝐴0 𝑗𝑗𝑛𝑛 . 𝑒𝑒 πΈπΈπ‘Žπ‘Žπ‘Žπ‘Ž π‘˜π‘˜π‘˜π‘˜ (1) Table 1: Input Parameters for Black's Equation [23]. Variable Function or Purpose Unit Method of Determination 19 A0 Scaling Factor (time*A)/cm2 Experimentally determined from life testing for technology by product developers j Current density A/cm2 Calculated from applied interconnect current and cross-sectional area n Exponent Constant Experimentally determined, dependent on interconnect material, experimental conditions, and junction failure mode. Range lies between 1.1 < n < 2, in common, Aluminum: n = 2 Copper: n = 1.1 Eaa Apparent activation energy eV Experimentally determined, dependent on conductor diffusion pathways. Al + %Si: 0.5 < Eaa < 0.6 Al + %Cu: 0.7 < Eaa < 0.8 Cu: 0.85 < Eaa < 0.95 k Boltzmann's constant eV/K 8.6173303 Γ— 10βˆ’5 eV/K T Temperature K Application specific 2.1.2. Time-Dependent Dielectric Breakdown Due to the very high operating electric fields in the gate dielectric of CMOS devices, time- dependent dielectric breakdown (TDDB) can be an important IC failure mechanism. Time- dependent dielectric breakdown is the failure mechanism that occurs when a constant electronic field application less than the breakdown strength of the dielectric material eventually wears down the gate oxide of a CMOS [42]. The gate oxide, typically silicon dioxide (SiO2), serves as an insulating layer between the gate electrode and the channel region in a CMOS. TDDB refers to the degradation and eventual breakdown of the gate oxide over time under electrical stress which is caused by creating conductive paths [43]. 20 The failure mechanism of gate oxide TDDB involves four steps, as illustrated in Figure 10 [44]. Step 1 is about the formation and growth of defects within the oxide layer due to the application of high electric fields. There are two types of defects extrinsic and intrinsic. Extrinsic defects occur due to material defects and manufacturing process-induced defects. Intrinsic defects are created while operating as the defects flow around in the gate oxide, the electrons at the gate/gate oxide interface can leak through the interface using the extended defects or traps, leaving a hole inside the gate oxide. These defects, known as traps, can capture and release charge carriers leading to an increase in the number of traps inside the gate oxide. Over time, the cumulative effect creates a conduction path for the charge carriers to flow through the gate oxide, as in Step 2. This leakage current or charge carriers flowing through the conduction path further increases the number of defects due to thermal damage, which creates more paths leading to thermal runaway, as shown in Step 3, also known as a soft breakdown. With an increase in temperature due to thermal runaway, the gate oxide (SiO2) melts, leaving a conduction channel through breakdown as shown in Step 4, also identified as hard breakdown. The steps that occur during the failure mechanism of TDDB are explained through the gate current vs. stress time plot, as shown in Figure 11 [45]. Due to the presence of extrinsic defects, there is a leakage current right from the beginning, as shown in Step 1, but the leakage current increases slightly over time due to extra traps that are created over time. Step 2 shows a sudden spike in leakage current as the conduction path is created during this phase. Step 3 shows an increase in leakage current due to thermal runaway, also called soft breakdown. There are two soft breakdowns in the plot suggesting that two conduction channels are simultaneously created during the stage at different locations of the gate oxide. Step 4 has the spike in leakage current 21 representing the gate oxide hard breakdown or channel creation through the oxide, as shown in the neighboring microscopy images. Figure 10: TDDB failure mechanism process steps [44]. Failures caused by gate oxide TDDB include oxide breakdown and gate leakage. Oxide breakdown occurs when the gate oxide layer cannot withstand the applied voltage, forming a conductive path through the oxide [46]. This leads to an unintended electrical connection and can cause device malfunctions or failures. Gate leakage refers to the undesirable current flow through the gate oxide, which can adversely affect device performance and increase power consumption. 22 Figure 11: TDDB failure mechanism explained through gate current vs stress time plot [45]. Accelerated stress tests are performed on samples to detect and analyze gate oxide TDDB failures. These tests involve subjecting the devices to high electric fields and monitoring their degradation over time. The devices are typically stressed under high voltage, which accelerates the TDDB process. Electrical measurements, such as time-dependent leakage current measurements and breakdown voltage measurements, are used to assess the reliability and lifetime of the gate oxide [47]. Failure analysis techniques, including electrical characterization, microscopy, and spectroscopy, are employed to analyze the failed gate oxide [48]. Techniques such as scanning electron microscopy (SEM) and transmission electron microscopy (TEM) are used to visualize the defects and structural changes within the oxide layer. Additionally, techniques like Fourier- transform infrared (FTIR) spectroscopy and secondary ion mass spectrometry (SIMS) can provide valuable information about the chemical composition and elemental distribution within the gate oxide. 23 Design tools and techniques are employed to protect against gate oxide TDDB failures. Proper material selection is crucial, with the choice of high-quality silicon dioxide (SiO2) or advanced gate dielectric materials like high-k materials with superior TDDB characteristics [49]. Optimized design rules are implemented to ensure appropriate oxide thickness and gate geometry, reducing the electric field and stress within the oxide. Stress engineering techniques, such as introducing strain in the channel region, can also improve the reliability of the gate oxide [50]. Cooling techniques and thermal dissipation within the ICs are implemented to maintain lower operating temperatures and improve reliability as temperature rise accelerates TDDB failures. Control of deposition, annealing conditions, and reduction of impurities or defects in the dielectric layer are important process considerations. The PPAP provided JEDEC JESD35A: Procedure for the Wafer-Level Testing of Thin Dielectrics as test specifications for TDDB qualification [51]. This document provided ramped voltage, constant voltage, and ramped current stress tests. No failure criterion is defined in the standard, and it is explicitly mentioned that "the purpose of this document is to describe oxide test techniques for quick evaluation and control of oxide fabrication techniques. It does not specify acceptance or rejection criteria for any of the described procedures and, therefore not intended to be used to predict MOS Integrated Circuit failure rates." TDDB models are mathematical representations that estimate the time to failure of the gate oxide due to dielectric breakdown. These models are intended for silicon dioxide gate dielectric application over a range of oxide thicknesses. The physical models used to explain the four empirical models are the thermo-chemical model (E model or constant field/voltage acceleration exponential model), anode hole injection (1/E model or, equivalently, anode hole injection model), bulk trap generation (V model, where the failure rate is exponential with voltage), and 24 anode hydrogen release model (Anode hydrogen release for the power-law model) [23]. While the thermo-chemical and V models adopt a field-driven mechanism, both anode hole injection and anode hydrogen release models assume a current-driven mechanism in addition to the role of the applied voltage or oxide fields, depending on the specific conditions. Based on the existing literature and the standards, E and 1/E models best correlate for estimating the TDDB failures which are discussed in this section. In the E model, as shown in equation (2), for gate oxide thickness greater than 4 nm, TDDB is due to field-enhanced thermal bond breakage at the silicon-silica interface. The E-field reduces the activation energy required for thermal bond breakage and exponentially increases the failure reaction rate. The time-to-failure (TTF), inverse to reaction rate, decreases exponentially with temperature. The inputs required for the E model are provided in Table 2. 𝑇𝑇𝑇𝑇𝑇𝑇 = 𝐡𝐡0 exp οΏ½βˆ’π›Ύπ›Ύ 𝑉𝑉𝑔𝑔 π‘‘π‘‘π‘œπ‘œπ‘œπ‘œ οΏ½ expοΏ½ πΈπΈπ‘Žπ‘Ž π‘˜π‘˜π‘‡π‘‡π‘—π‘— οΏ½ (2) Table 2: Input Parameters for the E model [23]. Variable Function or Purpose Unit Method of Determination 𝐡𝐡0 Scaling factor times Experimental results 𝛾𝛾 Field acceleration constant cm/MV Industry standards or literature. Range in 2.5 < 𝛾𝛾 < 3.5 𝑉𝑉𝑔𝑔 Gate voltage V Application specific π‘‘π‘‘π‘œπ‘œπ‘œπ‘œ Gate oxide thickness nm Part Specific 25 Variable Function or Purpose Unit Method of Determination πΈπΈπ‘Žπ‘Ž Apparent activation energy for TDDB eV Industry standards or literature. Range in 0.6 < πΈπΈπ‘Žπ‘Žπ‘Žπ‘Ž < 0.9 𝑇𝑇𝑗𝑗 Junction temperature K Application specific k Boltzmann's constant eV/K 8.6173303 Γ— 10βˆ’5 eV/K The 1/E model, as shown in equation (3), for gate oxide thickness less than 4 nm, assumes the cause of TDDB (even at low fields) is postulated to be due to current through the dielectric by Fowler-Nordheim (F-N) conduction. F-N injected electrons (from the cathode) cause impact ionization damage of the dielectric due to accelerating through the dielectric. When these accelerated electrons reach the anode, hot holes may be produced that can tunnel back into the dielectric, causing damage (hot-hole anode injection mechanism). The time-to-failure is expected to show an exponential dependence on the inverse of the electric field. The inputs required for the 1/E model are provided in Table 3. 𝑇𝑇𝑇𝑇𝑇𝑇 = 𝜏𝜏0(𝑇𝑇) expοΏ½ 𝐺𝐺(𝑇𝑇)π‘‘π‘‘π‘œπ‘œπ‘œπ‘œ 𝑉𝑉𝑔𝑔 οΏ½ (3) Table 3: Input Parameters for 1/E model [23]. Variable Function or Purpose Unit Method of Determination 𝜏𝜏0(𝑇𝑇) Scaling factor seconds Industry standards or literature. Approximately 1 x 10βˆ’11 s 26 Variable Function or Purpose Unit Method of Determination 𝐺𝐺(𝑇𝑇) Field acceleration constant MV/cm Industry standards or literature. Approximately 350 MV/cm 𝑉𝑉𝑔𝑔 Gate voltage V Application specific π‘‘π‘‘π‘œπ‘œπ‘œπ‘œ Gate oxide thickness nm Part Specific 2.1.3. Hot Carrier Injection Hot carrier injection is a reliability concern in CMOS technology that can lead to device degradation and failure. It occurs when high-energy carriers, typically electrons or holes, gain sufficient kinetic energy and impact the gate oxide, causing damage to its structure and properties. The primary cause of hot carrier injection is the presence of high electric fields in the transistor channel region [52]. This occurs as carriers move along the channel in CMOS and experience impact ionization near the drain end of the device, as shown in Figure 12 [53]. The damage can occur at the interface, within the oxide and/or within the sidewall spacer. The silicon–silicon dioxide interface shown in Figure 13 is critical to undergoing failure mechanism [35]. Silicon atoms in the substrate are four-fold bonded in a crystalline lattice. The SiO2 layer is amorphous, with the silicon four-fold bonded to neighboring oxygen (in a tetrahedral arrangement). Due to lattice mismatch at the interface, not all silicon bonds will be satisfied, creating dangling silicon bonds. Hydrogen is usually introduced during CMOS fabrication to chemically tie up/terminate these dangling bonds and prevent them from being electrically active. 27 Figure 12: Cross-section of an n-channel MOSFET illustrating the generation and injection of hot carriers into the gate-oxide (Si𝑂𝑂2) [53]. As electrons are accelerated from the source to the drain in the electric field, impact ionization at the drain end of the CMOS can produce electron–hole pairs. The electrons are redirected towards the gate oxide under the influence of the strong electric field near the drain end of the channel. The channel is not a perfect rectangular path but a tapered path, as shown in Figure 14 [54]. The energetic electrons, redirected toward the Si/SiO2 interface, produce interface damage in a localized region near the drain end (Si-H bond breakage), reducing its electrical performance and overall reliability. Over time, this damage accumulates and results in threshold voltage shift, increased leakage current, or even gate oxide breakdown. 28 Figure 13: Si – SiO2 interface [35]. Figure 14: Drain end of the channel producing electron-hole pairs [54]. Failures caused by gate oxide hot carrier injection include degradation of transistor parameters, such as threshold voltage shift and subthreshold slope degradation. Threshold voltage shift refers to the change in the voltage required to turn on or off the transistor, affecting the device's operational characteristics [55]. Subthreshold slope degradation relates to the increase in the off-state leakage current, which reduces the overall efficiency and increases power consumption. Unlike other failure mechanisms, HCI-induced damage does not typically result in visible physical changes or visible signs of failure on the device surface. Instead, electrical 29 measurements and characterization primarily detect and analyze HCI failures [56]. This involves evaluating the electrical performance of the affected transistors or circuits, such as threshold voltage shifts, subthreshold slope degradation, or increased leakage currents. These electrical changes are indicators of the impact of HCI on the device's functionality and reliability. Techniques like transient current measurement, time-dependent degradation measurement, and stress tests can be performed to investigate HCI-induced failures further. These techniques involve subjecting the device to specific electrical conditions and monitoring its response over time. By observing the device's electrical behavior under different stress conditions, analysts can gain insights into the extent of HCI-induced degradation and the device's reliability [57]. Design tools and techniques are implemented to protect against gate oxide hot carrier injection. One approach is optimizing transistor dimensions and channel doping profiles to reduce the electric fields in the channel region. Using lightly doped drain (LDD) or double doped drain (DDD) structures or extension implants such as side walls can help distribute the electric field more uniformly and mitigate the impact of hot carriers on the gate oxide [58]. Additionally, advanced device structures, such as strained silicon or high-k dielectrics, can improve the resistance to hot carrier injection. HCI evaluations are almost always performed on test structures rather than products and done under direct current (DC) conditions, thus, the calculated lifetime should be considered a figure of merit for process comparison [59]. A short lifetime observed with DC test structures does not imply unacceptable product performance under alternating current (AC) conditions. For a digital circuit, like an inverter, HCI stress only occurs during the device's turn-on and turn-off periods. These turn-on and turn-off periods are typically 1-2% of the overall cycle time. Thus, the time to failure (TTF) due to HCI under usage stress conditions is significantly lower than the 30 predicted TTF under DC conditions. HCI is considered a critical failure mechanism or tends to increase the time to failure for advanced process technologies, such as below 130 nm. As technology nodes shrink and operating voltages increase, the electric fields in the channel region intensify, making the gate oxide more susceptible to hot carrier injection. The PPAP provided JEDEC JESD28A and 60A: A Procedure for Measuring N and P- Channel MOSFET Hot-Carrier-Induced Degradation Under DC Stress as test specifications for HCI qualification [60], [61]. No failure criterion is defined in these standards, and it is explicitly mentioned that "These are to be used for comparison purposes only and should not be used as acceptance or rejection criteria. It is also important to realize that this procedure should not be interpreted to predict MOS IC failure rates." HCI-induced transistor degradation is well modeled by peak substrate current for the n- channels and peak gate current for the p-channels, at least for transistors at >0.25 ΞΌm gate length (L). For <0.25 ΞΌm p-channel, the worst-case lifetime occurs at maximum substrate current stress, and the time-to-failure (TTF) model is the same as the n-channel. The time to failure for HCI can be obtained from equation (4), and the input parameters are listed in Table 4. 𝑇𝑇𝑇𝑇𝑇𝑇 = 𝐢𝐢0(𝐼𝐼)βˆ’π‘›π‘›π‘’π‘’ πΈπΈπ‘Žπ‘Žπ‘Žπ‘Ž π‘˜π‘˜π‘˜π‘˜ (4) Table 4: Input Parameters for HCI time to failure model [23]. Variable Function or Purpose Unit Method of Determination 𝐢𝐢0 Scaling factor times Experimental results 𝐼𝐼 = 𝐼𝐼𝑠𝑠𝑠𝑠𝑠𝑠 Substrate current ΞΌA For N-channel, Measure peak substrate current during stressing or estimate based on information from datasheet 31 Variable Function or Purpose Unit Method of Determination 𝐼𝐼 = 𝐼𝐼𝐺𝐺 Gate current ΞΌA For P-channel, Measure peak gate current during stressing or estimate based on information from datasheet 𝑛𝑛 Empirically exponent Constant Function of stress voltage, temperature, and effective transistor channel length Common range is between 2 to 4 πΈπΈπ‘Žπ‘Žπ‘Žπ‘Ž Apparent Activation Energy for HCI eV Industry standards or literature. For N βˆ’ channel: βˆ’0.2 < Eaa < 0.4 For P βˆ’ channel: (L > 0.25Β΅m):βˆ’0.1 < Eaa < βˆ’ 0.2 (L < 0.25Β΅m): 0.1 < Eaa < 0.4 T Operating temperature K Application specific k Boltzmann's constant eV/K 8.6173303 Γ— 10βˆ’5 eV/K 2.1.4. Negative Bias Temperature Instability Negative Bias Temperature Instability (NBTI) is a phenomenon that affects the reliability and performance of p-channel transistors in CMOS technology. As mentioned in HCI, the dangling bonds are bonded with hydrogen during the manufacturing or fabrication process. Since the P-type MOS operates with a negative gate voltage, the electric field in the SiO2 layer is directed away from the Si and SiO2 interface, as shown in Figure 15 [35]. If a Si–H bond is broken during device operation by capturing a hole from the channel, it leads to freeing an H+ ion, as shown in equation (5). 𝑆𝑆𝑆𝑆 βˆ’ 𝐻𝐻 + (𝐻𝐻𝐻𝐻𝐻𝐻𝑒𝑒)+ β†’ 𝑆𝑆𝑆𝑆 βˆ’ + 𝐻𝐻+ (5) 32 Once H+ ions are generated, they drift away from the interface under the influence of an electric field governed by the transport equation as shown in equation (6), where 𝐽𝐽(π‘₯π‘₯, 𝑑𝑑) is the total flux, Β΅ is the mobility, 𝜌𝜌 is the density of 𝐻𝐻+ ions, π‘žπ‘žπΈπΈ is the force acting on 𝐻𝐻+ ions, and D is the diffusivity. 𝐽𝐽(π‘₯π‘₯, 𝑑𝑑) = Β΅ 𝜌𝜌(π‘₯π‘₯, 𝑑𝑑) (π‘žπ‘žπΈπΈ) βˆ’ 𝐷𝐷 πœ•πœ•πœŒπœŒ(π‘₯π‘₯, 𝑑𝑑) πœ•πœ•π‘₯π‘₯ (6) The total flux equation can also be expressed as a drift and diffusion flux change, as shown in Figure 16. Drift flux occurs as the free 𝐻𝐻+ ions move under the influence of the electric field into the gate oxide from the interface. This increases the concentration of the 𝐻𝐻+ ions in the gate oxide. When the electric field is not applied, due to the higher concentration, the 𝐻𝐻+ ions diffuse to the lower concentration areas (gate oxide interface), and re-bond with the silicon atoms which is represented by the diffusion flux. This leads to the recovery of the gate oxide properties by reducing the number of traps that are created during the application of the electric field. But the recovery is partial because of the 𝐻𝐻+ ions reacting with the already present traps, as shown in equations (7), (8), and (9). In summary, there are two modes in the NBTI failure mechanism: the stress condition mode and the recovery mode, as shown in Figure 17 [62]. The stress condition is when the electric field is applied, leading to hole trapping and bond breakage. The recovery mode is when no electric field is applied, due to which the bonds are partially rejoined. 33 Figure 15: NBTI failure mechanism through Si/SiO2 interface [35]. Figure 16: Partial recovery of the gate oxide in NBTI failure mechanism [35]. 𝐻𝐻+ + 𝑒𝑒 β†’ 𝐻𝐻 (7) 𝐻𝐻+ + 𝐻𝐻 + 𝑒𝑒 β†’ 𝐻𝐻2 (8) 𝐻𝐻+ + 𝐻𝐻+ + 2𝑒𝑒 β†’ 𝐻𝐻2 (9) 34 Figure 17: Cross-section of pMOS device under (a) NBTI stress conditions (b) in recovery [62]. NBTI involves the trapping and de-trapping positive charges in the gate oxide of p-channel transistors under negative bias and elevated temperature conditions [63]. The trapped charges cause a degradation in the transistor's characteristics, leading to increased threshold voltage and reduced channel mobility. This results in decreased device speed, increased power consumption, and timing violations in CMOS circuits. Examples of NBTI failures can be observed in various scenarios. In a digital circuit, the increased threshold voltage can cause a delay in signal propagation, impacting overall circuit performance. In analog circuits, NBTI-induced threshold 35 voltage shifts can result in offset voltage variations and reduced gain, compromising the accuracy and linearity of the circuit [64]. These failures manifest as performance degradation, reduced reliability, and potential functional failures in CMOS devices. The detection and analysis of NBTI failures involve several techniques. Stress testing methodologies, such as applying negative bias and elevated temperature for extended periods, accelerate the degradation process. Electrical measurements, such as monitoring threshold voltage shifts over time, provide valuable insights into the extent of NBTI-induced degradation. Statistical analysis and modeling approaches are employed to predict the reliability and lifetime of devices under NBTI stress conditions. To protect against NBTI failures, design tools and techniques are employed during the fabrication and design stages. Stress engineering involves optimizing the transistor's gate stack and process parameters to mitigate NBTI effects. Channel engineering techniques, such as using stress liners and embedded silicon-germanium (SiGe), help enhance the channel's resistance to NBTI-induced degradation. Material selection, such as high-k dielectrics for gate oxides, can reduce NBTI susceptibility. Design rules are optimized to minimize the impact of NBTI during layout and ensure robustness against NBTI-induced failures. Unlike the other failure mechanisms (EM, TDDB, and HCI), the PPAP document does not provide any specification for NBTI, which leads to no conclusions about how the part has been tested and qualified for NBTI as there is no information about the testing methods, failure criteria, number of samples tested, failure times, testing data, and many more. Although there are disagreements about the physical mechanism behind the degradation and the exact cause of bias temperature instability, two effects are believed to be at play: trapping positively charged holes and generating interface states [23]. It has been found that a broad 36 consensus is that when a MOS is stressed with a constant gate voltage at an elevated temperature, positive charges accumulate in the SiO2/Si interface or gate oxide layer. This charge leads to the deterioration of transistor parameters. The knowledge of physics limits the current state of the NBTI models for this mechanism. For a given gate oxide thickness (π‘‘π‘‘π‘œπ‘œπ‘œπ‘œ), either one of the equation (10) or (11) phenomenological models is generally used to describe the NBTI degradation. π›₯π›₯π›₯π›₯ = π·π·π‘œπ‘œ exp οΏ½ πΈπΈπ‘Žπ‘Žπ‘Žπ‘Ž π‘˜π‘˜π‘‡π‘‡ οΏ½ exp�𝛽𝛽𝑉𝑉𝑔𝑔� 𝑑𝑑𝑛𝑛 (10) Thus, time-to-failure (TTF) for a given accepted Ξ”p failure criterion (Ξ”π›₯π›₯𝑑𝑑), is 𝑇𝑇𝑇𝑇𝑇𝑇 = οΏ½ π›₯π›₯π›₯π›₯𝑑𝑑 𝐷𝐷0 exp οΏ½πΈπΈπ‘Žπ‘Žπ‘Žπ‘Žπ‘˜π‘˜π‘‡π‘‡ οΏ½ exp�𝛼𝛼𝑉𝑉𝑔𝑔� οΏ½ 1 𝑛𝑛 (11) The failure criterion Ξ”π›₯π›₯𝑑𝑑 is defined in terms of an allowed pMOS parameter shift. Typically, selecting a given failure criterion should depend on the circuit sensitivities and requirements of the pMOS device under investigation. Table 5: Model parameters for NBTI failure mechanism [23]. Variable Function or Purpose Unit Method of Determination 𝐷𝐷0 Scaling factor times Pre-factor is dependent on the gate oxide process and CMOS technology. Obtained from experimental results. π›₯π›₯π›₯π›₯𝑑𝑑 The shift in device parameter of interest (threshold voltage (π‘‰π‘‰π‘˜π‘˜), drain current (πΌπΌπ·π·π‘ π‘ π‘Žπ‘Žπ‘‘π‘‘)) Based on the parameter of interest. - 37 Variable Function or Purpose Unit Method of Determination 𝑉𝑉𝑔𝑔 Gate voltage V The absolute value of the gate voltage applied to the pMOS device in inversion. Obtained from datasheet or experimental results. Ξ± Measured gate voltage exponent Constant Measured values range between 3 to 4 n Measured time exponent Constant Measured values range between 0.15 to 0.25 πΈπΈπ‘Žπ‘Žπ‘Žπ‘Ž Apparent activation energy for NBTI eV Industry standards or literature: The typical range is 0.15 to 0.25 eV T Operating temperature K Application specific k Boltzmann's constant eV/K 8.6173303 Γ— 10βˆ’5 eV/K 2.2. Observations and Limitations The JEP122 standard provided comprehensive information on die-level failure mechanisms (EM, TDDB, HCI, and NBTI) and their corresponding failure models. Although the models are defined with relevant information, many parameters are provided as a range of values that vary depending on the parts, technology level, application conditions, and materials. Providing a broad range of values as inputs does not give proper time-to-failure distribution estimates. The remaining parameters related to part geometry or architecture and application conditions are to be collected for each part to model TTF estimates. Apart from these, the scaling factor in every failure model is not provided because it changes for each failure mechanism and for each process technology. The units of the TTF estimates also depend on the scaling factor value which is not provided in the document. 38 3. Process Technology Via Part Manufacturer Documents The process technology keeps reducing by following Moore's law, and the technologies implemented are moving towards their limitations of materials and processes [65]. This has brought about innovative design rules by companies that vary from each other. As a result, a uniform approach is not being followed as in previous generations. In the past, design rules and specifications were easier to locate. It has become increasingly difficult to keep track of the complex and varying design rules developed for various applications by different companies. This is due to the decreasing portability, universal testability, and regular public disclosure of information that the industry has lost. The design rules help estimate the reliability of the product [66]. There are failure models through which reliability simulations may be run or calculated. However, to use them, design rules need to be inputted to provide the models with the information they need. This requirement of design rules, combined with the fact that they are now less readily available, makes it more difficult for individuals and companies to estimate the reliability of their components. Failure models depend on parameters such as part geometry, model constants, and application conditions to estimate the part's time to failure. Identifying these parameters is critical in obtaining accurate estimates of time to failure, which provides valuable insights to perform the part selection. In addition, critical decisions can be made based on a part's geometric information, which is discussed in this chapter. This chapter provides information on process technology, its benefits, and where to obtain it. Further, to better explain the process of obtaining this information, three different procedures are provided along with a summary of various documents to identify the information. 39 3.1. What is Process Technology? When speaking about semiconductors, they are often referred to by their node size by the part manufacturers. A semiconductor's node size is a measure of a technology's feature size and is also commonly referred to as process technology [67]. For example, a 32 nm node, a 32 nm process technology, and a 32 nm feature size all refer to the same semiconductor process technology. The question remains, what metric does a semiconductor's feature size refer to? The feature size is defined as the minimum length of the channel, as shown in Figure 18 [68]. Feature size is sometimes synonymous with process node, a value of lambda (in general, the length of the channel). In addition to the feature size, many other geometries and properties of a semiconductor must be considered when assessing reliability. These specifications ensure that the design will be within the restraints of the manufacturing process as well as within the boundaries of physics needed for proper device functionality. All the geometries that restrict the design are collectively called design rules [69]. Examples of common geometric design rules include but are not limited to different parts' width, spacing, and pitch. Figure 18: Physical representation of node size or process technology (L) [68]. 40 3.2. Benefits of Obtaining Process Technology Identifying the part process technology primarily supports decision-making on whether to consider a particular failure mechanism as critical or not. Each part consists of process technology based on which the part is manufactured [70]. As mentioned in section 2.2, failure mechanisms can be ranked, and the critical mechanisms that affect the part reliability based on process technology can be identified. Therefore, critical failure mechanisms can be identified for a given part based on the part process technology. In addition, for a given failure mechanism, an appropriate failure model can be selected for time-to-failure estimation from various models based on process technology [71]. For example, the time-dependent dielectric breakdown mechanism has the E and 1/E failure models, as shown in section 2.1.2. Selection of the failure model for a part is performed based on the gate oxide thickness of the dielectric present between the gate and silicon substrate. If the gate oxide thickness is greater than 4 nm, the E model correlates better, and the 1/E model if less than 4 nm. In general, part manufacturers should provide gate oxide thickness. However, as discussed in Chapter 1, part manufacturers avoid providing critical information. This makes obtaining the gate oxide thickness parameter challenging. Nevertheless, knowing the process technology of part can provide insights into various geometric properties from literature sources through which a plot summary can be made to estimate the oxide thickness based on process technology. If no part-specific information can be found for a given part, the next step is to work backward from the year of part introduction to the industry. Following Moore's Law node size scaling, which states that a transistor node size halves every two years, using the year of introduction (adjusted by two years for design and development time), one can estimate the transistor node size [72]. By surveying similar products from the same year of introduction, one 41 can estimate the product characteristics and use them in further failure analysis. Figure 19 shows an example of this method. Additionally, by looking at research papers from the year of introduction, one can extract information on the state of the art of product modeling at that time. Figure 19: Using Moore's Law scaling to estimate node size and, in due course, estimate the effective gate length for the transistor. Furthermore, industry white papers on semiconductors for process technology, as shown in Figure 20, can provide insights into geometric parameters [73]. For example, the selection of failure models for hot carrier injection mechanism is based on channel gate length. Additionally, the time-dependent dielectric breakdown E model requires a gate oxide thickness parameter as an input to estimate the time to failure. Identifying the metal interconnect thickness helps determine the occurrence of electromigration failure in the part. 42 Figure 20: Industry white paper providing information for the 1.2 Β΅m CMOS process family [73]. Overall, understanding the process technology provides insights into other geometry or physical parameters that help create a list of mechanisms and requirements that optimizes the time and efforts towards performing reliability assessment. In further steps, process technology can be used to identify the critical failure mechanisms for a given process technology and selection of appropriate failure model for the failure mechanism specific to the part. Moreover, few failure models require geometric parameters as inputs to estimate the time to failure. Therefore, understanding the process technology of a given part is critical to perform reliability assessment and part selection. 43 3.3. Sources to Obtain Process Technology To gather information about each part that is intended for application, the user has to investigate two main sources of information. These two sources of information are documents directly supplied by the manufacturer and the estimation of characteristics using related parts from a literature survey. Table 6 shows an example of part information sources. Table 6: Example of part information sources. Source Type Data Source Information Available Manufacturer Data Sources Datasheets [74] β€’ Part ratings (Operating conditions / Thermal information) β€’ Environmental ratings (Operating temperature ranges / Moisture sensitivity level) β€’ Characteristics curves (e.g., Junction-to- ambient thermal resistance vs. Air velocity) Qualification Reports [75] β€’ Die dimensions / Design rule. β€’ Stress test results (e.g., High-temperature operating life) β€’ Ongoing reliability monitoring (FAB / Assembly process reliability data) β€’ Material composition Material Declaration [76] β€’ Material composition by part structure β€’ Environmental rating (RoHS / REACH) Product Change Notifications (PCN) [54] [77] β€’ Material composition β€’ Part structure / Packaging β€’ Qualification results Production Part Approval Process (PPAP) β€’ Die Dimensions / Design Rule β€’ Material performance test (e.g., AEC-Q100 test results) 44 Source Type Data Source Information Available Reliability Reports β€’ Qualification summary β€’ Material content Other Sources Industry Standards [23] β€’ Qualification testing levels, Model constants (e.g., JEP122), Literature Research Papers [23] [35] β€’ Model parameters/constants, Activation energy information, Node size, and associated part parameters The part webpage serves as an introduction to the part and a gateway for more detailed information through downloadable documents and other linked pages. Each part manufacturer has their own style of part information page, which will vary but are usually consistent within a single manufacturer. The page traditionally features at least a link to the datasheet, some basic ratings and features, a set of documents associated with the part, and an image of the part. Unlike datasheets or other .pdf files, part information webpages are easily searchable and allow quick compilation of information. Figure 21 is an example of the Maxim part for which the process technology is identified through the reliability report obtained from the part webpage [78]. Figure 21: Process technology obtained through the original component manufacturer (OCM) website [78]. The classical document used for gaining part information is the part datasheet. The part datasheet outlines the performance parameters, operating conditions, physical dimensions, and other aspects depending on the type of part. Every part or family of parts will have a datasheet. Each datasheet typically includes the following sections: introduction, absolute maximum 45 ratings, electrical specifications, part layout, contact information, and document updates. Depending on the manufacturer, these sections will vary in name and order, but these sections at minimum, should be included in an effective datasheet. When compared to previous reports on part datasheets, datasheets now rarely include testing or quality data sections as these are in their documents. Figure 22 represents the process technology provided in datasheets of part manufacturers such as Microsemi, Actel Corporation, and Texas Instruments [79], [80], [81]. Figure 22: Datasheets of various part manufacturers as information sources for process technology [79], [80], [81]. Part manufacturers provide their customers with user guide documents, such as portfolio brochures, including critical information about the part. One such example of the part manufactured by Texas Instruments for which the process technology is obtained is shown in Figure 23 [82]. Figure 23: Process technology from user guide document [82]. Automotive parts that meet the qualification requirements of AEC (Automotive Electronic Council) must complete PPAP. This process requires documentation of a part's design, manufacture, and status to the customer. Each PPAP document contains several sections: material information, qualification testing results, and design/process failure mode effect analysis. It has 46 to be noted that the availability of PPAP documentation varies between parts and manufacturers. Figure 24 shows the die technology parameters provided by Texas Instruments in part PPAP document. Figure 24: Die technology parameters provided in a PPAP document. The qualification report for a specific part provides information about the part, including information from different tests completed on the part. This testing can include thermal, electrical, and physical testing and characterization of the part that the manufacturer completes. In addition to testing results, information about the material composition of the packaging and die structure can also be provided. Figure 25 provides the process technology/design rule for a part manufactured by TSMC, obtained through the qualification report [83]. Figure 25: Process technology/design rule for a part obtained through the qualification report [83]. Product Change Notifications (PCNs) are documents manufacturer issues that detail any change in the parts' construction, operation, or capabilities. PCNs can provide information regarding a change in manufacturing location, a change in material composition, or an alteration in the expected performance parameters for the part due to updated testing or other changes. Unlike datasheets or qualification reports, PCNs will typically be for a range of products provided by the manufacturer, not an individual part. At a bare minimum, a PCN must include a 47 unique code or number for identification, a definition or classification of the proposed part changes, the timing for the change, the deliverables to the customer, and record retention requirements. PCNs can be utilized in the part selection process for initial and long-term considerations. The initial selection process can require information only found in a PCN for the part and long-term evaluation for monitoring if the part still maintains the required performance levels for the application. For example, a PCN report on changing the process technology from 0.5 Β΅m to 0.35 Β΅m at a Fab location is released by TSMC [84]. Process technology for the part can be assumed from roadmaps available based on the product's year of introduction and maturity. If no part-specific information can be found for a given part, the next step is to work backward from the year of part introduction to the industry. Following Moore's Law node size scaling, which states that a transistor node size halves every two years, using the year of introduction (adjusted by two years for design and development time), one can estimate the transistor process technology. Figure 26 provides the year of introduction of process technologies [85]. Table 7 summarizes the current state of the art of selected feature sizes from 65 to 7 nm [86], [87]. The maturity of each technology is split into three categories generally: β€’ 'High' maturity corresponds to nodes for which semiconductor foundries are researching advanced chips for specialty applications. β€’ 'Moderate' maturity is associated with manufacturing facilities reporting research and development efforts towards basic process improvements or developing new lithography techniques. β€’ 'Low' maturity is for nodes at which full-scale manufacturing has not yet begun. 48 In general, foundries are working to scale these nodes to production volume within a few years. Table 7 also provides approximate market penetration figures. These estimates stem from two semiconductor foundries' annual financial reports, which provided revenue figures broken down by the wafer process node. Annual revenue totals then provided a means to estimate approximate market share relative to total FY19 revenue, as shown. Note that approximately 43.9% of wafer sales revenue was reported for process nodes larger than 65 nm, which is therefore not represented in the table below. The state of use is summarized below and defined based on market share and on the variety of current-generation chipsets using technology based at that node. Table 7 also provides examples of the chipsets typically manufactured at a given node for additional context. Figure 26: Process technology vs. year of introduction [85]. 49 Table 7: Current state of the art of process nodes or technologies [86], [87]. 65 nm Market Penetration: 22.6 % Maturity: High. Introduced in 2005. United Microelectronics Corporation (UMC) developing specialty technology at this node in 2019. Use: High. Used in relatively low-computing power and non-space constrained applications such as computer and server processors, and digital cameras. 40/45 nm Market Penetration: 15.4 % Maturity: High. Introduced in 2007. UMC was developing specialty technology at this node in 2019. Use: High. Used in automotive processors, image processing, personal computers and servers, and low-power chipsets. 28 nm Market Penetration: 6.7 % Maturity: High to Moderate. Introduced in 2011. UMC is working on process improvement research for production in 2021. Samsung was developing SoC at this node for production in 2019. Use: Moderate. Used in entry-level smartphones and workstations, advanced SoC designs, desktop computers and some machine learning chipset architectures. 20 nm Market Penetration: 0.2 % Maturity: Moderate. Introduced in 2014. UMC developing process improvements for implementation in 2020. Use: Low. Intermediate process node used by some inexpensive smartphones and specialty computing hardware. 16 nm Market Penetration: 4.5 % Maturity: Moderate. Introduced in 2014. Semiconductor Manufacturing International Company (SMIC) completed initial research and development in 2019 to begin large-scale production [2]. UMC completed product assessment in 2019 to begin risk production in 2020. Use: Moderate. Current generation of laptops and desktop processors, high- performance smartphones, supercomputer processors, graphics processing 50 units, computer vision chipsets, random access memory controllers. 10 nm Market Penetration: 0.6 % Maturity: Low. Introduced in 2016. Samsung was developing DRAM and processor technology at this node in 2017 & 2018. Use: Low. Applications are similar to those at the 16 nm node, but manufacturing is not yet fully mature. The industry is scheduling the production of high-end smartphones implementing this technology in 2021. 7 nm Market Penetration: 6.1 % Maturity: Low. Introduced in 2018. SMIC research and development focused on developing 7 and 5 nm large-scale production implementations. Samsung was researching new 7 nm lithography processes in 2018 and planning to use some 7 nm chipsets in the Galaxy Note 10 in 2019. Use: Low. Supercomputer processors, neural network processing, high-end smartphones, graphics processing units, and personal computers. Applications in development for production 2021-2022. Figure 27: Documents through which part manufacturers can report process technology. 51 Overall, obtaining the process technology of the active electronic component is challenging but not impossible. It is possible to obtain part process technology based on the assessment of the information provided by the original part manufacturer. In addition, engineering judgment can be made based on the year of introduction, part type, and roadmaps developed by the industry (e.g., ITRS, IMEC). Based on the active integrated circuit (IC) parts used for reliability assessment, a procedure to obtain the process technology through various manufacturer-provided documents is developed, as shown in Figure 27. These documents can be used as a starting point to obtain process technology, which can be extended further if needed. 3.4. Methodology to Obtain Process Technology The following steps, as shown in Figure 28, outline a systematic approach for obtaining process technology information from part manufacturer documents. The first step involves collecting relevant documents such as datasheets, qualification reports, reliability reports, PCNs, and PPAPs, as discussed in Section 3.3. These documents should specifically pertain to the desired part, and the very first source of information or identifying part-related documents is the part manufacturer's website or webpage. Subsequently, thoroughly review the collected documents to identify any explicit or implicit information related to the process technology. Cross-referencing the information obtained from different documents is crucial to identify consistencies and discrepancies. It helps validate the accuracy and completeness of the obtained information. Validating the document is vital when the identified document is related to the part being purchased. It is essential to ensure that the information provided in the datasheet accurately represents the specifications and characteristics of the part. Figure 29 provides a procedure to validate the datasheet. 52 Figure 28: Methodology to obtain part process technology. Formulation of assumptions is vital when specific process technology information is not explicitly mentioned in the part manufacturer documents. These assumptions are based on similar functional parts or components fabricated by the same manufacturer or within the same fab location. Industry knowledge, technological trends, and experience are valuable inputs for formulating reasonable assumptions regarding the process technology parameters. 53 Figure 29: Example of procedure to validate datasheet. In addition to the previously mentioned steps, further assumptions can be made based on additional factors such as the fab location, PCNs, date codes, and press releases provided by the manufacturer. The fab location can play a significant role in determining the process technology employed. Different fabs may have specific capabilities or expertise, and understanding the site can provide insights into the likely process technologies used. PCN notifications can sometimes reveal information related to process technology modifications. Press releases or public announcements by the manufacturer can also be valuable sources of information. Manufacturers often share updates about process technology advancements, collaborations, or new manufacturing capabilities. Monitoring such announcements can provide helpful context and indications of the process technology employed by the manufacturer. By considering these additional factors, researchers can make informed assumptions while estimating process technology. While these assumptions may not provide direct or explicit process technology information, they serve as process technology estimation. 54 Consulting industry-developed roadmaps, such as technology nodes or industry standards, is another essential step in the methodology. These roadmaps provide a general understanding of process technology advancements and can guide the estimation process. The process technology estimation is achievable by leveraging the year of introduction or the fabrication timeline of the part. There are various sources of information to obtain part process technology, but identifying the best source depends on the risk associated with the document and the assumptions. Figure 30 provides the sources of information from the low to high risk, which can affect the accuracy of the obtained process technology. For example, identifying the process technology through the part website has low risk as the information on the part manufacturer's website is the most reliable source. Similarly, process technology estimated through roadmaps is of greater risk