首页 > 最新文献

International Symposium on Quality Electronic Design (ISQED)最新文献

英文 中文
Early system level modeling of real-time applications on embedded platforms 嵌入式平台上实时应用的早期系统级建模
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523666
Richard Lee, Karim Abdel-Khalek, S. Abdi, Frederic Risacher
This paper describes a methodology for developing abstract and executable system-level model in SystemC of real-time embedded software, targeted to an RTOS. We design a RTOS emulation layer, called RESC, on top of the SystemC kernel. The application software is linked against the emulation layer to create an executable model of the software. The model can be integrated into system level HW-SW models which can be used for fast, accurate and early system validation. We first identify key real-time software constructs such as task-level concurrency, priorities, timers, pulses, and message-passing communication. We, then, define equivalent abstractions of the constructs in RESC on top of the SystemC library. We validated our models using industrial-size examples such as MP3 decoder and Vocoder. The experimental results show that our models are very accurate (<; 1% error) and significantly faster (up to 11X) than real-time software execution on target platform.
本文介绍了一种以实时嵌入式软件为对象,在SystemC中开发抽象的、可执行的系统级模型的方法。我们在SystemC内核之上设计了一个RTOS仿真层,称为RESC。应用软件与仿真层相关联,以创建软件的可执行模型。该模型可集成到系统级HW-SW模型中,用于快速、准确和早期的系统验证。我们首先确定关键的实时软件结构,如任务级并发性、优先级、计时器、脉冲和消息传递通信。然后,我们在SystemC库的基础上定义RESC中构造的等价抽象。我们使用MP3解码器和声码器等工业规模的示例验证了我们的模型。实验结果表明,我们的模型非常准确(<;1%的错误),并且比目标平台上的实时软件执行要快得多(高达11倍)。
{"title":"Early system level modeling of real-time applications on embedded platforms","authors":"Richard Lee, Karim Abdel-Khalek, S. Abdi, Frederic Risacher","doi":"10.1109/ISQED.2013.6523666","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523666","url":null,"abstract":"This paper describes a methodology for developing abstract and executable system-level model in SystemC of real-time embedded software, targeted to an RTOS. We design a RTOS emulation layer, called RESC, on top of the SystemC kernel. The application software is linked against the emulation layer to create an executable model of the software. The model can be integrated into system level HW-SW models which can be used for fast, accurate and early system validation. We first identify key real-time software constructs such as task-level concurrency, priorities, timers, pulses, and message-passing communication. We, then, define equivalent abstractions of the constructs in RESC on top of the SystemC library. We validated our models using industrial-size examples such as MP3 decoder and Vocoder. The experimental results show that our models are very accurate (<; 1% error) and significantly faster (up to 11X) than real-time software execution on target platform.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"160 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115970108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Reliability consideration with rectangle- and double-signal through silicon vias insertion in 3D thermal-aware floorplanning 三维热感知地板规划中矩形和双信号硅孔插入的可靠性考虑
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523628
Chih-han Hsu, S. Ruan, Ying-Jung Chen, Tsang-Chi Kan
Vertical integration of layers in a 3D IC exacerbates thermal problem especially for reliability degradation. Low reliability can not only damage the whole circuits but also cause unexpected performance loss. In this paper, we conduct the SA engine with rectangle-STSVs and double-STSVs for improving reliability. The earlier research indicates that the more STSVs a chip has, the better the reliability is. However, it also implies a larger area. Therefore, we develop a methodology to manipulate thermal-aware floorplan with the tradeoff among the number of STSVs, reliability, and area of a chip. Moreover, we manage our manipulated floorplan with precise thermal model for TTSVs insertion at via channel. Experimental results show that more than 80% of single-STSVs can be replaced by rectangle-STSVs or double-STSVs, thereby improving reliability. Furthermore, temperature can be maintained around 80°C with minimal TTSVs after inserting TTSVs.
三维集成电路中的垂直集成层加剧了热问题,特别是可靠性下降。低可靠性不仅会损坏整个电路,而且会造成意想不到的性能损失。为了提高可靠性,本文采用矩形stsvs和双stsvs对SA发动机进行了设计。早期的研究表明,芯片的STSVs越多,可靠性越好。然而,它也意味着更大的面积。因此,我们开发了一种方法来操纵热感知平面,并在STSVs数量,可靠性和芯片面积之间进行权衡。此外,我们用精确的热模型来管理我们的操纵平面,以使TTSVs在通过通道插入。实验结果表明,80%以上的单stv可以被矩形stv或双stv所取代,从而提高了可靠性。此外,在插入TTSVs后,温度可以保持在80°C左右,并且TTSVs最小。
{"title":"Reliability consideration with rectangle- and double-signal through silicon vias insertion in 3D thermal-aware floorplanning","authors":"Chih-han Hsu, S. Ruan, Ying-Jung Chen, Tsang-Chi Kan","doi":"10.1109/ISQED.2013.6523628","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523628","url":null,"abstract":"Vertical integration of layers in a 3D IC exacerbates thermal problem especially for reliability degradation. Low reliability can not only damage the whole circuits but also cause unexpected performance loss. In this paper, we conduct the SA engine with rectangle-STSVs and double-STSVs for improving reliability. The earlier research indicates that the more STSVs a chip has, the better the reliability is. However, it also implies a larger area. Therefore, we develop a methodology to manipulate thermal-aware floorplan with the tradeoff among the number of STSVs, reliability, and area of a chip. Moreover, we manage our manipulated floorplan with precise thermal model for TTSVs insertion at via channel. Experimental results show that more than 80% of single-STSVs can be replaced by rectangle-STSVs or double-STSVs, thereby improving reliability. Furthermore, temperature can be maintained around 80°C with minimal TTSVs after inserting TTSVs.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121993309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Fast analog design optimization using regression-based modeling and genetic algorithm: A nano-CMOS VCO case study 基于回归建模和遗传算法的快速模拟设计优化:纳米cmos压控振荡器案例研究
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523643
D. Ghai, S. Mohanty, G. Thakral
The mature electronic design automation (EDA) tools and well-defined abstraction-levels for digital circuits have almost automated the digital design process. However, analog circuit design and optimization is still not automated. Custom design of analog circuits and slow analog in SPICE has always needed maximum efforts, skills, design cycle time. This paper presents a novel design flow for constrained optimization of nano-CMOS analog circuits. The proposed analog design flow combines polynomial-regression based models and genetic algorithm for fast optimization. For evaluating the effectiveness of the proposed design flow, power minimization in a 50nm CMOS based current-starved voltage-controlled oscillator (VCO) is carried out, while treating oscillation frequency as a performance constraint. Accurate polynomial-regression based models are developed for power and frequency of the VCO. The goodness-of-fit of the models is evaluated using SSE, RMSE and R2. Using these models, we form a constrained optimization problem which is solved using genetic algorithm. The flow achieved 21.67% power savings, with a constraint of frequency ≥ 100 MHz. To the best of the authors' knowledge, this is the first study which approaches a VCO design problem as a mathematical constrained optimization involving the usage of regression based modeling and genetic algorithm.
成熟的电子设计自动化(EDA)工具和定义良好的数字电路抽象层几乎使数字设计过程自动化。然而,模拟电路的设计和优化仍然不是自动化的。SPICE中模拟电路和慢速模拟的定制设计一直需要最大的努力,技能和设计周期时间。本文提出了一种新的纳米cmos模拟电路约束优化设计流程。所提出的模拟设计流程结合了基于多项式回归模型和遗传算法的快速优化。为了评估所提出的设计流程的有效性,在将振荡频率作为性能约束的同时,对基于50nm CMOS的电流匮乏压控振荡器(VCO)进行了功率最小化。建立了精确的基于多项式回归的压控振荡器功率和频率模型。使用SSE、RMSE和R2来评估模型的拟合优度。利用这些模型,形成一个约束优化问题,并用遗传算法求解。在频率约束≥100 MHz的情况下,该流节省了21.67%的功率。据作者所知,这是第一个将VCO设计问题作为数学约束优化的研究,涉及使用基于回归的建模和遗传算法。
{"title":"Fast analog design optimization using regression-based modeling and genetic algorithm: A nano-CMOS VCO case study","authors":"D. Ghai, S. Mohanty, G. Thakral","doi":"10.1109/ISQED.2013.6523643","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523643","url":null,"abstract":"The mature electronic design automation (EDA) tools and well-defined abstraction-levels for digital circuits have almost automated the digital design process. However, analog circuit design and optimization is still not automated. Custom design of analog circuits and slow analog in SPICE has always needed maximum efforts, skills, design cycle time. This paper presents a novel design flow for constrained optimization of nano-CMOS analog circuits. The proposed analog design flow combines polynomial-regression based models and genetic algorithm for fast optimization. For evaluating the effectiveness of the proposed design flow, power minimization in a 50nm CMOS based current-starved voltage-controlled oscillator (VCO) is carried out, while treating oscillation frequency as a performance constraint. Accurate polynomial-regression based models are developed for power and frequency of the VCO. The goodness-of-fit of the models is evaluated using SSE, RMSE and R2. Using these models, we form a constrained optimization problem which is solved using genetic algorithm. The flow achieved 21.67% power savings, with a constraint of frequency ≥ 100 MHz. To the best of the authors' knowledge, this is the first study which approaches a VCO design problem as a mathematical constrained optimization involving the usage of regression based modeling and genetic algorithm.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125235039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Vertically-addressed test structures (VATS) for 3D IC variability and stress measurements 垂直寻址测试结构(VATS),用于3D集成电路变异性和应力测量
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523596
C. O'Sullivan, P. Levine, S. Garg
We propose a new test array architecture-vertically-addressed test structures (VATS)-to experimentally characterize the within-tier and tier-to-tier process variations and through-silicon via (TSV) induced stress in 3D integrated circuits (ICs). The proposed VATS architecture utilizes the benefits of 3D integration to simultaneously provide high density, low I/O pin utilization, and high fidelity. A test chip featuring eight VATS arrays (>15,000 active devices) has been designed and fabricated in a two-tier, 130-nm 3D IC technology. Simulation results highlight the advantages of the proposed VATS architecture compared to conventional 2D test arrays.We also propose a radial filtering scheme to discriminate between process variations and the impact of TSV-induced stress in 3D ICs.
我们提出了一种新的测试阵列架构——垂直寻址测试结构(VATS)——来实验表征层内和层间工艺变化以及三维集成电路(ic)中的硅通孔(TSV)诱导应力。提出的VATS架构利用3D集成的优势,同时提供高密度、低I/O引脚利用率和高保真度。采用两层130纳米3D集成电路技术设计和制造了具有8个VATS阵列(>15,000个有源器件)的测试芯片。仿真结果表明,与传统的二维测试阵列相比,所提出的VATS结构具有优势。我们还提出了一种径向滤波方案,以区分工艺变化和三维集成电路中tsv诱导应力的影响。
{"title":"Vertically-addressed test structures (VATS) for 3D IC variability and stress measurements","authors":"C. O'Sullivan, P. Levine, S. Garg","doi":"10.1109/ISQED.2013.6523596","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523596","url":null,"abstract":"We propose a new test array architecture-vertically-addressed test structures (VATS)-to experimentally characterize the within-tier and tier-to-tier process variations and through-silicon via (TSV) induced stress in 3D integrated circuits (ICs). The proposed VATS architecture utilizes the benefits of 3D integration to simultaneously provide high density, low I/O pin utilization, and high fidelity. A test chip featuring eight VATS arrays (>15,000 active devices) has been designed and fabricated in a two-tier, 130-nm 3D IC technology. Simulation results highlight the advantages of the proposed VATS architecture compared to conventional 2D test arrays.We also propose a radial filtering scheme to discriminate between process variations and the impact of TSV-induced stress in 3D ICs.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127187652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast FPGA-based fault injection tool for embedded processors 基于fpga的嵌入式处理器快速故障注入工具
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523654
Mohammad Shokrolah Shirazi, B. Morris, H. Selvaraj
FPGA-based fault injection methods have recently become more popular since they provide high speed in fault injection experiments. During each fault injection experiment, FPGA should send data related with observation points back to host computer for fault tolerant analysis. Since there is high data volume, FPGA should spend most of its time in communication. In this paper, we solve this problem by bringing all parts of fault injection tool inside FPGA. The area overhead problem related with observation data is obviated by using simple observation circuit. As case study, we injected 6400 SEU faults into OpensRISC 1200 processor over the Cyclone II FPGA. Results show that our fault injection experiments are done more than 400 times faster than one of the traditional FPGA based fault injection methods with only 5% area overhead.
基于fpga的故障注入方法由于提供了高速的故障注入实验,近年来越来越受欢迎。在每次故障注入实验中,FPGA将与观测点相关的数据发回上位机进行容错分析。由于数据量大,FPGA应该把大部分时间花在通信上。本文通过将故障注入工具的各个部分集成到FPGA中来解决这一问题。利用简单的观测电路,解决了观测数据的面积开销问题。作为案例研究,我们通过Cyclone II FPGA将6400个SEU故障注入到OpensRISC 1200处理器中。结果表明,我们的故障注入实验比传统的基于FPGA的故障注入方法快400倍以上,而面积开销仅为5%。
{"title":"Fast FPGA-based fault injection tool for embedded processors","authors":"Mohammad Shokrolah Shirazi, B. Morris, H. Selvaraj","doi":"10.1109/ISQED.2013.6523654","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523654","url":null,"abstract":"FPGA-based fault injection methods have recently become more popular since they provide high speed in fault injection experiments. During each fault injection experiment, FPGA should send data related with observation points back to host computer for fault tolerant analysis. Since there is high data volume, FPGA should spend most of its time in communication. In this paper, we solve this problem by bringing all parts of fault injection tool inside FPGA. The area overhead problem related with observation data is obviated by using simple observation circuit. As case study, we injected 6400 SEU faults into OpensRISC 1200 processor over the Cyclone II FPGA. Results show that our fault injection experiments are done more than 400 times faster than one of the traditional FPGA based fault injection methods with only 5% area overhead.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"187 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114017982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Impacts of NBTI and PBTI effects on ternary CAM NBTI和PBTI效应对三元CAM的影响
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523588
Yen-Han Lee, Ing-Chao Lin, Sheng-Wei Wang
Ternary content addressable memory (TCAM), which can store 0, 1 and X in its cells, is widely used to store routing tables in network routers. Meanwhile, NBTI (Negative Bias Temperature Instability) and PBTI (Positive Biased Temperature Instability), which increase Vth and degrade transistor switching speed, have become major reliability challenges. In this paper, we propose a novel TCAM architecture to reduce BTI degradation using a bit-flipping technique. This novel TCAM architecture ensures the correctness of read, write and search operations. We also analyze the signal probabilities of TCAM cells, and demonstrate that the bit-flipping technique can balance signal probabilities. By using the bit-flipping technique, 76.40% of the data cells under investigation were found to have signal probabilities close to 50%, which is 62.80% higher than the original architecture. In addition, 92.60% of the mask cells had signal probabilities close to 50%, which is 91.20% higher than the original architecture. When considering the overhead of the bit-flipping technique, the best flipping frequency is once a day.
三元内容可寻址存储器(TCAM)在网络路由器中广泛用于存储路由表,它可以在单元中存储0、1和X。同时,NBTI(负偏置温度不稳定性)和PBTI(正偏置温度不稳定性)增加了v值,降低了晶体管的开关速度,成为主要的可靠性挑战。在本文中,我们提出了一种新的TCAM架构,利用比特翻转技术来减少BTI的退化。这种新颖的TCAM架构保证了读、写和搜索操作的正确性。我们还分析了TCAM单元的信号概率,并证明了比特翻转技术可以平衡信号概率。通过使用比特翻转技术,76.40%的被调查数据单元的信号概率接近50%,比原始架构提高了62.80%。此外,92.60%的掩模单元的信号概率接近50%,比原结构提高了91.20%。考虑到比特翻转技术的开销,最好的翻转频率是一天一次。
{"title":"Impacts of NBTI and PBTI effects on ternary CAM","authors":"Yen-Han Lee, Ing-Chao Lin, Sheng-Wei Wang","doi":"10.1109/ISQED.2013.6523588","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523588","url":null,"abstract":"Ternary content addressable memory (TCAM), which can store 0, 1 and X in its cells, is widely used to store routing tables in network routers. Meanwhile, NBTI (Negative Bias Temperature Instability) and PBTI (Positive Biased Temperature Instability), which increase Vth and degrade transistor switching speed, have become major reliability challenges. In this paper, we propose a novel TCAM architecture to reduce BTI degradation using a bit-flipping technique. This novel TCAM architecture ensures the correctness of read, write and search operations. We also analyze the signal probabilities of TCAM cells, and demonstrate that the bit-flipping technique can balance signal probabilities. By using the bit-flipping technique, 76.40% of the data cells under investigation were found to have signal probabilities close to 50%, which is 62.80% higher than the original architecture. In addition, 92.60% of the mask cells had signal probabilities close to 50%, which is 91.20% higher than the original architecture. When considering the overhead of the bit-flipping technique, the best flipping frequency is once a day.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125758334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An arbitrary stressed NBTI compact model for analog/mixed-signal reliability simulations 用于模拟/混合信号可靠性仿真的任意应力NBTI紧凑模型
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523587
J. Wan, H. Kerkhoff
A compact NBTI model is presented by directly solving the reaction-diffusion (RD) equations in a simple way. The new model can handle arbitrary stress conditions without solving time-consuming equations and is hence very suitable for analog/mixed-signal NBTI simulations in SPICE-like environments. The model has been implemented in Cadence ADE with Verilog-A and also takes the stochastic effect of aging into account. The simulation speed has increased at least thousands times. The performance of the model is validated by both RD theoretical solutions as well as silicon results.
通过直接求解反应扩散方程,给出了一个紧凑的NBTI模型。新模型可以处理任意应力条件,而无需求解耗时的方程,因此非常适合在类似spice的环境中进行模拟/混合信号NBTI仿真。该模型已在Cadence ADE和Verilog-A中实现,并考虑了老化的随机效应。模拟速度至少提高了数千倍。该模型的性能得到了RD理论解和硅实验结果的验证。
{"title":"An arbitrary stressed NBTI compact model for analog/mixed-signal reliability simulations","authors":"J. Wan, H. Kerkhoff","doi":"10.1109/ISQED.2013.6523587","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523587","url":null,"abstract":"A compact NBTI model is presented by directly solving the reaction-diffusion (RD) equations in a simple way. The new model can handle arbitrary stress conditions without solving time-consuming equations and is hence very suitable for analog/mixed-signal NBTI simulations in SPICE-like environments. The model has been implemented in Cadence ADE with Verilog-A and also takes the stochastic effect of aging into account. The simulation speed has increased at least thousands times. The performance of the model is validated by both RD theoretical solutions as well as silicon results.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128959245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Reducing IR drop in 3D integration to less than 1/4 using Buck Converter on Top die (BCT) scheme 采用Buck Converter on Top die (BCT)方案,将3D集成中的IR下降降低到1/4以下
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523612
Y. Shinozuka, H. Fuketa, K. Ishida, F. Furuta, K. Osada, K. Takeda, M. Takamiya, T. Sakurai
This paper proposes a method to reduce the supply voltage IR drop of 3D stacked-die systems by implementing an on-chip Buck Converter on Top die (BCT) scheme. The IR drop is caused by the parasitic resistance of Through Silicon Vias (TSV's) used in the 3D integration. The IR drop reduction and the overhead associated with the BCT scheme are modeled and analyzed. A 3D stacked-die system is manufactured using 90nm CMOS technology with TSV's and a silicon interposer. A chip inductor and chip capacitors for the buck converter are mounted directly on the top die. The reduction of the IR drop to less than 1/4 is verified through experiments.
本文提出了一种采用片上Buck转换器(BCT)的方法来降低三维堆叠芯片系统的电源电压IR降。红外下降是由三维集成中使用的硅通孔(TSV)的寄生电阻引起的。对BCT方案的红外降噪和开销进行了建模和分析。采用90nm CMOS技术与TSV和硅中间体制造了3D堆叠芯片系统。用于降压转换器的片式电感器和片式电容器直接安装在上模上。通过实验验证了将红外降降低到1/4以下。
{"title":"Reducing IR drop in 3D integration to less than 1/4 using Buck Converter on Top die (BCT) scheme","authors":"Y. Shinozuka, H. Fuketa, K. Ishida, F. Furuta, K. Osada, K. Takeda, M. Takamiya, T. Sakurai","doi":"10.1109/ISQED.2013.6523612","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523612","url":null,"abstract":"This paper proposes a method to reduce the supply voltage IR drop of 3D stacked-die systems by implementing an on-chip Buck Converter on Top die (BCT) scheme. The IR drop is caused by the parasitic resistance of Through Silicon Vias (TSV's) used in the 3D integration. The IR drop reduction and the overhead associated with the BCT scheme are modeled and analyzed. A 3D stacked-die system is manufactured using 90nm CMOS technology with TSV's and a silicon interposer. A chip inductor and chip capacitors for the buck converter are mounted directly on the top die. The reduction of the IR drop to less than 1/4 is verified through experiments.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130809819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A virtualization approach for MIPS-based MPSoCs 基于mips的mpsoc的虚拟化方法
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523674
A. Aguiar, C. Moratelli, M. Sartori, Fabiano Hessel
Recently, virtualization techniques has been investigated as an interesting approach for complex embedded systems designs since they allow more secure systems, improve software design quality and reduce costs. However, the need to meet design constraints, mainly the real-time constraints, constitutes one of the biggest challenges that may prevent the wide adoption of virtualization in embedded systems. Industry designers and researchers believe that the use of hardware support to virtualization is a possible way of improving the system's performance and meeting its real-time constraints. In this paper we present our virtualization-aware architecture intended for MIPS processors with support to real-time applications. In our proposed approach no changes are needed in the Guest OS since we implement a full virtualization scheme. Real-time constraints are achieved by mixing the full virtualization technique with hardware support along with bare-metal application usage. Details of our virtualization platform are presented and discussed in the paper. Results demonstrate the effectiveness of our approach considering the hardware impact in terms of area, the software performance overhead, and the operating system port to allow its execution in a virtualized environment.
最近,虚拟化技术作为复杂嵌入式系统设计的一种有趣的方法被研究,因为它们允许更安全的系统,提高软件设计质量并降低成本。然而,需要满足设计约束,主要是实时约束,构成了可能阻碍虚拟化在嵌入式系统中广泛采用的最大挑战之一。工业设计师和研究人员认为,使用硬件支持虚拟化是提高系统性能和满足其实时限制的一种可能方法。在本文中,我们提出了用于支持实时应用程序的MIPS处理器的虚拟化感知架构。在我们建议的方法中,由于我们实现了一个完整的虚拟化方案,因此不需要对来宾操作系统进行任何更改。实时约束是通过将完全虚拟化技术与硬件支持以及裸机应用程序的使用相结合来实现的。本文对我们的虚拟化平台进行了详细的介绍和讨论。结果表明,考虑到硬件对面积的影响、软件性能开销和允许在虚拟化环境中执行的操作系统端口,我们的方法是有效的。
{"title":"A virtualization approach for MIPS-based MPSoCs","authors":"A. Aguiar, C. Moratelli, M. Sartori, Fabiano Hessel","doi":"10.1109/ISQED.2013.6523674","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523674","url":null,"abstract":"Recently, virtualization techniques has been investigated as an interesting approach for complex embedded systems designs since they allow more secure systems, improve software design quality and reduce costs. However, the need to meet design constraints, mainly the real-time constraints, constitutes one of the biggest challenges that may prevent the wide adoption of virtualization in embedded systems. Industry designers and researchers believe that the use of hardware support to virtualization is a possible way of improving the system's performance and meeting its real-time constraints. In this paper we present our virtualization-aware architecture intended for MIPS processors with support to real-time applications. In our proposed approach no changes are needed in the Guest OS since we implement a full virtualization scheme. Real-time constraints are achieved by mixing the full virtualization technique with hardware support along with bare-metal application usage. Details of our virtualization platform are presented and discussed in the paper. Results demonstrate the effectiveness of our approach considering the hardware impact in terms of area, the software performance overhead, and the operating system port to allow its execution in a virtualized environment.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124554221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Hetero2 3D integration: A scheme for optimizing efficiency/cost of Chip Multiprocessors 异构三维集成:一种优化芯片多处理器效率/成本的方案
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523582
S. Priyadarshi, N. Choudhary, Brandon H. Dwiel, Ankita Upreti, E. Rotenberg, W. R. Davis, P. Franzon
Timing the transition of a processor design to a new technology poses a provocative tradeoff. On the one hand, transitioning as early as possible offers a significant competitive advantage, by bringing improved designs to market early. On the other hand, an aggressive strategy may prove to be unprofitable, due to the low manufacturing yield of a technology that has not had time to mature. We propose exploiting two complementary forms of heterogeneity to profitably exploit an immature technology for Chip Multiprocessors (CMP). First, 3D integration facilitates a technology alloy. The CMP is split across two dies, one fabricated in the old technology and the other in the new technology. The alloy derives benefit from the new technology while limiting cost exposure. Second, to compensate for lower efficiency of old-technology cores, we exploit application and microarchitectural heterogeneity: applications which gain less from technology scaling are scheduled on old-technology cores, moreover, these cores are retuned to optimize this class of application. For a defect density ratio of 200 between 45nm and 65nm, Hetero2 3D gives 3.6× and 1.5× higher efficiency/cost compared to 2D and 3D homogeneous implementations, respectively, with only 6.5% degradation in efficiency. We also present a sensitivity analysis by sweeping the defect density ratio. The analysis reveals the defect density break-even points, where homogeneous 2D and 3D designs in 45nm achieve the same efficiency/cost as Hetero2 3D, marking significant points in the maturing of the technology.
处理器设计向新技术过渡的时机是一个令人激动的权衡。一方面,尽早进行转换可以提供显著的竞争优势,因为可以将改进的设计尽早推向市场。另一方面,激进的策略可能被证明是无利可图的,因为一项技术的制造产量很低,还没有时间成熟。我们建议利用两种互补的异构形式来利用芯片多处理器(CMP)的不成熟技术。首先,3D集成促进了一种技术合金。CMP分为两个模具,一个用旧技术制造,另一个用新技术制造。这种合金从新技术中获益,同时降低了成本。其次,为了弥补老技术核心的低效率,我们利用应用程序和微架构的异质性:在老技术核心上调度从技术扩展中获得较少收益的应用程序,并且这些核心被返回以优化这类应用程序。在45nm和65nm之间的缺陷密度比为200时,与2D和3D均质实现相比,Hetero2 3D的效率/成本分别提高了3.6倍和1.5倍,效率仅下降6.5%。我们还提出了通过扫描缺陷密度比的灵敏度分析。分析揭示了缺陷密度的收支平衡点,在45nm的均匀2D和3D设计中,可以实现与Hetero2 3D相同的效率/成本,这标志着该技术的成熟。
{"title":"Hetero2 3D integration: A scheme for optimizing efficiency/cost of Chip Multiprocessors","authors":"S. Priyadarshi, N. Choudhary, Brandon H. Dwiel, Ankita Upreti, E. Rotenberg, W. R. Davis, P. Franzon","doi":"10.1109/ISQED.2013.6523582","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523582","url":null,"abstract":"Timing the transition of a processor design to a new technology poses a provocative tradeoff. On the one hand, transitioning as early as possible offers a significant competitive advantage, by bringing improved designs to market early. On the other hand, an aggressive strategy may prove to be unprofitable, due to the low manufacturing yield of a technology that has not had time to mature. We propose exploiting two complementary forms of heterogeneity to profitably exploit an immature technology for Chip Multiprocessors (CMP). First, 3D integration facilitates a technology alloy. The CMP is split across two dies, one fabricated in the old technology and the other in the new technology. The alloy derives benefit from the new technology while limiting cost exposure. Second, to compensate for lower efficiency of old-technology cores, we exploit application and microarchitectural heterogeneity: applications which gain less from technology scaling are scheduled on old-technology cores, moreover, these cores are retuned to optimize this class of application. For a defect density ratio of 200 between 45nm and 65nm, Hetero2 3D gives 3.6× and 1.5× higher efficiency/cost compared to 2D and 3D homogeneous implementations, respectively, with only 6.5% degradation in efficiency. We also present a sensitivity analysis by sweeping the defect density ratio. The analysis reveals the defect density break-even points, where homogeneous 2D and 3D designs in 45nm achieve the same efficiency/cost as Hetero2 3D, marking significant points in the maturing of the technology.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128967079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
International Symposium on Quality Electronic Design (ISQED)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1