首页 > 最新文献

2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)最新文献

英文 中文
Increasing PCM main memory lifetime 增加PCM主存储器寿命
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5456923
A. Ferreira, Miao Zhou, S. Bock, B. Childers, R. Melhem, D. Mossé
The introduction of Phase-Change Memory (PCM) as a main memory technology has great potential to achieve a large energy reduction. PCM has desirable energy and scalability properties, but its use for main memory also poses challenges such as limited write endurance with at most 107 writes per bit cell before failure. This paper describes techniques to enhance the lifetime of PCM when used for main memory. Our techniques are (a) writeback minimization with new cache replacement policies, (b) avoidance of unnecessary writes, which write only the bit cells that are actually changed, and (c) endurance management with a novel PCM-aware swap algorithm for wear-leveling. A failure detection algorithm is also incorporated to improve the reliability of PCM. With these approaches, the lifetime of a PCM main memory is increased from just a few days to over 8 years.
相变存储器(PCM)作为一种主要的存储技术,具有实现大幅度节能的巨大潜力。PCM具有理想的能量和可伸缩性特性,但是将其用于主存储器也带来了挑战,例如写入持久性有限,故障前每位单元最多写107次。本文介绍了提高PCM用作主存时寿命的技术。我们的技术是(a)使用新的缓存替换策略最小化回写,(b)避免不必要的写,只写实际更改的位单元,以及(c)使用用于损耗均衡的新颖pcm感知交换算法进行持久性管理。为了提高PCM的可靠性,还引入了故障检测算法。通过这些方法,PCM主存储器的寿命从几天增加到8年以上。
{"title":"Increasing PCM main memory lifetime","authors":"A. Ferreira, Miao Zhou, S. Bock, B. Childers, R. Melhem, D. Mossé","doi":"10.1109/DATE.2010.5456923","DOIUrl":"https://doi.org/10.1109/DATE.2010.5456923","url":null,"abstract":"The introduction of Phase-Change Memory (PCM) as a main memory technology has great potential to achieve a large energy reduction. PCM has desirable energy and scalability properties, but its use for main memory also poses challenges such as limited write endurance with at most 107 writes per bit cell before failure. This paper describes techniques to enhance the lifetime of PCM when used for main memory. Our techniques are (a) writeback minimization with new cache replacement policies, (b) avoidance of unnecessary writes, which write only the bit cells that are actually changed, and (c) endurance management with a novel PCM-aware swap algorithm for wear-leveling. A failure detection algorithm is also incorporated to improve the reliability of PCM. With these approaches, the lifetime of a PCM main memory is increased from just a few days to over 8 years.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128595084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 209
A low-area flexible MIMO detector for WiFi/WiMAX standards 适用于WiFi/WiMAX标准的低面积灵活MIMO检测器
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5457073
N. M. Madani, T. Thorolfsson, W. R. Davis
MIMO wireless technology is required to increase the data rates for a broad range of applications, including low cost mobile devices. In this paper we present a very low area reconfigurable MIMO detector which achieves a high throughput of 103Mbps and uses 27 Kilo Gates when implemented in a commercial 180nm CMOS process. The low area is achieved by the proposed in-place architecture. This architecture implements the K-best algorithm and reduces area 4-fold compared to the widely used multi-stage architecture, while provides reconfigurability in terms of antenna configuration during real-time operation.
为了提高包括低成本移动设备在内的广泛应用的数据速率,需要MIMO无线技术。在本文中,我们提出了一种非常低面积可重构的MIMO探测器,当在商业180nm CMOS工艺中实现时,它实现了103Mbps的高吞吐量,并使用27个基洛门。低面积是通过提议的就地建筑来实现的。该体系结构实现了K-best算法,与广泛使用的多阶段体系结构相比,面积减少了4倍,同时在实时运行期间提供了天线配置的可重构性。
{"title":"A low-area flexible MIMO detector for WiFi/WiMAX standards","authors":"N. M. Madani, T. Thorolfsson, W. R. Davis","doi":"10.1109/DATE.2010.5457073","DOIUrl":"https://doi.org/10.1109/DATE.2010.5457073","url":null,"abstract":"MIMO wireless technology is required to increase the data rates for a broad range of applications, including low cost mobile devices. In this paper we present a very low area reconfigurable MIMO detector which achieves a high throughput of 103Mbps and uses 27 Kilo Gates when implemented in a commercial 180nm CMOS process. The low area is achieved by the proposed in-place architecture. This architecture implements the K-best algorithm and reduces area 4-fold compared to the widely used multi-stage architecture, while provides reconfigurability in terms of antenna configuration during real-time operation.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127374261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Vision for cross-layer optimization to address the dual challenges of energy and reliability 展望跨层优化,解决能源和可靠性的双重挑战
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5456959
A. DeHon, H. Quinn, N. Carter
We are rapidly approaching an inflection point where the conventional target of producing perfect, identical transistors that operate without upset can no longer be maintained while continuing to reduce the energy per operation. With power requirements already limiting chip performance, continuing to demand perfect, upset-free transistors would mean the end of scaling benefits. The big challenges in device variability and reliability are driven by uncommon tails in distributions, infrequent upsets, one-size-fits-all technology requirements, and a lack of information about the context of each operation. Solutions co-designed across traditional layer boundaries in our system stack can change the game, allowing architecture and software (a) to compensate for uncommon variation, environments, and events, (b) to pass down invariants and requirements for the computation, and (c) to monitor the health of collections of devices. Cross-layer codesign provides a path to continue extracting benefits from further scaled technologies despite the fact that they may be less predictable and more variable. While some limited multi-layer mitigation strategies do exist, to move forward redefining traditional layer abstractions and developing a framework that facilitates cross-layer collaboration is necessary.
我们正迅速接近一个拐点,在这个拐点上,传统的目标是生产完美的、完全相同的晶体管,在不破坏的情况下工作,而在每次操作中继续减少能量。由于功率要求已经限制了芯片的性能,继续要求完美的、无干扰的晶体管将意味着规模优势的终结。设备可变性和可靠性方面的重大挑战是由分布中不常见的尾部、不常见的故障、一刀切的技术要求以及缺乏有关每次操作背景的信息所驱动的。在我们的系统堆栈中,跨传统层边界共同设计的解决方案可以改变游戏规则,允许架构和软件(a)补偿不常见的变化、环境和事件,(b)传递计算的不变量和要求,以及(c)监控设备集合的健康状况。跨层协同设计提供了一条途径,可以继续从进一步扩展的技术中获取好处,尽管它们可能更不可预测、更可变。虽然确实存在一些有限的多层缓解策略,但要向前推进,重新定义传统的层抽象并开发促进跨层协作的框架是必要的。
{"title":"Vision for cross-layer optimization to address the dual challenges of energy and reliability","authors":"A. DeHon, H. Quinn, N. Carter","doi":"10.1109/DATE.2010.5456959","DOIUrl":"https://doi.org/10.1109/DATE.2010.5456959","url":null,"abstract":"We are rapidly approaching an inflection point where the conventional target of producing perfect, identical transistors that operate without upset can no longer be maintained while continuing to reduce the energy per operation. With power requirements already limiting chip performance, continuing to demand perfect, upset-free transistors would mean the end of scaling benefits. The big challenges in device variability and reliability are driven by uncommon tails in distributions, infrequent upsets, one-size-fits-all technology requirements, and a lack of information about the context of each operation. Solutions co-designed across traditional layer boundaries in our system stack can change the game, allowing architecture and software (a) to compensate for uncommon variation, environments, and events, (b) to pass down invariants and requirements for the computation, and (c) to monitor the health of collections of devices. Cross-layer codesign provides a path to continue extracting benefits from further scaled technologies despite the fact that they may be less predictable and more variable. While some limited multi-layer mitigation strategies do exist, to move forward redefining traditional layer abstractions and developing a framework that facilitates cross-layer collaboration is necessary.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127341934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
An active vision system for fall detection and posture recognition in elderly healthcare 一种用于老年人跌倒检测和姿势识别的主动视觉系统
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5457055
G. Diraco, A. Leone, P. Siciliano
The paper presents an active vision system for the automatic detection of falls and the recognition of several postures for elderly homecare applications. A wall-mounted Time-Of-Flight camera provides accurate measurements of the acquired scene in all illumination conditions, allowing the reliable detection of critical events. Preliminarily, an off-line calibration procedure estimates the external camera parameters automatically without landmarks, calibration patterns or user intervention. The calibration procedure searches for different planes in the scene selecting the one that accomplishes the floor plane constraints. Subsequently, the moving regions are detected in real-time by applying a Bayesian segmentation to the whole 3D points cloud. The distance of the 3D human centroid from the floor plane is evaluated by using the previously defined calibration parameters and the corresponding trend is used as feature in a thresholding-based clustering for fall detection. The fall detection shows high performances in terms of efficiency and reliability on a large real dataset in which almost one half of events are falls acquired in different conditions. The posture recognition is carried out by using both the 3D human centroid distance from the floor plane and the orientation of the body spine estimated by applying a topological approach to the range images. Experimental results on synthetic data validate the correctness of the proposed posture recognition approach.
本文提出了一种用于老年人家庭护理中跌倒自动检测和几种姿势识别的主动视觉系统。壁挂式飞行时间(Time-Of-Flight)相机可在所有照明条件下对采集的场景进行精确测量,从而可靠地检测关键事件。初步,离线校准程序自动估计外部相机参数,不需要地标,校准模式或用户干预。标定过程在场景中搜索不同的平面,选择完成地板平面约束的平面。随后,通过对整个三维点云进行贝叶斯分割,实时检测运动区域。利用预先定义的校准参数评估三维人体质心与地板平面的距离,并将相应的趋势作为特征在基于阈值的聚类中进行跌倒检测。在一个大型真实数据集上,几乎有一半的事件是在不同条件下获得的跌倒,在效率和可靠性方面显示出很高的性能。该方法利用三维人体质心到地面的距离和通过对距离图像应用拓扑方法估计的人体脊柱方向来进行姿态识别。综合数据的实验结果验证了所提姿态识别方法的正确性。
{"title":"An active vision system for fall detection and posture recognition in elderly healthcare","authors":"G. Diraco, A. Leone, P. Siciliano","doi":"10.1109/DATE.2010.5457055","DOIUrl":"https://doi.org/10.1109/DATE.2010.5457055","url":null,"abstract":"The paper presents an active vision system for the automatic detection of falls and the recognition of several postures for elderly homecare applications. A wall-mounted Time-Of-Flight camera provides accurate measurements of the acquired scene in all illumination conditions, allowing the reliable detection of critical events. Preliminarily, an off-line calibration procedure estimates the external camera parameters automatically without landmarks, calibration patterns or user intervention. The calibration procedure searches for different planes in the scene selecting the one that accomplishes the floor plane constraints. Subsequently, the moving regions are detected in real-time by applying a Bayesian segmentation to the whole 3D points cloud. The distance of the 3D human centroid from the floor plane is evaluated by using the previously defined calibration parameters and the corresponding trend is used as feature in a thresholding-based clustering for fall detection. The fall detection shows high performances in terms of efficiency and reliability on a large real dataset in which almost one half of events are falls acquired in different conditions. The posture recognition is carried out by using both the 3D human centroid distance from the floor plane and the orientation of the body spine estimated by applying a topological approach to the range images. Experimental results on synthetic data validate the correctness of the proposed posture recognition approach.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"14 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128987908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 137
NBTI modeling in the framework of temperature variation 温度变化框架下的NBTI模型
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5457196
Seyab, S. Hamdioui
Negative Bias Temperature Instability (NBTI) has become an important reliability concern for nano-scaled Complementary Metal Oxide Semiconductor (CMOS) devices. In this paper, we present an analysis of temperature impact on various sub-processes that contribute to NBTI degradation. We demonstrate our analysis on 90nm industrial design operating in temperature range 25–125°C. The key temperature impacts observed in our simulation are: (a) the threshold voltage increase in P-type Metal Oxide Semiconductor (PMOS) due to NBTI is very sensitive to temperature, and increases by 34% due to the temperature increment, (b) the hole mobility in PMOS inversion layer reduces by 11% with the temperature increment, and (c) the temperature has a marginal impact on the transistor delay, that increases by 3% with the temperature increment.
负偏置温度不稳定性(NBTI)已成为纳米级互补金属氧化物半导体(CMOS)器件可靠性的重要问题。在本文中,我们分析了温度对NBTI降解的各个子过程的影响。我们展示了在温度范围为25-125°C的90nm工业设计上的分析。在我们的模拟中观察到的关键温度影响是:(a)由于NBTI引起的p型金属氧化物半导体(PMOS)的阈值电压对温度非常敏感,随着温度的增加而增加34%;(b) PMOS反转层的空穴迁移率随着温度的增加而降低11%;(c)温度对晶体管延迟的影响很小,随着温度的增加而增加3%。
{"title":"NBTI modeling in the framework of temperature variation","authors":"Seyab, S. Hamdioui","doi":"10.1109/DATE.2010.5457196","DOIUrl":"https://doi.org/10.1109/DATE.2010.5457196","url":null,"abstract":"Negative Bias Temperature Instability (NBTI) has become an important reliability concern for nano-scaled Complementary Metal Oxide Semiconductor (CMOS) devices. In this paper, we present an analysis of temperature impact on various sub-processes that contribute to NBTI degradation. We demonstrate our analysis on 90nm industrial design operating in temperature range 25–125°C. The key temperature impacts observed in our simulation are: (a) the threshold voltage increase in P-type Metal Oxide Semiconductor (PMOS) due to NBTI is very sensitive to temperature, and increases by 34% due to the temperature increment, (b) the hole mobility in PMOS inversion layer reduces by 11% with the temperature increment, and (c) the temperature has a marginal impact on the transistor delay, that increases by 3% with the temperature increment.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128899011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Fault diagnosis of analog circuits based on machine learning 基于机器学习的模拟电路故障诊断
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5457099
K. Huang, H. Stratigopoulos, S. Mir
We discuss a fault diagnosis scheme for analog integrated circuits. Our approach is based on an assemblage of learning machines that are trained beforehand to guide us through diagnosis decisions. The central learning machine is a defect filter that distinguishes failing devices due to gross defects (hard faults) from failing devices due to excessive parametric deviations (soft faults). Thus, the defect filter is key in developing a unified hard/soft fault diagnosis approach. Two types of diagnosis can be carried out according to the decision of the defect filter: hard faults are diagnosed using a multi-class classifier, whereas soft faults are diagnosed using inverse regression functions. We show how this approach can be used to single out diagnostic scenarios in an RF low noise amplifier (LNA).
讨论了模拟集成电路的故障诊断方案。我们的方法是基于一组学习机器,这些机器事先经过训练,可以指导我们做出诊断决策。中央学习机是一个缺陷过滤器,可以区分由于严重缺陷(硬故障)导致的故障设备和由于参数偏差过大(软故障)导致的故障设备。因此,缺陷滤波是建立统一的硬/软故障诊断方法的关键。根据缺陷滤波器的选择,可进行两种诊断:采用多类分类器诊断硬故障,采用逆回归函数诊断软故障。我们展示了如何使用这种方法在RF低噪声放大器(LNA)中挑选诊断场景。
{"title":"Fault diagnosis of analog circuits based on machine learning","authors":"K. Huang, H. Stratigopoulos, S. Mir","doi":"10.1109/DATE.2010.5457099","DOIUrl":"https://doi.org/10.1109/DATE.2010.5457099","url":null,"abstract":"We discuss a fault diagnosis scheme for analog integrated circuits. Our approach is based on an assemblage of learning machines that are trained beforehand to guide us through diagnosis decisions. The central learning machine is a defect filter that distinguishes failing devices due to gross defects (hard faults) from failing devices due to excessive parametric deviations (soft faults). Thus, the defect filter is key in developing a unified hard/soft fault diagnosis approach. Two types of diagnosis can be carried out according to the decision of the defect filter: hard faults are diagnosed using a multi-class classifier, whereas soft faults are diagnosed using inverse regression functions. We show how this approach can be used to single out diagnostic scenarios in an RF low noise amplifier (LNA).","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130646102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
enBudget: A Run-Time Adaptive Predictive Energy-Budgeting scheme for energy-aware Motion Estimation in H.264/MPEG-4 AVC video encoder enBudget: H.264/MPEG-4 AVC视频编码器中用于能量感知运动估计的运行时自适应预测能量预算方案
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5457093
M. Shafique, L. Bauer, J. Henkel
The limited energy resources in portable multimedia devices require the reduction of encoding complexity. The complex Motion Estimation (ME) scheme of H.264/MPEG-4 AVC accounts for a major part of the encoder energy. In this paper we present a Run-Time Adaptive Predictive Energy Budgeting (enBudget) scheme for energy-aware ME that predicts the energy budget for different video frames and different Macroblocks (MBs) in an adaptive manner considering the run-time changing scenarios of available energy, video frame characteristics, and user-defined coding constraints while keeping a good video quality. It assigns different Energy-Quality Classes to different video frames and fine-tunes at MB level depending upon the predictive energy quota in order to cope with above-mentioned run-time unpredictable scenarios. Compared to UMHexagonS, EPZS, and FastME, our enBudget scheme for energy-aware ME achieves an energy saving of up to 93%, 90%, 88% (average 88%, 77%, 66%), respectively. It suffers from an average Peak Signal to Noise Ratio (PSNR) loss of 0.29 dB compared to Full Search. We also demonstrate that enBudget is equally beneficial to various other state-of-the-art fast adaptive MEs (e.g.). We have evaluated our scheme for ASIC and various FPGAs.
便携式多媒体设备有限的能量资源要求降低编码复杂度。H.264/MPEG-4 AVC的复杂运动估计(complex Motion Estimation, ME)方案占了编码器能量的很大一部分。在本文中,我们提出了一种运行时自适应预测能量预算(enBudget)方案,该方案在保持良好视频质量的同时,考虑到可用能量的运行时变化场景、视频帧特征和用户自定义编码约束,以自适应的方式预测不同视频帧和不同宏块(mb)的能量预算。为了应对上述运行时不可预测的场景,它根据预测的能量配额为不同的视频帧分配不同的能量质量类,并在MB级别进行微调。与umhexagon、EPZS和FastME相比,我们的enBudget节能方案节能高达93%、90%、88%(平均分别为88%、77%和66%)。与全搜索相比,它的平均峰值信噪比(PSNR)损失为0.29 dB。我们还证明,enBudget同样有利于各种其他最先进的快速自适应MEs(例如)。我们已经在ASIC和各种fpga上评估了我们的方案。
{"title":"enBudget: A Run-Time Adaptive Predictive Energy-Budgeting scheme for energy-aware Motion Estimation in H.264/MPEG-4 AVC video encoder","authors":"M. Shafique, L. Bauer, J. Henkel","doi":"10.1109/DATE.2010.5457093","DOIUrl":"https://doi.org/10.1109/DATE.2010.5457093","url":null,"abstract":"The limited energy resources in portable multimedia devices require the reduction of encoding complexity. The complex Motion Estimation (ME) scheme of H.264/MPEG-4 AVC accounts for a major part of the encoder energy. In this paper we present a Run-Time Adaptive Predictive Energy Budgeting (enBudget) scheme for energy-aware ME that predicts the energy budget for different video frames and different Macroblocks (MBs) in an adaptive manner considering the run-time changing scenarios of available energy, video frame characteristics, and user-defined coding constraints while keeping a good video quality. It assigns different Energy-Quality Classes to different video frames and fine-tunes at MB level depending upon the predictive energy quota in order to cope with above-mentioned run-time unpredictable scenarios. Compared to UMHexagonS, EPZS, and FastME, our enBudget scheme for energy-aware ME achieves an energy saving of up to 93%, 90%, 88% (average 88%, 77%, 66%), respectively. It suffers from an average Peak Signal to Noise Ratio (PSNR) loss of 0.29 dB compared to Full Search. We also demonstrate that enBudget is equally beneficial to various other state-of-the-art fast adaptive MEs (e.g.). We have evaluated our scheme for ASIC and various FPGAs.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130880449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Cost modeling and cycle-accurate co-simulation of heterogeneous multiprocessor systems 异构多处理机系统的成本建模与周期精确联合仿真
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5457006
S. V. Haastregt, E. Halm, B. Kienhuis
In this paper, we present a method to analyze different implementations of stream-based applications on heterogeneous multiprocessor systems. We take both resource usage and performance constraints into account. For the first aspect we use an empirical cost model. For the second aspect we build a network of cycle-accurate processor simulators. The simulation and resource cost estimation have been integrated in an existing framework, allowing one to generate fast exploration simulations, cycle-accurate simulations and FPGA implementations from a single system level specification. We show that with our methodology cycle-accurate performance numbers of candidate systems can be obtained. In our experiments with the QR and MJPEG applications, we found that the error of our resource cost model is below two percent.
在本文中,我们提出了一种方法来分析基于流的应用程序在异构多处理器系统上的不同实现。我们同时考虑了资源使用和性能约束。对于第一个方面,我们使用经验成本模型。在第二方面,我们建立了一个周期精确的处理器模拟器网络。仿真和资源成本估算已经集成到现有框架中,允许从单个系统级规范生成快速勘探仿真,周期精确仿真和FPGA实现。我们表明,使用我们的方法可以获得候选系统的周期精确性能数字。在我们对QR和MJPEG应用程序的实验中,我们发现我们的资源成本模型的误差低于2%。
{"title":"Cost modeling and cycle-accurate co-simulation of heterogeneous multiprocessor systems","authors":"S. V. Haastregt, E. Halm, B. Kienhuis","doi":"10.1109/DATE.2010.5457006","DOIUrl":"https://doi.org/10.1109/DATE.2010.5457006","url":null,"abstract":"In this paper, we present a method to analyze different implementations of stream-based applications on heterogeneous multiprocessor systems. We take both resource usage and performance constraints into account. For the first aspect we use an empirical cost model. For the second aspect we build a network of cycle-accurate processor simulators. The simulation and resource cost estimation have been integrated in an existing framework, allowing one to generate fast exploration simulations, cycle-accurate simulations and FPGA implementations from a single system level specification. We show that with our methodology cycle-accurate performance numbers of candidate systems can be obtained. In our experiments with the QR and MJPEG applications, we found that the error of our resource cost model is below two percent.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127918734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Parallel X-fault simulation with critical path tracing technique 基于关键路径跟踪技术的并行x -故障仿真
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5456929
R. Ubar, S. Devadze, J. Raik, A. Jutman
In this paper, a new very fast fault simulation method to handle the X-fault model is proposed. The method is based on a two-phase procedure. In the first phase, a parallel exact critical path fault tracing is used to determine all the detected stuck-at faults in the circuit, and in the second phase a postprocess is launched which will determine the detectability of X-faults.
本文提出了一种新的快速故障仿真方法来处理x故障模型。该方法基于两阶段程序。在第一阶段,使用并行精确关键路径故障跟踪来确定电路中检测到的所有卡滞故障,在第二阶段启动后处理来确定x故障的可检测性。
{"title":"Parallel X-fault simulation with critical path tracing technique","authors":"R. Ubar, S. Devadze, J. Raik, A. Jutman","doi":"10.1109/DATE.2010.5456929","DOIUrl":"https://doi.org/10.1109/DATE.2010.5456929","url":null,"abstract":"In this paper, a new very fast fault simulation method to handle the X-fault model is proposed. The method is based on a two-phase procedure. In the first phase, a parallel exact critical path fault tracing is used to determine all the detected stuck-at faults in the circuit, and in the second phase a postprocess is launched which will determine the detectability of X-faults.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129242561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
An efficient transistor-level piecewise-linear macromodeling approach for model order reduction of nonlinear circuits 一种用于非线性电路降阶的晶体管级分段线性宏建模方法
Pub Date : 2010-03-08 DOI: 10.1109/DATE.2010.5457083
Xiaoda Pan, Fan Yang, Xuan Zeng, Yangfeng Su
Trajectory piecewise-linear macromodeling (TPWL) technique has been widely employed to characterize strong nonlinear circuits, and makes the reduction of the strong nonlinear circuits possible. The trajectory piecewise-linear macromodeling technique linearizes nonlinear circuits around multiple expansion points which are extracted from state trajectories driven by training inputs. However, the accuracy of the trajectory piecewise-linear macromodeling technique heavily relies on the extracted expansion points and the training inputs. It will lead to large error in simulation if state vector reaches regions far away from the extracted expansion points. In this paper, we propose an efficient transistor-level piecewise linearization scheme for macromodeling of nonlinear circuits. Piecewise linear models are first built for each transistor. The macromodel of the whole nonlinear circuit is then constructed by combining all the piecewise-linear models of the transistors together with appropriate weight functions. The proposed approach can cover remarkably larger state space than the TPWL method. By using the complete piecewise-linear models of the transistors, the constructed piecewise-linear models of the nonlinear circuits are capable of covering the whole state space of the nonlinear circuits. More importantly, model order reduction of the proposed transistor-level piecewise linearization macromodel is also possible, which makes the proposed method a potentially good macromodeling approach for model order reduction of nonlinear circuits.
轨迹分段线性宏观建模(TPWL)技术被广泛应用于强非线性电路的表征,使强非线性电路的简化成为可能。轨迹分段线性宏观建模技术将围绕多个扩展点的非线性电路线性化,这些扩展点是从训练输入驱动的状态轨迹中提取的。然而,轨迹分段线性宏建模技术的精度很大程度上依赖于提取的展开点和训练输入。如果状态向量到达距离提取的展开点较远的区域,将导致仿真误差较大。本文提出了一种有效的晶体管级分段线性化方案,用于非线性电路的宏观建模。首先为每个晶体管建立分段线性模型。然后将所有晶体管的分段线性模型与适当的权函数结合起来,构建整个非线性电路的宏观模型。该方法可以覆盖比TPWL方法更大的状态空间。利用晶体管的完整分段线性模型,所构建的非线性电路的分段线性模型能够覆盖非线性电路的整个状态空间。更重要的是,所提出的晶体管级分段线性化宏观模型的模型降阶也是可能的,这使得所提出的方法成为非线性电路模型降阶的一种潜在的良好的宏观建模方法。
{"title":"An efficient transistor-level piecewise-linear macromodeling approach for model order reduction of nonlinear circuits","authors":"Xiaoda Pan, Fan Yang, Xuan Zeng, Yangfeng Su","doi":"10.1109/DATE.2010.5457083","DOIUrl":"https://doi.org/10.1109/DATE.2010.5457083","url":null,"abstract":"Trajectory piecewise-linear macromodeling (TPWL) technique has been widely employed to characterize strong nonlinear circuits, and makes the reduction of the strong nonlinear circuits possible. The trajectory piecewise-linear macromodeling technique linearizes nonlinear circuits around multiple expansion points which are extracted from state trajectories driven by training inputs. However, the accuracy of the trajectory piecewise-linear macromodeling technique heavily relies on the extracted expansion points and the training inputs. It will lead to large error in simulation if state vector reaches regions far away from the extracted expansion points. In this paper, we propose an efficient transistor-level piecewise linearization scheme for macromodeling of nonlinear circuits. Piecewise linear models are first built for each transistor. The macromodel of the whole nonlinear circuit is then constructed by combining all the piecewise-linear models of the transistors together with appropriate weight functions. The proposed approach can cover remarkably larger state space than the TPWL method. By using the complete piecewise-linear models of the transistors, the constructed piecewise-linear models of the nonlinear circuits are capable of covering the whole state space of the nonlinear circuits. More importantly, model order reduction of the proposed transistor-level piecewise linearization macromodel is also possible, which makes the proposed method a potentially good macromodeling approach for model order reduction of nonlinear circuits.","PeriodicalId":432902,"journal":{"name":"2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)","volume":"144 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128781865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1