首页 > 最新文献

2011 14th Euromicro Conference on Digital System Design最新文献

英文 中文
Human++: Key Challenges and Trade-offs in Embedded System Design for Personal Health Care human++:个人医疗保健嵌入式系统设计中的关键挑战和权衡
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.115
H. D. Groot, M. Ashouei, J. Penders, V. Pop, M. Vidojkovic, B. Gyselinckx, R. Yazicioglu
The cost of health care in first-world countries is increasing dramatically as a result of advances in medicine, a population that is becoming older and an increasingly unhealthy lifestyle. Personal health care concepts where sensors within and around the body monitor and measure all kind of physiological signals can be an addition to medicare with high benefits. This concept allows patients to stay in their home environment and hence have a better quality of life with lower costs involved. For these reasons research and development is ongoing on many body worn and implantable sensor nodes. In this paper it is shown that application knowledge and understanding the contribution of different components to the system power consumption is the best starting point to make optimal trade-offs in the system design. This will minimize the overall power consumption of a sensor node without losing track of the major functionality needed. Besides the importance of system optimization, it is also shown that new components and circuit techniques need to be developed to achieve orders of magnitude increase in energy efficiency. This is a must to realize ultra-thin electrocardiogram patches as well as more demanding nodes with a small form factor like real-time Electro Encephalogram processing for brain computer interaction or neuro-implants.
由于医学进步、人口老龄化和生活方式日益不健康,第一世界国家的保健费用正在急剧增加。个人医疗保健概念,即身体内部和周围的传感器监测和测量各种生理信号,可以成为医疗保险的一个高收益补充。这一概念允许患者呆在他们的家庭环境中,从而以更低的成本获得更好的生活质量。由于这些原因,许多可穿戴和植入式传感器节点的研究和开发正在进行中。本文指出,应用知识和了解不同组件对系统功耗的贡献是在系统设计中做出最佳权衡的最佳起点。这将最大限度地减少传感器节点的总体功耗,而不会丢失所需的主要功能。除了系统优化的重要性外,还表明需要开发新的组件和电路技术,以实现能源效率的数量级提高。这是实现超薄心电图贴片以及更小尺寸的节点(如用于脑机交互或神经植入的实时脑电图处理)的必要条件。
{"title":"Human++: Key Challenges and Trade-offs in Embedded System Design for Personal Health Care","authors":"H. D. Groot, M. Ashouei, J. Penders, V. Pop, M. Vidojkovic, B. Gyselinckx, R. Yazicioglu","doi":"10.1109/DSD.2011.115","DOIUrl":"https://doi.org/10.1109/DSD.2011.115","url":null,"abstract":"The cost of health care in first-world countries is increasing dramatically as a result of advances in medicine, a population that is becoming older and an increasingly unhealthy lifestyle. Personal health care concepts where sensors within and around the body monitor and measure all kind of physiological signals can be an addition to medicare with high benefits. This concept allows patients to stay in their home environment and hence have a better quality of life with lower costs involved. For these reasons research and development is ongoing on many body worn and implantable sensor nodes. In this paper it is shown that application knowledge and understanding the contribution of different components to the system power consumption is the best starting point to make optimal trade-offs in the system design. This will minimize the overall power consumption of a sensor node without losing track of the major functionality needed. Besides the importance of system optimization, it is also shown that new components and circuit techniques need to be developed to achieve orders of magnitude increase in energy efficiency. This is a must to realize ultra-thin electrocardiogram patches as well as more demanding nodes with a small form factor like real-time Electro Encephalogram processing for brain computer interaction or neuro-implants.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125146000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Novel Hardware Architecture for Implementing the Inner Loop of the SHA-2 Algorithms 实现SHA-2算法内循环的新型硬件架构
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.75
I. Algredo-Badillo, C. F. Uribe, R. Cumplido, M. Morales-Sandoval
Cryptographic algorithms are used to enable security services that are the core of modern communication systems. In particular, Hash functions algorithms are widely used to provide services of data integrity and authentication. These algorithms are based on performing a number of complex operations on the input data, thus it is important to count with novel designs that can be efficiently mapped to hardware architectures. Hash functions perform internal operations in an iterative fashion, which open the possibility of exploring several implementation strategies. In the paper, two different schemes to improve the performance of the hardware implementation of the SHA-2 family of algorithms are proposed. The main focus of the proposed schemes is to reduce the critical path by reordering the operations required at each iteration of the algorithm. Implementation results on an FPGA device show an improvement on the performance on the SHA-256 algorithm when compared against similar previously proposed approaches.
加密算法用于实现安全服务,这是现代通信系统的核心。其中,哈希函数算法被广泛用于提供数据完整性和认证服务。这些算法基于对输入数据执行许多复杂的操作,因此使用能够有效地映射到硬件架构的新颖设计是很重要的。哈希函数以迭代的方式执行内部操作,这为探索多种实现策略提供了可能性。本文提出了两种不同的方案来提高SHA-2系列算法的硬件实现性能。提出的方案的主要重点是通过在算法的每次迭代中重新排序所需的操作来减少关键路径。在FPGA器件上的实现结果表明,与先前提出的类似方法相比,SHA-256算法的性能有所提高。
{"title":"Novel Hardware Architecture for Implementing the Inner Loop of the SHA-2 Algorithms","authors":"I. Algredo-Badillo, C. F. Uribe, R. Cumplido, M. Morales-Sandoval","doi":"10.1109/DSD.2011.75","DOIUrl":"https://doi.org/10.1109/DSD.2011.75","url":null,"abstract":"Cryptographic algorithms are used to enable security services that are the core of modern communication systems. In particular, Hash functions algorithms are widely used to provide services of data integrity and authentication. These algorithms are based on performing a number of complex operations on the input data, thus it is important to count with novel designs that can be efficiently mapped to hardware architectures. Hash functions perform internal operations in an iterative fashion, which open the possibility of exploring several implementation strategies. In the paper, two different schemes to improve the performance of the hardware implementation of the SHA-2 family of algorithms are proposed. The main focus of the proposed schemes is to reduce the critical path by reordering the operations required at each iteration of the algorithm. Implementation results on an FPGA device show an improvement on the performance on the SHA-256 algorithm when compared against similar previously proposed approaches.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116513311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Pre-silicon Characterization of NIST SHA-3 Final Round Candidates NIST SHA-3最后一轮候选人的预硅表征
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.74
Xu Guo, Meeta Srivastav, Sinan Huang, D. Ganta, Michael B. Henry, L. Nazhandali, P. Schaumont
The NIST SHA-3 competition aims to select a new secure hash standard. Hardware implementation quality is an important factor in evaluating the SHA-3 finalists. However, a comprehensive methodology to benchmark five final round SHA-3 candidates in ASIC is challenging. Many factors need to be considered, including application scenarios, target technologies and optimization goals. This work describes detailed steps in the silicon implementation of a SHA-3 ASIC. The plan of ASIC prototyping with all the SHA-3 finalists, as an integral part of our SHA-3 ASIC evaluation project, is motivated by our previously proposed methodology, which defines a consistent and systematic approach to move a SHA-3 hardware benchmark process from FPGA prototyping to ASIC implementation. We have designed the remaining five SHA-3 candidates in 0.13 $mu m$ IBM process using standard-cell CMOS technology. In this paper, we discuss our proposed methodology for SHA-3 ASIC evaluation and report the latest results based on post-layout simulation of the five SHA-3 finalists with Round 3 tweaks.
NIST SHA-3竞赛旨在选择一种新的安全哈希标准。硬件实现质量是评估SHA-3最终入围者的重要因素。然而,在ASIC中对五个最后一轮SHA-3候选对象进行基准测试的综合方法具有挑战性。需要考虑很多因素,包括应用场景、目标技术和优化目标。这项工作描述了SHA-3 ASIC的硅实现的详细步骤。所有SHA-3决赛入围者的ASIC原型计划,作为我们SHA-3 ASIC评估项目的组成部分,是由我们之前提出的方法所驱动的,该方法定义了一种一致和系统的方法,将SHA-3硬件基准流程从FPGA原型转移到ASIC实现。我们使用标准单元CMOS技术在0.13 $mu m$ IBM工艺中设计了剩余的五个SHA-3候选芯片。在本文中,我们讨论了我们提出的SHA-3 ASIC评估方法,并报告了基于三个SHA-3决赛选手的布局后模拟和第三轮调整的最新结果。
{"title":"Pre-silicon Characterization of NIST SHA-3 Final Round Candidates","authors":"Xu Guo, Meeta Srivastav, Sinan Huang, D. Ganta, Michael B. Henry, L. Nazhandali, P. Schaumont","doi":"10.1109/DSD.2011.74","DOIUrl":"https://doi.org/10.1109/DSD.2011.74","url":null,"abstract":"The NIST SHA-3 competition aims to select a new secure hash standard. Hardware implementation quality is an important factor in evaluating the SHA-3 finalists. However, a comprehensive methodology to benchmark five final round SHA-3 candidates in ASIC is challenging. Many factors need to be considered, including application scenarios, target technologies and optimization goals. This work describes detailed steps in the silicon implementation of a SHA-3 ASIC. The plan of ASIC prototyping with all the SHA-3 finalists, as an integral part of our SHA-3 ASIC evaluation project, is motivated by our previously proposed methodology, which defines a consistent and systematic approach to move a SHA-3 hardware benchmark process from FPGA prototyping to ASIC implementation. We have designed the remaining five SHA-3 candidates in 0.13 $mu m$ IBM process using standard-cell CMOS technology. In this paper, we discuss our proposed methodology for SHA-3 ASIC evaluation and report the latest results based on post-layout simulation of the five SHA-3 finalists with Round 3 tweaks.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"21 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132935184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Thermal Analysis of Job Allocation and Scheduling Schemes for 3D Stacked NoC's 三维堆叠NoC作业分配与调度方案的热分析
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.87
Kameswar Rao Vaddina, A. Rahmani, Khalid Latif, P. Liljeberg, J. Plosila
Three-dimensional technology offers greater device integration, reduced signal delay and reduced interconnect power. It also provides greater design flexibility by allowing heterogeneous integration. However, 3D technology exacerbates the on-chip thermal issues and increases packaging and cooling costs. In this work, a 3D thermal model of a stacked network-on-chip system is developed and thermal analysis is performed in order to analyze different job allocation and scheduling schemes using finite element simulations. The steady-state heat transfer analysis on the 3D stacked structure has been performed. We have analyzed the effect of variation of die power consumption, with and without hotspots, on peak temperatures in different layers of the stack. The optimal die placement solution is also provided based on the maximum temperature attained by the individual silicon dies.
三维技术提供了更大的设备集成度,减少了信号延迟,降低了互连功率。它还通过允许异构集成提供了更大的设计灵活性。然而,3D技术加剧了芯片上的热问题,增加了封装和冷却成本。在这项工作中,建立了一个堆叠的片上网络系统的三维热模型,并进行了热分析,以分析不同的工作分配和调度方案,使用有限元模拟。对三维叠层结构进行了稳态传热分析。我们分析了在有和没有热点的情况下,模具功耗的变化对堆叠不同层的峰值温度的影响。基于单个硅模具所达到的最高温度,还提供了最佳的模具放置方案。
{"title":"Thermal Analysis of Job Allocation and Scheduling Schemes for 3D Stacked NoC's","authors":"Kameswar Rao Vaddina, A. Rahmani, Khalid Latif, P. Liljeberg, J. Plosila","doi":"10.1109/DSD.2011.87","DOIUrl":"https://doi.org/10.1109/DSD.2011.87","url":null,"abstract":"Three-dimensional technology offers greater device integration, reduced signal delay and reduced interconnect power. It also provides greater design flexibility by allowing heterogeneous integration. However, 3D technology exacerbates the on-chip thermal issues and increases packaging and cooling costs. In this work, a 3D thermal model of a stacked network-on-chip system is developed and thermal analysis is performed in order to analyze different job allocation and scheduling schemes using finite element simulations. The steady-state heat transfer analysis on the 3D stacked structure has been performed. We have analyzed the effect of variation of die power consumption, with and without hotspots, on peak temperatures in different layers of the stack. The optimal die placement solution is also provided based on the maximum temperature attained by the individual silicon dies.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133411299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Enhancing Performance Sustainability of Fault Tolerant Routing Algorithms in NoC-Based Architectures 基于noc架构的容错路由算法的性能可持续性增强
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.85
Khalid Latif, A. Rahmani, Kameswar Rao Vaddina, T. Seceleanu, P. Liljeberg, H. Tenhunen
Reliability of embedded systems and devices is becoming a challenge with technology scaling. To deal with the reliability issues, fault tolerant solutions are needed. The design paradigm for future System-on-Chip (SoC) implementation is Network-on-Chip (NoC). Fault tolerance in NoC can be achieved at many abstraction levels. Many fault tolerant architectures and routing algorithms have already been proposed for NoC but the utilization of resources, affected indirectly by faults is yet to be addressed. In this paper, we propose a NoC architecture, which sustains the overall system performance by utilizing resources, which cannot be used by other architectures under faults. An approach towards a proper virtual-channel (VC) sharing strategy is proposed, based on communication bandwidth requirements. The technique can be applied to any NoC architecture, including 3-D NoCs. Extensive quantitative experiments with synthetic benchmarks, including uniform, transpose and negative exponential distribution (NED), demonstrate considerable improvement in terms of performance sustainability under faulty conditions compared to existing VC-based NoC architectures.
随着技术的发展,嵌入式系统和设备的可靠性正成为一个挑战。为了处理可靠性问题,需要容错解决方案。未来片上系统(SoC)实现的设计范式是片上网络(NoC)。NoC中的容错可以在许多抽象级别上实现。针对NoC已经提出了许多容错体系结构和路由算法,但受故障间接影响的资源利用率尚未得到解决。在本文中,我们提出了一种NoC架构,它通过利用资源来维持系统的整体性能,而其他架构在故障时无法使用这些资源。提出了一种基于通信带宽需求的虚拟信道(VC)共享策略。该技术可以应用于任何NoC架构,包括3d NoC。大量的合成基准定量实验,包括均匀、转置和负指数分布(NED),表明与现有基于vc的NoC架构相比,在故障条件下的性能可持续性方面有了相当大的改善。
{"title":"Enhancing Performance Sustainability of Fault Tolerant Routing Algorithms in NoC-Based Architectures","authors":"Khalid Latif, A. Rahmani, Kameswar Rao Vaddina, T. Seceleanu, P. Liljeberg, H. Tenhunen","doi":"10.1109/DSD.2011.85","DOIUrl":"https://doi.org/10.1109/DSD.2011.85","url":null,"abstract":"Reliability of embedded systems and devices is becoming a challenge with technology scaling. To deal with the reliability issues, fault tolerant solutions are needed. The design paradigm for future System-on-Chip (SoC) implementation is Network-on-Chip (NoC). Fault tolerance in NoC can be achieved at many abstraction levels. Many fault tolerant architectures and routing algorithms have already been proposed for NoC but the utilization of resources, affected indirectly by faults is yet to be addressed. In this paper, we propose a NoC architecture, which sustains the overall system performance by utilizing resources, which cannot be used by other architectures under faults. An approach towards a proper virtual-channel (VC) sharing strategy is proposed, based on communication bandwidth requirements. The technique can be applied to any NoC architecture, including 3-D NoCs. Extensive quantitative experiments with synthetic benchmarks, including uniform, transpose and negative exponential distribution (NED), demonstrate considerable improvement in terms of performance sustainability under faulty conditions compared to existing VC-based NoC architectures.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133901129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Hardware Implementation of a Flexible Tag Platform for Passive RFID Devices 无源RFID设备柔性标签平台的硬件实现
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.43
Thomas Plos, Martin Feldhofer
Radio-frequency identification (RFID) technology has emerged from a simple identification technique towards the enabler for more sophisticated applications like advanced data storage and proof of origin. Even so-called passive RFID tags that receive their power from a radio-frequency field tend to implement more functionality which leads to increased control complexity on them. Current low-cost tags are implemented by hardwired state machines which are inflexible and also inefficient when control complexity increases. In this work, we present a flexible tag platform that is based on a simple 8-bit microcontroller optimized for low chip area and low power consumption. We demonstrate the efficiency of our approach by implementing a near-field communication (NFC) compatible tag with advanced file-access functionality and security features in hardware. Results show that the power consumption of the microcontroller is below 10, textmu A at 106, kHz and that the control part of the flexible tag platform requires a chip area of 10, kGEs making it suitable for low-cost RFID devices.
射频识别(RFID)技术已经从一种简单的识别技术发展成为先进数据存储和原产地证明等更复杂应用的推动者。即使所谓的无源RFID标签从射频场接收能量,也倾向于实现更多的功能,这导致它们的控制复杂性增加。当前的低成本标签是由硬连线状态机实现的,当控制复杂性增加时,这种状态机既不灵活又效率低下。在这项工作中,我们提出了一个灵活的标签平台,该平台基于一个简单的8位微控制器,针对低芯片面积和低功耗进行了优化。我们通过在硬件中实现具有高级文件访问功能和安全特性的近场通信(NFC)兼容标签来证明我们方法的效率。结果表明,该微控制器在106 kHz时的功耗低于10 μ m,柔性标签平台的控制部分需要10 μ m的芯片面积,适合于低成本的RFID设备。
{"title":"Hardware Implementation of a Flexible Tag Platform for Passive RFID Devices","authors":"Thomas Plos, Martin Feldhofer","doi":"10.1109/DSD.2011.43","DOIUrl":"https://doi.org/10.1109/DSD.2011.43","url":null,"abstract":"Radio-frequency identification (RFID) technology has emerged from a simple identification technique towards the enabler for more sophisticated applications like advanced data storage and proof of origin. Even so-called passive RFID tags that receive their power from a radio-frequency field tend to implement more functionality which leads to increased control complexity on them. Current low-cost tags are implemented by hardwired state machines which are inflexible and also inefficient when control complexity increases. In this work, we present a flexible tag platform that is based on a simple 8-bit microcontroller optimized for low chip area and low power consumption. We demonstrate the efficiency of our approach by implementing a near-field communication (NFC) compatible tag with advanced file-access functionality and security features in hardware. Results show that the power consumption of the microcontroller is below 10, textmu A at 106, kHz and that the control part of the flexible tag platform requires a chip area of 10, kGEs making it suitable for low-cost RFID devices.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116098192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Fault Tolerance of Multiple Logic Faults in SRAM-Based FPGA Systems 基于sram的FPGA系统多逻辑故障容错研究
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.33
Farid Lahrach, A. Doumar, E. Châtelet
The very hight levels of integration and submicron device sizes used in current and emerging VLSI technologies for SRAM-based FPGAs lead to higher occurrences of defects and operational faults. Thus, there is a critical need for fault tolerance and reconfiguration techniques for SRAM-based FPGAs to increase chip reliability with field reconfiguration. We first propose a technique utilizing the principle of master slave to tolerate logic or cells in SRAM-based FPGAs. We show that this architectural technique can be used to build redundancy for defect and fault tolerance with limited area and performance overhead. Our algorithm improves reliability of the SRAM-based FPGAs by performing two operations: TMR (triple modular redundancy) (in which CLBs are used to triplicate a logic function whose value is obtained at the voter output) and partitioning (in which the design is partitioned into a set of MSUs (master-slave unit) to reduce the amount of configuration memory required). In response to a component failure, a functionality equivalent MSU that does not rely on the faulty component replaces the affected MSU. Our technique can handle a large numbers of faults (we show tolerance of 16 logic faults in look-up tables LUTs belonging to the same MSU). Experimental results conducted on a subset of the ITC'99 benchmarks demonstrate a high level of reliability in term of fault tolerance with low hardware overhead compared to TMR which has a 5x- 6x area overhead and high power consumption.
当前和新兴的基于sram的fpga的VLSI技术中使用的非常高的集成水平和亚微米器件尺寸导致更高的缺陷和操作故障发生率。因此,基于sram的fpga迫切需要容错和重构技术,以通过现场重构来提高芯片的可靠性。我们首先提出了一种利用主从原理在基于sram的fpga中容忍逻辑或单元的技术。我们展示了这种体系结构技术可以在有限的面积和性能开销下为缺陷和容错构建冗余。我们的算法通过执行两个操作来提高基于sram的fpga的可靠性:TMR(三重模块化冗余)(其中clb用于复制逻辑函数,其值在选民输出处获得)和分区(其中设计被划分为一组msu(主从单元)以减少所需的配置内存量)。作为对组件故障的响应,一个不依赖于故障组件的功能等效的MSU替换受影响的MSU。我们的技术可以处理大量的错误(我们在属于同一个MSU的查找表lut中显示了16个逻辑错误的容忍度)。在ITC'99基准测试的一个子集上进行的实验结果表明,与具有5 - 6倍面积开销和高功耗的TMR相比,在容错方面具有高水平的可靠性和低硬件开销。
{"title":"Fault Tolerance of Multiple Logic Faults in SRAM-Based FPGA Systems","authors":"Farid Lahrach, A. Doumar, E. Châtelet","doi":"10.1109/DSD.2011.33","DOIUrl":"https://doi.org/10.1109/DSD.2011.33","url":null,"abstract":"The very hight levels of integration and submicron device sizes used in current and emerging VLSI technologies for SRAM-based FPGAs lead to higher occurrences of defects and operational faults. Thus, there is a critical need for fault tolerance and reconfiguration techniques for SRAM-based FPGAs to increase chip reliability with field reconfiguration. We first propose a technique utilizing the principle of master slave to tolerate logic or cells in SRAM-based FPGAs. We show that this architectural technique can be used to build redundancy for defect and fault tolerance with limited area and performance overhead. Our algorithm improves reliability of the SRAM-based FPGAs by performing two operations: TMR (triple modular redundancy) (in which CLBs are used to triplicate a logic function whose value is obtained at the voter output) and partitioning (in which the design is partitioned into a set of MSUs (master-slave unit) to reduce the amount of configuration memory required). In response to a component failure, a functionality equivalent MSU that does not rely on the faulty component replaces the affected MSU. Our technique can handle a large numbers of faults (we show tolerance of 16 logic faults in look-up tables LUTs belonging to the same MSU). Experimental results conducted on a subset of the ITC'99 benchmarks demonstrate a high level of reliability in term of fault tolerance with low hardware overhead compared to TMR which has a 5x- 6x area overhead and high power consumption.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123460952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Process Variation Reduction for CMOS Logic Operating at Sub-threshold Supply Voltage 在亚阈值电压下CMOS逻辑工作的工艺变化减小
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.21
Bo Liu, H. Pourshaghaghi, Sebastian M. Londono, J. P. D. Gyvez
Sub-threshold circuit design has become a popular approach for building energy efficient digital circuits. The main drawbacks are performance degradation due to the exponentially reduced driving current, and the effect of increased sensitivity to process variation. To obtain energy savings while reducing performance degradation, we propose the design of a robust sub-threshold library and post-silicon tuning using an adaptive fuzzy logic controller which performs body bias scaling. We show that our methodology is able to fix the performance, consequently, making the system more energy efficient and achieving maximum yield.
亚阈值电路设计已成为构建高效节能数字电路的一种流行方法。主要的缺点是由于驱动电流呈指数级降低而导致的性能下降,以及对工艺变化的敏感性增加的影响。为了在降低性能下降的同时获得节能,我们提出了一个鲁棒的亚阈值库设计和使用自适应模糊逻辑控制器进行体偏置缩放的硅后调谐。我们表明,我们的方法能够固定性能,因此,使系统更节能,实现最大产量。
{"title":"Process Variation Reduction for CMOS Logic Operating at Sub-threshold Supply Voltage","authors":"Bo Liu, H. Pourshaghaghi, Sebastian M. Londono, J. P. D. Gyvez","doi":"10.1109/DSD.2011.21","DOIUrl":"https://doi.org/10.1109/DSD.2011.21","url":null,"abstract":"Sub-threshold circuit design has become a popular approach for building energy efficient digital circuits. The main drawbacks are performance degradation due to the exponentially reduced driving current, and the effect of increased sensitivity to process variation. To obtain energy savings while reducing performance degradation, we propose the design of a robust sub-threshold library and post-silicon tuning using an adaptive fuzzy logic controller which performs body bias scaling. We show that our methodology is able to fix the performance, consequently, making the system more energy efficient and achieving maximum yield.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128727257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A Unified Multiply/Accumulate Unit for Pairing-Based Cryptography over Prime, Binary and Ternary Fields 素数、二进制和三元域上基于配对密码的统一乘/累加单位
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.89
Tobias Vejda, J. Großschädl, D. Page
Bilinear maps, or pairings, on elliptic curves are an active area of research in modern cryptology with applications ranging from cryptanalysis (e.g. MOV attack) over identity-based encryption to short signature schemes. Many parameterisations and implementation options for pairing-based cryptography have been investigated in the recent past. Elliptic curves over prime fields are often preferred for software implementation, whereas extension fields of characteristic two and three offer advantages for implementation in hardware. In the ideal case, a hardware accelerator for pairing-based cryptography can support all three types of field to ensure inter-operability with a broad spectrum of applications. This need has motivated the design of so-called unified multipliers, which are basically multipliers that integrate different types of operands (e.g. integers and polynomials) into a single data path. In the present paper, we introduce a unified multiply/accumulate unit for signed/unsigned integers as well as binary and ternary polynomials. The multiplier generates partial products using a Redundant Signed-Digit (RSD) representation that allows for efficient combination of all three operand types into one data path. In addition, our design takes advantage of a high-radix encoding scheme for integers and binary polynomials to reduce the overall number of partial products and utilise the data path in an optimal way. We compare our multiplier with a previous radix-2 implementation of Ozturk et al and analyse the differences in terms of silicon area and critical path delay. The unified multiply/accumulate unit described in this paper can be used in embedded systems like smart cards, either as arithmetic core of a cryptographic co-processor, or as functional unit of an application-specific processor.
椭圆曲线上的双线性映射或配对是现代密码学中一个活跃的研究领域,其应用范围从基于身份的加密的密码分析(例如MOV攻击)到短签名方案。在最近的过去已经研究了许多基于配对的加密的参数化和实现选项。在软件实现中,通常首选素数域上的椭圆曲线,而特征二和特征三的扩展域则为硬件实现提供了优势。在理想情况下,用于基于配对的加密的硬件加速器可以支持所有三种类型的字段,以确保与广泛的应用程序的互操作性。这种需求激发了所谓的统一乘法器的设计,它基本上是将不同类型的操作数(例如整数和多项式)集成到单个数据路径中的乘法器。本文介绍了有符号整数和无符号整数以及二元多项式和三元多项式的统一乘法/累加单元。乘法器使用冗余带符号数字(RSD)表示生成部分乘积,该表示允许将所有三种操作数类型有效地组合到一个数据路径中。此外,我们的设计利用整数和二进制多项式的高基数编码方案来减少部分乘积的总数,并以最优的方式利用数据路径。我们将我们的乘法器与Ozturk等人之前的基数-2实现进行了比较,并分析了硅面积和关键路径延迟方面的差异。本文所描述的统一乘/累加单元既可以作为加密协处理器的算术核心,也可以作为专用处理器的功能单元,应用于智能卡等嵌入式系统。
{"title":"A Unified Multiply/Accumulate Unit for Pairing-Based Cryptography over Prime, Binary and Ternary Fields","authors":"Tobias Vejda, J. Großschädl, D. Page","doi":"10.1109/DSD.2011.89","DOIUrl":"https://doi.org/10.1109/DSD.2011.89","url":null,"abstract":"Bilinear maps, or pairings, on elliptic curves are an active area of research in modern cryptology with applications ranging from cryptanalysis (e.g. MOV attack) over identity-based encryption to short signature schemes. Many parameterisations and implementation options for pairing-based cryptography have been investigated in the recent past. Elliptic curves over prime fields are often preferred for software implementation, whereas extension fields of characteristic two and three offer advantages for implementation in hardware. In the ideal case, a hardware accelerator for pairing-based cryptography can support all three types of field to ensure inter-operability with a broad spectrum of applications. This need has motivated the design of so-called unified multipliers, which are basically multipliers that integrate different types of operands (e.g. integers and polynomials) into a single data path. In the present paper, we introduce a unified multiply/accumulate unit for signed/unsigned integers as well as binary and ternary polynomials. The multiplier generates partial products using a Redundant Signed-Digit (RSD) representation that allows for efficient combination of all three operand types into one data path. In addition, our design takes advantage of a high-radix encoding scheme for integers and binary polynomials to reduce the overall number of partial products and utilise the data path in an optimal way. We compare our multiplier with a previous radix-2 implementation of Ozturk et al and analyse the differences in terms of silicon area and critical path delay. The unified multiply/accumulate unit described in this paper can be used in embedded systems like smart cards, either as arithmetic core of a cryptographic co-processor, or as functional unit of an application-specific processor.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127443234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Automatic Interface Generation for Component Reuse in HW-SW Partitioning HW-SW分区中组件复用的自动接口生成
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.105
N. Bombieri, F. Fummi, S. Vinco, D. Quaglia
HW-SW partitioning is a key problem in HW-SW code sign of embedded systems studied extensively in the past. All proposed approaches are top down flows, that start from a homogeneous formal specification of the system and determine an optimal partitioning. Thus, the proposed techniques do not exploit reuse nor reconsider the HWSW partitioning of an already designed platform. This paper proposes an extension of traditional flows that allows reuse and automatic generation of components and interfaces. The final flow has been applied to a complex industrial platform to prove the effectiveness and the advantages of the proposed approach.
HW-SW分划是嵌入式系统HW-SW码号研究的一个关键问题。所有建议的方法都是自上而下的流程,从系统的同构形式化规范开始,并确定最优划分。因此,所提出的技术不利用重用,也不重新考虑已经设计的平台的HWSW分区。本文提出了一个传统流的扩展,允许重用和自动生成组件和接口。最后将该流程应用于一个复杂的工业平台,验证了该方法的有效性和优越性。
{"title":"Automatic Interface Generation for Component Reuse in HW-SW Partitioning","authors":"N. Bombieri, F. Fummi, S. Vinco, D. Quaglia","doi":"10.1109/DSD.2011.105","DOIUrl":"https://doi.org/10.1109/DSD.2011.105","url":null,"abstract":"HW-SW partitioning is a key problem in HW-SW code sign of embedded systems studied extensively in the past. All proposed approaches are top down flows, that start from a homogeneous formal specification of the system and determine an optimal partitioning. Thus, the proposed techniques do not exploit reuse nor reconsider the HWSW partitioning of an already designed platform. This paper proposes an extension of traditional flows that allows reuse and automatic generation of components and interfaces. The final flow has been applied to a complex industrial platform to prove the effectiveness and the advantages of the proposed approach.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130757038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2011 14th Euromicro Conference on Digital System Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1