首页 > 最新文献

2007 Design, Automation & Test in Europe Conference & Exhibition最新文献

英文 中文
Simulation Platform for UHF RFID 超高频RFID仿真平台
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364410
V. Derbek, C. Steger, R. Weiss, Daniel Wischounig, Josef Preishuber-Pflügl, M. Pistauer
Developing modern integrated and embedded systems require well-designed processes to ensure flexibility and independency. These features are related to exchangeability of hardware targets and to the ability of choosing the target at a very late stage in the implementation process. Especially in the field of ultra high frequency radio frequency identification (UHF RFID) the model-based design approach leads to expected results. Beside a clear design process, which is applied in this work to build the required system architecture, the scope for UHF RFID simulations is defined and an extendable platform based on the MathWorks Matlab Simulinkreg is developed. This simulation platform, based on a multi-processor hardware target, using a Texas Instruments TMS320C6416 digital signal processor is able to run UHF RFID tag simulations of very high complexity. The highest effort is made to ensure flexibility to handle future simulation models on the same hardware target, realized by the continuous design and implementation flow of this platform based on model-based design
开发现代集成和嵌入式系统需要精心设计的流程,以确保灵活性和独立性。这些特性与硬件目标的可互换性以及在实现过程的非常后期阶段选择目标的能力有关。特别是在超高频射频识别(UHF RFID)领域,基于模型的设计方法可以达到预期的效果。除了明确的设计流程(用于构建所需的系统架构)之外,还定义了UHF RFID仿真的范围,并开发了基于MathWorks Matlab Simulinkreg的可扩展平台。该仿真平台基于多处理器硬件目标,采用德州仪器TMS320C6416数字信号处理器,能够运行非常复杂的超高频RFID标签仿真。通过基于模型设计的平台的连续设计和实现流程,最大限度地保证了在同一硬件目标上处理未来仿真模型的灵活性
{"title":"Simulation Platform for UHF RFID","authors":"V. Derbek, C. Steger, R. Weiss, Daniel Wischounig, Josef Preishuber-Pflügl, M. Pistauer","doi":"10.1109/DATE.2007.364410","DOIUrl":"https://doi.org/10.1109/DATE.2007.364410","url":null,"abstract":"Developing modern integrated and embedded systems require well-designed processes to ensure flexibility and independency. These features are related to exchangeability of hardware targets and to the ability of choosing the target at a very late stage in the implementation process. Especially in the field of ultra high frequency radio frequency identification (UHF RFID) the model-based design approach leads to expected results. Beside a clear design process, which is applied in this work to build the required system architecture, the scope for UHF RFID simulations is defined and an extendable platform based on the MathWorks Matlab Simulinkreg is developed. This simulation platform, based on a multi-processor hardware target, using a Texas Instruments TMS320C6416 digital signal processor is able to run UHF RFID tag simulations of very high complexity. The highest effort is made to ensure flexibility to handle future simulation models on the same hardware target, realized by the continuous design and implementation flow of this platform based on model-based design","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128946108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Automatic Application Specific Floating-point Unit Generation 特定于应用程序的浮点单元自动生成
Pub Date : 2007-04-16 DOI: 10.5555/1266366.1266464
Yee Jern Chong, S. Parameswaran
This paper describes the creation of custom floating point units (FPUs) for application specific instruction set processors (ASIPs). ASIPs allow the customization of processors for use in embedded systems by extending the instruction set, which enhances the performance of an application or a class of applications. These extended instructions are manifested as separate hardware blocks, making the creation of any necessary floating point instructions quite unwieldy. On the other hand, using a predefined FPU includes a large monolithic hardware block with considerable number of unused instructions. A customized FPU will overcome these drawbacks, yet the manual creation of one is a time consuming, error prone process. This paper presents a methodology for automatically generating floating-point units (FPUs) that are customized for specific applications at the instruction level. Generated FPUs comply with the IEEE754 standard, which is an advantage over FP format customization. Custom FPUs were generated for several Mediabench applications. Area savings over a fully-featured FPU without resource sharing of 26%-80% without resource sharing and 33%-87% with resource sharing, were obtained. Clock period increased in some cases by up to 9.5% due to resource sharing
本文描述了为特定应用指令集处理器(asip)创建自定义浮点单元(fpu)。通过扩展指令集,api允许定制用于嵌入式系统的处理器,从而增强应用程序或一类应用程序的性能。这些扩展指令表现为单独的硬件块,使得创建任何必要的浮点指令非常笨拙。另一方面,使用预定义的FPU包括一个大的单片硬件块,其中包含大量未使用的指令。定制FPU将克服这些缺点,但是手工创建一个FPU是一个耗时且容易出错的过程。本文提出了一种在指令级为特定应用程序定制的自动生成浮点单元(fpu)的方法。生成的fpu符合IEEE754标准,这是fpu格式自定义的优势。为几个mediabbench应用程序生成了自定义fpu。与无资源共享的全功能FPU相比,无资源共享时可节省26%-80%的面积,有资源共享时可节省33%-87%的面积。由于资源共享,时钟周期在某些情况下增加了9.5%
{"title":"Automatic Application Specific Floating-point Unit Generation","authors":"Yee Jern Chong, S. Parameswaran","doi":"10.5555/1266366.1266464","DOIUrl":"https://doi.org/10.5555/1266366.1266464","url":null,"abstract":"This paper describes the creation of custom floating point units (FPUs) for application specific instruction set processors (ASIPs). ASIPs allow the customization of processors for use in embedded systems by extending the instruction set, which enhances the performance of an application or a class of applications. These extended instructions are manifested as separate hardware blocks, making the creation of any necessary floating point instructions quite unwieldy. On the other hand, using a predefined FPU includes a large monolithic hardware block with considerable number of unused instructions. A customized FPU will overcome these drawbacks, yet the manual creation of one is a time consuming, error prone process. This paper presents a methodology for automatically generating floating-point units (FPUs) that are customized for specific applications at the instruction level. Generated FPUs comply with the IEEE754 standard, which is an advantage over FP format customization. Custom FPUs were generated for several Mediabench applications. Area savings over a fully-featured FPU without resource sharing of 26%-80% without resource sharing and 33%-87% with resource sharing, were obtained. Clock period increased in some cases by up to 9.5% due to resource sharing","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126374462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Tiny and Efficient Wireless Ad-hoc Protocol for Low-cost Sensor Networks 一种用于低成本传感器网络的小型高效无线自组织协议
Pub Date : 2007-04-16 DOI: 10.1145/1266366.1266709
P. Gburzynski, B. Kaminska, W. Olesinski
The authors introduce a simple ad-hoc routing scheme that operates in the true spirit of ad-hoc networking, i.e., in a modeless fashion, without neighborhood discovery or explicit point-to-point forwarding, while offering a high (and tunable) degree of reliability, fault-tolerance and robustness. Being aimed at truly tiny devices (e.g., with 1KB of RAM), the scheme can automatically take advantage of extra memory resources to improve the quality of routes for critical nodes. In contrast to some popular low-cost solutions, like ZigBeetrade the approach involves a single node type and exhibits lower resource requirements. The presented scheme has been verified in an industrial deployment with stringent quality of service requirements
作者介绍了一种简单的ad-hoc路由方案,该方案在ad-hoc网络的真正精神中运行,即以非模态方式,没有邻居发现或显式点对点转发,同时提供高(可调)程度的可靠性,容错性和鲁棒性。针对真正微小的设备(例如,只有1KB的RAM),该方案可以自动利用额外的内存资源来提高关键节点的路由质量。与一些流行的低成本解决方案(如ZigBeetrade)相比,该方法只涉及单个节点类型,资源需求更低。该方案已在具有严格服务质量要求的工业部署中得到验证
{"title":"A Tiny and Efficient Wireless Ad-hoc Protocol for Low-cost Sensor Networks","authors":"P. Gburzynski, B. Kaminska, W. Olesinski","doi":"10.1145/1266366.1266709","DOIUrl":"https://doi.org/10.1145/1266366.1266709","url":null,"abstract":"The authors introduce a simple ad-hoc routing scheme that operates in the true spirit of ad-hoc networking, i.e., in a modeless fashion, without neighborhood discovery or explicit point-to-point forwarding, while offering a high (and tunable) degree of reliability, fault-tolerance and robustness. Being aimed at truly tiny devices (e.g., with 1KB of RAM), the scheme can automatically take advantage of extra memory resources to improve the quality of routes for critical nodes. In contrast to some popular low-cost solutions, like ZigBeetrade the approach involves a single node type and exhibits lower resource requirements. The presented scheme has been verified in an industrial deployment with stringent quality of service requirements","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122570331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Efficient High-Performance ASIC Implementation of JPEG-LS Encoder JPEG-LS编码器的高效高性能ASIC实现
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364584
Markos E. Papadonikolakis, Vasilleios Pantazis, A. Kakarountas
This paper introduces an innovative design which implements a high-performance JPEG-LS encoder. The encoding process follows the principles of the JPEG-LS lossless mode. The proposed implementation consists of an efficient pipelined JPEG-LS encoder, which operates at a significantly higher encoding rate than any other JPEG-LS hardware or software implementation while keeping area small
本文介绍了一种实现高性能JPEG-LS编码器的创新设计。编码过程遵循JPEG-LS无损模式的原则。提出的实现由一个高效的流水线式JPEG-LS编码器组成,该编码器的编码速率明显高于任何其他JPEG-LS硬件或软件实现,同时保持面积小
{"title":"Efficient High-Performance ASIC Implementation of JPEG-LS Encoder","authors":"Markos E. Papadonikolakis, Vasilleios Pantazis, A. Kakarountas","doi":"10.1109/DATE.2007.364584","DOIUrl":"https://doi.org/10.1109/DATE.2007.364584","url":null,"abstract":"This paper introduces an innovative design which implements a high-performance JPEG-LS encoder. The encoding process follows the principles of the JPEG-LS lossless mode. The proposed implementation consists of an efficient pipelined JPEG-LS encoder, which operates at a significantly higher encoding rate than any other JPEG-LS hardware or software implementation while keeping area small","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126956075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 62
Efficient and Scalable Compiler-Directed Energy Optimization for Realtime Applications 针对实时应用的高效和可扩展的编译器定向能量优化
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364386
Po-Kuan Huang, S. Ghiasi
We present a compilation technique that targets realtime applications running on embedded processors with combined dynamic voltage scaling (DVS) and adaptive body biasing (ABB) capabilities. Considering the delay and energy penalty of switching between operating modes of the processor, our compiler judiciously inserts mode switch instructions in selected locations of the code and generates executable binary that is guaranteed to meet the deadline constraint. More importantly, our algorithm runs very fast and comes reasonably close to the theoretical limit of energy optimization using DVS+ABB. At 65 nm technology, we improve the energy dissipation of the generated code by an average of11.4% under deadline constraints. While our technique's improvement in energy dissipation over conventional DVS is marginal (3%) at 130nm, the average improvement continues to grow to 4.7%, 8.8% and 15.4% for 90nm, 65nm and 45nm technology nodes, respectively. Compared to a recent ILP-based competitor, we improve the runtime by more than three orders of magnitude, while producing improved results
我们提出了一种针对运行在嵌入式处理器上的实时应用程序的编译技术,该处理器具有动态电压缩放(DVS)和自适应体偏置(ABB)功能。考虑到在处理器的工作模式之间切换的延迟和能量损失,我们的编译器明智地在代码的选定位置插入模式切换指令,并生成保证满足最后期限约束的可执行二进制文件。更重要的是,我们的算法运行速度非常快,并且相当接近使用DVS+ABB进行能量优化的理论极限。在65nm技术下,在截止日期限制下,我们将生成代码的能量消耗平均提高了11.4%。虽然我们的技术在130nm节点上对传统分布式交换机的能耗改善很小(3%),但在90nm、65nm和45nm技术节点上,平均改善幅度分别达到4.7%、8.8%和15.4%。与最近基于ilp的竞争对手相比,我们将运行时间提高了三个数量级以上,同时产生了更好的结果
{"title":"Efficient and Scalable Compiler-Directed Energy Optimization for Realtime Applications","authors":"Po-Kuan Huang, S. Ghiasi","doi":"10.1109/DATE.2007.364386","DOIUrl":"https://doi.org/10.1109/DATE.2007.364386","url":null,"abstract":"We present a compilation technique that targets realtime applications running on embedded processors with combined dynamic voltage scaling (DVS) and adaptive body biasing (ABB) capabilities. Considering the delay and energy penalty of switching between operating modes of the processor, our compiler judiciously inserts mode switch instructions in selected locations of the code and generates executable binary that is guaranteed to meet the deadline constraint. More importantly, our algorithm runs very fast and comes reasonably close to the theoretical limit of energy optimization using DVS+ABB. At 65 nm technology, we improve the energy dissipation of the generated code by an average of11.4% under deadline constraints. While our technique's improvement in energy dissipation over conventional DVS is marginal (3%) at 130nm, the average improvement continues to grow to 4.7%, 8.8% and 15.4% for 90nm, 65nm and 45nm technology nodes, respectively. Compared to a recent ILP-based competitor, we improve the runtime by more than three orders of magnitude, while producing improved results","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130029413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Slow Write Driver Faults in 65nm SRAM Technology: Analysis and March Test Solution 65nm SRAM技术的慢写驱动故障:分析和三月测试解决方案
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364647
A. Ney, P. Girard, C. Landrault, S. Pravossoudovitch, A. Virazel, M. Bastian
This paper presents an analysis of the electrical origins of slow write driver faults (SWDFs) (van de Goor et al., 2004) that may affect SRAM write drivers in 65nm technology. This type of fault is the consequence of resistive-open defects in the control part of the write driver. It involves an erroneous write operation when the same write driver performs two successive write operations with opposite data values. In the first part of the paper, we present the SWDF electrical phenomena and their consequences on the SRAM functioning. Next, we show how SWDFs can be sensitized and observed and how a standard March test is able to detect this type of fault
本文分析了缓慢写入驱动器故障(swdf)的电气根源(van de Goor et al., 2004),这些故障可能会影响65nm技术中的SRAM写入驱动器。这种类型的故障是由于写驱动器控制部分的电阻打开缺陷造成的。当同一个写驱动程序对相反的数据值执行两个连续的写操作时,它涉及一个错误的写操作。在本文的第一部分,我们介绍了SWDF电现象及其对SRAM功能的影响。接下来,我们将展示如何敏化和观察swdf,以及标准March测试如何能够检测这种类型的故障
{"title":"Slow Write Driver Faults in 65nm SRAM Technology: Analysis and March Test Solution","authors":"A. Ney, P. Girard, C. Landrault, S. Pravossoudovitch, A. Virazel, M. Bastian","doi":"10.1109/DATE.2007.364647","DOIUrl":"https://doi.org/10.1109/DATE.2007.364647","url":null,"abstract":"This paper presents an analysis of the electrical origins of slow write driver faults (SWDFs) (van de Goor et al., 2004) that may affect SRAM write drivers in 65nm technology. This type of fault is the consequence of resistive-open defects in the control part of the write driver. It involves an erroneous write operation when the same write driver performs two successive write operations with opposite data values. In the first part of the paper, we present the SWDF electrical phenomena and their consequences on the SRAM functioning. Next, we show how SWDFs can be sensitized and observed and how a standard March test is able to detect this type of fault","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127589329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Leightweight Middleware for Seamless HW-SW Interoperability, with Application to Wireless Sensor Networks 用于无线传感器网络的HW-SW无缝互操作性轻量级中间件
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364431
F. Villanueva, D. Villa, F. Moya, Jesús Barba, F. Rincón, J.C. Lopez
HW-SW interoperability by means of standard distributed object middlewares has been proved to be useful in the design of new and challenging applications for ubiquitous computing, and ambient intelligence environments. Wireless sensor networks are considered to be essential for the proper deployment of these applications, but they impose new constraints in the design of the corresponding communication infrastructure: low-cost middleware implementations that can fit into tiny wireless devices are needed. In this paper, a novel approach for the development of pervasive environments based on an ultra low-cost implementation of standard distributed object middlewares (such as CORBA or ICE) is presented. A fully functional prototype supporting full interoperability with ZeroC ICE is described in detail. Available implementations range from the smallest microcontrollers in the market, to the tiniest embedded Java virtual machines, and even a low-end FPGA
通过标准分布式对象中间件实现的HW-SW互操作性已被证明在为普适计算和环境智能环境设计新的和具有挑战性的应用程序方面是有用的。无线传感器网络被认为是正确部署这些应用程序的必要条件,但它们在相应的通信基础设施的设计中施加了新的限制:需要适合小型无线设备的低成本中间件实现。本文提出了一种基于标准分布式对象中间件(如CORBA或ICE)的超低成本实现来开发普适环境的新方法。详细描述了支持与ZeroC ICE完全互操作性的全功能原型。可用的实现范围从市场上最小的微控制器到最小的嵌入式Java虚拟机,甚至是低端的FPGA
{"title":"Leightweight Middleware for Seamless HW-SW Interoperability, with Application to Wireless Sensor Networks","authors":"F. Villanueva, D. Villa, F. Moya, Jesús Barba, F. Rincón, J.C. Lopez","doi":"10.1109/DATE.2007.364431","DOIUrl":"https://doi.org/10.1109/DATE.2007.364431","url":null,"abstract":"HW-SW interoperability by means of standard distributed object middlewares has been proved to be useful in the design of new and challenging applications for ubiquitous computing, and ambient intelligence environments. Wireless sensor networks are considered to be essential for the proper deployment of these applications, but they impose new constraints in the design of the corresponding communication infrastructure: low-cost middleware implementations that can fit into tiny wireless devices are needed. In this paper, a novel approach for the development of pervasive environments based on an ultra low-cost implementation of standard distributed object middlewares (such as CORBA or ICE) is presented. A fully functional prototype supporting full interoperability with ZeroC ICE is described in detail. Available implementations range from the smallest microcontrollers in the market, to the tiniest embedded Java virtual machines, and even a low-end FPGA","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126484047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Instruction-Set Customization for Real-Time Embedded Systems 实时嵌入式系统指令集定制
Pub Date : 2007-04-16 DOI: 10.1145/1266366.1266690
Huynh Phung Huynh, T. Mitra
Application-specific customization of the instruction set helps embedded processors achieve significant performance and power efficiency. In this paper, we explore customization in the context of multi-tasking real-time embedded systems. We propose efficient algorithms to select the optimal set of custom instructions for a task set under two popular real-time scheduling policies. Our algorithms minimize the processor utilization through customization while satisfying the task deadlines and the constraint on silicon area. Experimental evaluation with various task sets shows that appropriate customization can achieve significant reduction in the processor utilization and the energy consumption
特定于应用程序的指令集定制有助于嵌入式处理器实现显著的性能和功耗效率。在本文中,我们探讨了多任务实时嵌入式系统背景下的定制。我们提出了在两种常用的实时调度策略下,为任务集选择最优自定义指令集的有效算法。我们的算法在满足任务期限和硅片面积限制的情况下,通过自定义将处理器利用率降至最低。各种任务集的实验评估表明,适当的定制可以显著降低处理器的利用率和能耗
{"title":"Instruction-Set Customization for Real-Time Embedded Systems","authors":"Huynh Phung Huynh, T. Mitra","doi":"10.1145/1266366.1266690","DOIUrl":"https://doi.org/10.1145/1266366.1266690","url":null,"abstract":"Application-specific customization of the instruction set helps embedded processors achieve significant performance and power efficiency. In this paper, we explore customization in the context of multi-tasking real-time embedded systems. We propose efficient algorithms to select the optimal set of custom instructions for a task set under two popular real-time scheduling policies. Our algorithms minimize the processor utilization through customization while satisfying the task deadlines and the constraint on silicon area. Experimental evaluation with various task sets shows that appropriate customization can achieve significant reduction in the processor utilization and the energy consumption","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132815406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Energy Evaluation of Software Implementations of Block Ciphers under Memory Constraints 内存约束下分组密码软件实现的能量评估
Pub Date : 2007-04-16 DOI: 10.1145/1266366.1266607
J. Großschädl, S. Tillich, Christian Rechberger, M. Hofmann, M. Medwed
Software implementations of modern block ciphers often require large lookup tables along with code size increasing optimizations like loop unrolling to reach peak performance on general-purpose processors. Therefore, block ciphers are difficult to implement efficiently on embedded devices like cell phones or sensor nodes where run-time memory and program ROM are scarce resources. In this paper, the performance, energy consumption, runtime memory requirements, and code size of the five block ciphers RC6, Rijndael, Serpent, Twofish, and XTEA on the StrongARM SA-1100 processor was analyzed and compared. Most previous evaluations of block ciphers considered performance as the sole metric of interest and did not care about memory requirements or code size. In contrast to previous work, this study of the performance and energy characteristics of block ciphers has been conducted with "lightweight" implementations which restrict the size of lookup tables to 1 kB and also impose constraints on the code size. The author found that Rijndael and RC6 can be well optimized for high performance and energy efficiency, while at the same time meeting the demand for low memory (RAM and ROM) footprint. In addition, the impact of key expansion and modes of operation on the overall performance and energy consumption of each block cipher was discussed. Simulation results show that RC6 is the most energy-efficient block cipher under memory constraints and thus the best choice for resource-restricted devices
现代块密码的软件实现通常需要大型查找表以及代码大小不断增加的优化,如循环展开,以在通用处理器上达到峰值性能。因此,分组密码很难在诸如手机或传感器节点等嵌入式设备上有效实现,因为这些设备的运行时内存和程序ROM是稀缺资源。本文对五种分组密码RC6、Rijndael、Serpent、Twofish和XTEA在StrongARM SA-1100处理器上的性能、能耗、运行时内存需求和代码大小进行了分析和比较。以前对分组密码的大多数评估都将性能视为唯一感兴趣的指标,而不关心内存需求或代码大小。与之前的工作相比,这项对分组密码的性能和能量特征的研究是通过“轻量级”实现进行的,该实现将查找表的大小限制在1 kB,并对代码大小施加约束。作者发现Rijndael和RC6可以很好地优化高性能和能效,同时满足低内存(RAM和ROM)占用的需求。此外,还讨论了密钥扩展和操作方式对分组密码整体性能和能耗的影响。仿真结果表明,在内存约束下,RC6是最节能的分组密码,是资源受限设备的最佳选择
{"title":"Energy Evaluation of Software Implementations of Block Ciphers under Memory Constraints","authors":"J. Großschädl, S. Tillich, Christian Rechberger, M. Hofmann, M. Medwed","doi":"10.1145/1266366.1266607","DOIUrl":"https://doi.org/10.1145/1266366.1266607","url":null,"abstract":"Software implementations of modern block ciphers often require large lookup tables along with code size increasing optimizations like loop unrolling to reach peak performance on general-purpose processors. Therefore, block ciphers are difficult to implement efficiently on embedded devices like cell phones or sensor nodes where run-time memory and program ROM are scarce resources. In this paper, the performance, energy consumption, runtime memory requirements, and code size of the five block ciphers RC6, Rijndael, Serpent, Twofish, and XTEA on the StrongARM SA-1100 processor was analyzed and compared. Most previous evaluations of block ciphers considered performance as the sole metric of interest and did not care about memory requirements or code size. In contrast to previous work, this study of the performance and energy characteristics of block ciphers has been conducted with \"lightweight\" implementations which restrict the size of lookup tables to 1 kB and also impose constraints on the code size. The author found that Rijndael and RC6 can be well optimized for high performance and energy efficiency, while at the same time meeting the demand for low memory (RAM and ROM) footprint. In addition, the impact of key expansion and modes of operation on the overall performance and energy consumption of each block cipher was discussed. Simulation results show that RC6 is the most energy-efficient block cipher under memory constraints and thus the best choice for resource-restricted devices","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124002600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 53
Hard Real-Time Reconfiguration Port Scheduling 硬实时重配置端口调度
Pub Date : 2007-04-16 DOI: 10.1109/DATE.2007.364578
F. Dittmann, Stefan Frank
When modern partially and dynamically reconfigurable FPGAs are to be used as resources in hard real-time systems, the two dimensions area and time have to be considered in the focus of availability and deadlines. In particular, area requirements must be guaranteed for the tasks' duration. While execution environments that abstract the space demand of tasks exist and methods for occupancy of resources over time are discussed in the literature, few works focus on another fundamental bottleneck, the reconfiguration port. As all resource requests are served by this mutually exclusive device, profound concepts for scheduling the port access are vital requirements for FPGA realtime scheduling. Nevertheless, as the port must be accessed sequentially, we can inherit and apply monoprocessor scheduling concepts that are well researched. In this paper, we introduce monoprocessor scheduling algorithms for the reconfiguration port of FPGAs
当现代部分可动态重构fpga作为资源应用于硬实时系统时,必须将面积和时间两个维度作为可用性和最后期限的重点加以考虑。特别是,必须保证任务持续时间的面积需求。虽然存在抽象任务空间需求的执行环境,并且文献中讨论了随时间占用资源的方法,但很少有作品关注另一个基本瓶颈,即重新配置端口。由于所有资源请求都由这个互斥的设备提供服务,因此对端口访问调度的深刻概念是FPGA实时调度的重要要求。然而,由于端口必须顺序访问,我们可以继承和应用已经研究好的单处理器调度概念。本文介绍了fpga重构端口的单处理器调度算法
{"title":"Hard Real-Time Reconfiguration Port Scheduling","authors":"F. Dittmann, Stefan Frank","doi":"10.1109/DATE.2007.364578","DOIUrl":"https://doi.org/10.1109/DATE.2007.364578","url":null,"abstract":"When modern partially and dynamically reconfigurable FPGAs are to be used as resources in hard real-time systems, the two dimensions area and time have to be considered in the focus of availability and deadlines. In particular, area requirements must be guaranteed for the tasks' duration. While execution environments that abstract the space demand of tasks exist and methods for occupancy of resources over time are discussed in the literature, few works focus on another fundamental bottleneck, the reconfiguration port. As all resource requests are served by this mutually exclusive device, profound concepts for scheduling the port access are vital requirements for FPGA realtime scheduling. Nevertheless, as the port must be accessed sequentially, we can inherit and apply monoprocessor scheduling concepts that are well researched. In this paper, we introduce monoprocessor scheduling algorithms for the reconfiguration port of FPGAs","PeriodicalId":298961,"journal":{"name":"2007 Design, Automation & Test in Europe Conference & Exhibition","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124526688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
期刊
2007 Design, Automation & Test in Europe Conference & Exhibition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1