首页 > 最新文献

2011 14th Euromicro Conference on Digital System Design最新文献

英文 中文
A Design Method for Programmable Two-Variable Discrete Function Generators Using Spline and Bilinear Interpolations 基于样条插值和双线性插值的可编程两变量离散函数发生器设计方法
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.94
S. Nakano, Yoichi Wakaba, Shinobu Nagayama, S. Wakabayashi
This paper presents a design method for programmable two-variable discrete (real-valued) function generators based on a piecewise polynomial approximation. To approximate a given discrete function by polynomials efficiently, we propose a hybrid approximation method using both spline and bilinear interpolations. The proposed method can significantly reduce memory size needed to implement a two-variable discrete function by accepting a small approximation error, and thus it can be used to explore design space taking into account a trade-off between memory size and approximation error. Experimental results show that the proposed design method reduces 75% of memory size without losing circuit speed by accepting only 1% error, and the circuits designed by the proposed method achieve about 650 times greater throughput than their software programs. We can automatically synthesize such compact and fast function generators using the proposed design method.
提出了一种基于分段多项式逼近的可编程二变量离散(实值)函数发生器的设计方法。为了用多项式有效地逼近给定的离散函数,我们提出了一种使用样条插值和双线性插值的混合逼近方法。该方法可以通过接受较小的近似误差来显著减少实现双变量离散函数所需的内存大小,因此可以用于探索考虑内存大小和近似误差之间权衡的设计空间。实验结果表明,该设计方法在只接受1%错误的情况下,在不损失电路速度的情况下,减少了75%的内存大小,设计的电路的吞吐量比其软件程序提高了约650倍。采用本文提出的设计方法,可以自动合成这种紧凑、快速的函数生成器。
{"title":"A Design Method for Programmable Two-Variable Discrete Function Generators Using Spline and Bilinear Interpolations","authors":"S. Nakano, Yoichi Wakaba, Shinobu Nagayama, S. Wakabayashi","doi":"10.1109/DSD.2011.94","DOIUrl":"https://doi.org/10.1109/DSD.2011.94","url":null,"abstract":"This paper presents a design method for programmable two-variable discrete (real-valued) function generators based on a piecewise polynomial approximation. To approximate a given discrete function by polynomials efficiently, we propose a hybrid approximation method using both spline and bilinear interpolations. The proposed method can significantly reduce memory size needed to implement a two-variable discrete function by accepting a small approximation error, and thus it can be used to explore design space taking into account a trade-off between memory size and approximation error. Experimental results show that the proposed design method reduces 75% of memory size without losing circuit speed by accepting only 1% error, and the circuits designed by the proposed method achieve about 650 times greater throughput than their software programs. We can automatically synthesize such compact and fast function generators using the proposed design method.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115504355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SoC and Board Modeling for Processor-Centric Board Testing 以处理器为中心的板测试的SoC和板建模
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.79
A. Tsertov, R. Ubar, A. Jutman, S. Devadze
Many contemporary electronic systems are based on System-on-Chips (SoC) such as micro-controllers or signal processors that communicate with many peripheral devices on the system board and beyond. While, SoC test was a topic of extremely high interest during the last decade, the test beyond SoCs didn't get much attention after introduction of Boundary Scan (BS) 30 years ago. It is not a surprise that the restricted capabilities of BS with respect of such modern challenges as dynamic (timing-accurate), at-speed and high-speed testing as well as in-system programming create considerable troubles for test engineers in production environments. In this paper, we point out particular challenges in testing the system's infrastructure beyond the SoCs as well as propose a general modeling methodology for test automation for microprocessor SoC-based system boards. The new so-called "Lego-style" test automation methodology forms a complimentary solution to traditional boundary scan. Together, they provide extended fault coverage that targets shorts, opens, stuck-at faults as well as dynamic faults (e.g. delays and transition faults). The "Legostyle" model allows reducing the labour effort drastically once the library of model components is created.
许多现代电子系统都基于片上系统(SoC),如微控制器或信号处理器,它们与系统板上的许多外围设备进行通信。虽然在过去十年中,SoC测试是一个非常感兴趣的话题,但在30年前引入边界扫描(BS)之后,SoC之外的测试并没有得到太多关注。对于现代挑战,如动态(定时精确)、高速和高速测试以及系统内编程,BS的有限能力给生产环境中的测试工程师带来了相当大的麻烦,这并不奇怪。在本文中,我们指出了在soc之外测试系统基础设施的特殊挑战,并提出了一种用于基于微处理器soc的系统板测试自动化的通用建模方法。新的所谓的“乐高风格”测试自动化方法形成了传统边界扫描的补充解决方案。总之,它们提供了扩展的故障覆盖,目标是短路,打开,卡在故障以及动态故障(例如延迟和转换故障)。“乐高风格”模型允许在创建模型组件库后大幅减少劳动。
{"title":"SoC and Board Modeling for Processor-Centric Board Testing","authors":"A. Tsertov, R. Ubar, A. Jutman, S. Devadze","doi":"10.1109/DSD.2011.79","DOIUrl":"https://doi.org/10.1109/DSD.2011.79","url":null,"abstract":"Many contemporary electronic systems are based on System-on-Chips (SoC) such as micro-controllers or signal processors that communicate with many peripheral devices on the system board and beyond. While, SoC test was a topic of extremely high interest during the last decade, the test beyond SoCs didn't get much attention after introduction of Boundary Scan (BS) 30 years ago. It is not a surprise that the restricted capabilities of BS with respect of such modern challenges as dynamic (timing-accurate), at-speed and high-speed testing as well as in-system programming create considerable troubles for test engineers in production environments. In this paper, we point out particular challenges in testing the system's infrastructure beyond the SoCs as well as propose a general modeling methodology for test automation for microprocessor SoC-based system boards. The new so-called \"Lego-style\" test automation methodology forms a complimentary solution to traditional boundary scan. Together, they provide extended fault coverage that targets shorts, opens, stuck-at faults as well as dynamic faults (e.g. delays and transition faults). The \"Legostyle\" model allows reducing the labour effort drastically once the library of model components is created.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115053370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Compatibility Study of Compile-Time Optimizations for Power and Reliability 编译时功率和可靠性优化的兼容性研究
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.108
G. Nazarian, C. Strydis, G. Gaydadjiev
Historically compiler optimizations have been used mainly for improving embedded systems performance. However, for a wide range of today's power restricted, battery operated embedded devices, power consumption becomes a crucial problem that is addressed by modern compilers. Biomedical implants are one good example of such embedded systems. In addition to power, such devices need to also satisfy high reliability levels. Therefore, performance, power and reliability optimizations should all be considered while designing and programming implantable systems. Various software optimizations, e.g., during compilation, can provide the necessary means to achieve this goal. Additionally the system can be configured to trade-off between the above three factors based on the specific application requirements. In this paper we categorize previous works on compiler optimizations for low power and fault tolerance. Our study considers differences in instruction count and memory overhead, fault coverage and hardware modifications. Finally, the compatibility of different methods from both optimization classes is assessed. Five compatible pairs that can be combined with few or no limitations have been identified.
编译器优化历来主要用于提高嵌入式系统的性能。然而,对于当今范围广泛的功率限制,电池供电的嵌入式设备,功耗成为现代编译器解决的关键问题。生物医学植入物就是这种嵌入式系统的一个很好的例子。除了功率,这样的设备还需要满足高可靠性水平。因此,在设计和编程可植入系统时,应考虑性能、功耗和可靠性优化。各种软件优化,例如,在编译过程中,可以提供实现这一目标的必要手段。此外,可以根据特定的应用程序需求配置系统,在上述三个因素之间进行权衡。在这篇文章中,我们对编译器在低功耗和容错方面的优化进行了分类。我们的研究考虑了指令计数和内存开销、故障覆盖率和硬件修改的差异。最后,对两类优化方法的兼容性进行了评估。已经确定了五对可以很少或没有限制地组合的兼容对。
{"title":"Compatibility Study of Compile-Time Optimizations for Power and Reliability","authors":"G. Nazarian, C. Strydis, G. Gaydadjiev","doi":"10.1109/DSD.2011.108","DOIUrl":"https://doi.org/10.1109/DSD.2011.108","url":null,"abstract":"Historically compiler optimizations have been used mainly for improving embedded systems performance. However, for a wide range of today's power restricted, battery operated embedded devices, power consumption becomes a crucial problem that is addressed by modern compilers. Biomedical implants are one good example of such embedded systems. In addition to power, such devices need to also satisfy high reliability levels. Therefore, performance, power and reliability optimizations should all be considered while designing and programming implantable systems. Various software optimizations, e.g., during compilation, can provide the necessary means to achieve this goal. Additionally the system can be configured to trade-off between the above three factors based on the specific application requirements. In this paper we categorize previous works on compiler optimizations for low power and fault tolerance. Our study considers differences in instruction count and memory overhead, fault coverage and hardware modifications. Finally, the compatibility of different methods from both optimization classes is assessed. Five compatible pairs that can be combined with few or no limitations have been identified.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123014012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Modular Fault Injector for Multiple Fault Dependability and Security Evaluations 多故障可靠性和安全性评估的模块化故障注入器
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.76
J. Grinschgl, Armin Krieg, C. Steger, R. Weiss, H. Bock, J. Haid
The increasing level of integration and decreasing size of circuit elements leads to greater probabilities of operational faults. More sensible electronic devices are also more prone to external in?uences by energizing radiation. Additionally not only natural causes of faults are a concern of today's chip designers. Especially smart cards are exposed to complex attacks through which an adversary tries to extract knowledge from a secured system by putting it into an undefined state. These problems make it increasingly necessary to test a new design for its fault robustness. Several previous publications propose the usage of single bit injection platforms, but the limited impact of these campaigns might not be the right choice to provide a wide fault attack coverage. This paper first introduces a new in-system fault injection strategy for automatic test pattern injection. Secondly, an approach is presented that provides an abstraction of the internal fault injection structures to a more generic high level view. Through this abstraction it is possible to support the task separation of design and test-engineers and to enable the emulation of physical attacks on circuit level. The controller's generalized interface provides the ability to use the developed controller on different systems using the same bus system. The high level of abstraction is combinable with the advantage of high performance autonomous emulations on high end FPGA-platforms.
电路元件集成度的提高和尺寸的减小导致运行故障的概率增大。更敏感的电子设备也更容易外接?通过激发辐射产生能量。此外,不仅自然原因的故障是一个关注今天的芯片设计师。特别是智能卡暴露在复杂的攻击中,攻击者试图通过将其置于未定义状态来从受保护的系统中提取知识。这些问题使得对新设计进行故障鲁棒性测试变得越来越有必要。以前的一些出版物建议使用单比特注入平台,但这些活动的有限影响可能不是提供广泛故障攻击覆盖的正确选择。本文首先介绍了一种新的系统内故障注入策略,用于自动注入测试模式。其次,提出了一种将内部故障注入结构抽象为更通用的高层视图的方法。通过这种抽象,可以支持设计工程师和测试工程师的任务分离,并可以在电路级上对物理攻击进行仿真。控制器的通用接口提供了在使用相同总线系统的不同系统上使用开发的控制器的能力。高抽象水平与高端fpga平台上的高性能自主仿真优势相结合。
{"title":"Modular Fault Injector for Multiple Fault Dependability and Security Evaluations","authors":"J. Grinschgl, Armin Krieg, C. Steger, R. Weiss, H. Bock, J. Haid","doi":"10.1109/DSD.2011.76","DOIUrl":"https://doi.org/10.1109/DSD.2011.76","url":null,"abstract":"The increasing level of integration and decreasing size of circuit elements leads to greater probabilities of operational faults. More sensible electronic devices are also more prone to external in?uences by energizing radiation. Additionally not only natural causes of faults are a concern of today's chip designers. Especially smart cards are exposed to complex attacks through which an adversary tries to extract knowledge from a secured system by putting it into an undefined state. These problems make it increasingly necessary to test a new design for its fault robustness. Several previous publications propose the usage of single bit injection platforms, but the limited impact of these campaigns might not be the right choice to provide a wide fault attack coverage. This paper first introduces a new in-system fault injection strategy for automatic test pattern injection. Secondly, an approach is presented that provides an abstraction of the internal fault injection structures to a more generic high level view. Through this abstraction it is possible to support the task separation of design and test-engineers and to enable the emulation of physical attacks on circuit level. The controller's generalized interface provides the ability to use the developed controller on different systems using the same bus system. The high level of abstraction is combinable with the advantage of high performance autonomous emulations on high end FPGA-platforms.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"373 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116627360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
HMMER Performance Model for Multicore Architectures 多核架构的hmm性能模型
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.111
S. Isaza, Ernst Houtgast, G. Gaydadjiev
Exponential growth in biological sequence data combined with the computationally intensive nature of bioinformatics applications results in a continuously rising demand for processing power. In this paper, we propose a performance model that captures the behavior and performance scalability of HMMER, a bioinformatics application that identifies similarities between protein sequences and a protein family model. With our analytical model, the optimal master-worker ratio for a user scenario can be estimated. The model is evaluated and is found accurate with less than 2% error. We applied our model to a widely used heterogeneous multicore, the Cell BE, using the PPE and SPEs as master and workers respectively. Experimental results show that for the current parallelization strategy, the I/O speed at which the database is read from disk and the inputs pre-processing are the two most limiting factors in the Cell BE case.
生物序列数据的指数级增长与生物信息学应用的计算密集型性质相结合,导致对处理能力的需求不断上升。在本文中,我们提出了一个性能模型,该模型捕获了HMMER的行为和性能可扩展性,HMMER是一种生物信息学应用程序,用于识别蛋白质序列和蛋白质家族模型之间的相似性。使用我们的分析模型,可以估计用户场景的最佳主工比例。对模型进行了评估,发现模型精度小于2%。我们将我们的模型应用于广泛使用的异构多核Cell BE,分别使用PPE和spe作为主节点和工作节点。实验结果表明,对于当前的并行化策略,从磁盘读取数据库的I/O速度和输入预处理是Cell BE情况下的两个最大限制因素。
{"title":"HMMER Performance Model for Multicore Architectures","authors":"S. Isaza, Ernst Houtgast, G. Gaydadjiev","doi":"10.1109/DSD.2011.111","DOIUrl":"https://doi.org/10.1109/DSD.2011.111","url":null,"abstract":"Exponential growth in biological sequence data combined with the computationally intensive nature of bioinformatics applications results in a continuously rising demand for processing power. In this paper, we propose a performance model that captures the behavior and performance scalability of HMMER, a bioinformatics application that identifies similarities between protein sequences and a protein family model. With our analytical model, the optimal master-worker ratio for a user scenario can be estimated. The model is evaluated and is found accurate with less than 2% error. We applied our model to a widely used heterogeneous multicore, the Cell BE, using the PPE and SPEs as master and workers respectively. Experimental results show that for the current parallelization strategy, the I/O speed at which the database is read from disk and the inputs pre-processing are the two most limiting factors in the Cell BE case.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129777191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
On the Cascade Implementation of Multiple-Output Sparse Logic Functions 多输出稀疏逻辑函数的级联实现
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.8
V. Dvorák, P. Mikusek
Representation of multiple-output logic functions by Multi-Terminal Binary Decision Diagrams (MTBDDs) is studied for the useful class of sparse logic functions specified by the number of true min-terms. This paper derives upper bounds on the MTBDD width, which determine the size of look-up tables (LUTs) needed for hardware realization of these functions in FPGA logic synthesis. The obtained bounds are generalization of similar known bounds for single-output logic functions. Finally a procedure how to find the optimum mapping of MTBDD to a LUT cascade is presented and illustrated on a set of benchmarks.
研究了用多终端二元决策图(mtbdd)表示多输出逻辑函数的方法。本文推导了MTBDD宽度的上界,它决定了在FPGA逻辑合成中硬件实现这些功能所需的查找表(lut)的大小。所得的界是对已知的单输出逻辑函数的类似界的推广。最后给出了如何找到MTBDD到LUT级联的最佳映射的过程,并在一组基准上进行了说明。
{"title":"On the Cascade Implementation of Multiple-Output Sparse Logic Functions","authors":"V. Dvorák, P. Mikusek","doi":"10.1109/DSD.2011.8","DOIUrl":"https://doi.org/10.1109/DSD.2011.8","url":null,"abstract":"Representation of multiple-output logic functions by Multi-Terminal Binary Decision Diagrams (MTBDDs) is studied for the useful class of sparse logic functions specified by the number of true min-terms. This paper derives upper bounds on the MTBDD width, which determine the size of look-up tables (LUTs) needed for hardware realization of these functions in FPGA logic synthesis. The obtained bounds are generalization of similar known bounds for single-output logic functions. Finally a procedure how to find the optimum mapping of MTBDD to a LUT cascade is presented and illustrated on a set of benchmarks.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116294559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
LastZ: An Ultra Optimized 3D Networks-on-Chip Architecture 一种超优化的3D片上网络架构
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.26
A. Rahmani, P. Liljeberg, J. Plosila, H. Tenhunen
3D IC technology enables NoC architectures to offer greater device integration and shorter interlayer interconnects. The primary 3D NoC architectures such as Symmetric 3D Mesh NoC could not exploit the beneficial feature of a negligible inter-layer distance in 3D chips. To cope with this, 3D NoC-Bus Hybrid architecture was proposed which is a hybrid between packet-switched network and a bus. This architecture is feasible providing both performance and area benefits, while still suffering from naive and straightforward hybridization between NoC and bus media. In this paper, an ultra optimized hybridization scheme is proposed to enhance system performance, power consumption, area and thermal issues of 3D NoC-Bus Hybrid Mesh. The scheme benefits from a rule called emph{LastZ} which enables ultra optimization of the inter-layer communication architecture. In addition, we present a wrapper to preserve the backward compatibility of the proposed architecture for connecting with the existing network interfaces. To estimate the efficiency of the proposed architecture, the system has been simulated using uniform, hotspot 10%, and Negative Exponential Distribution (NED) traffic patterns. Our extensive simulations demonstrate significant area, power, and performance improvements compared to a typical 3D NoC-Bus Hybrid Mesh architecture.
3D集成电路技术使NoC架构能够提供更高的器件集成和更短的层间互连。主要的3D NoC架构,如对称3D网格NoC,不能利用3D芯片中可忽略的层间距离的有利特性。为了解决这一问题,提出了一种分组交换网络与总线相结合的三维NoC-Bus混合架构。这种架构在提供性能和面积优势方面是可行的,但在NoC和总线介质之间仍然存在幼稚和直接的混合问题。本文提出了一种超优化的混合方案,以改善3D NoC-Bus混合网格的系统性能、功耗、面积和热问题。该方案得益于一个名为emph{LastZ}的规则,该规则可以对层间通信架构进行超优化。此外,我们还提供了一个包装器,以保持所提出的体系结构与现有网络接口连接的向后兼容性。为了评估所提出的架构的效率,采用均匀、热点10%和负指数分布(NED)流量模式对系统进行了模拟。与典型的3D NoC-Bus混合网格架构相比,我们的广泛模拟显示了显着的面积,功率和性能改进。
{"title":"LastZ: An Ultra Optimized 3D Networks-on-Chip Architecture","authors":"A. Rahmani, P. Liljeberg, J. Plosila, H. Tenhunen","doi":"10.1109/DSD.2011.26","DOIUrl":"https://doi.org/10.1109/DSD.2011.26","url":null,"abstract":"3D IC technology enables NoC architectures to offer greater device integration and shorter interlayer interconnects. The primary 3D NoC architectures such as Symmetric 3D Mesh NoC could not exploit the beneficial feature of a negligible inter-layer distance in 3D chips. To cope with this, 3D NoC-Bus Hybrid architecture was proposed which is a hybrid between packet-switched network and a bus. This architecture is feasible providing both performance and area benefits, while still suffering from naive and straightforward hybridization between NoC and bus media. In this paper, an ultra optimized hybridization scheme is proposed to enhance system performance, power consumption, area and thermal issues of 3D NoC-Bus Hybrid Mesh. The scheme benefits from a rule called emph{LastZ} which enables ultra optimization of the inter-layer communication architecture. In addition, we present a wrapper to preserve the backward compatibility of the proposed architecture for connecting with the existing network interfaces. To estimate the efficiency of the proposed architecture, the system has been simulated using uniform, hotspot 10%, and Negative Exponential Distribution (NED) traffic patterns. Our extensive simulations demonstrate significant area, power, and performance improvements compared to a typical 3D NoC-Bus Hybrid Mesh architecture.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114806025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
HDL-Mutation Based Simulation Data Generation by Propagation Guided Search 基于hdl突变的传播引导搜索仿真数据生成
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.83
Tao Xie, W. Müller, Florian Letombe
HDL-mutation based fault injection and analysis is considered as an important coverage metric for measuring the quality of design simulation processes [20, 3, 1, 2]. In this work, we try to solve the problem of automatic simulation data generation targeting HDL mutation faults. We follow a search based approach and eliminate the need for symbolic execution and mathematical constraint solving from existing work. An objective cost function is defined on the test input space and serves the guidance of search for fault-detecting test data. This is done by first mapping the simulation traces under a test onto a control and data flow graph structure which is extracted from the design. Then the progress of fault detection can be measured quantitatively on this graph to be the cost value. By minimizing this cost we approach the target test data. The effectiveness of the cost function is investigated under an example neighborhood search scheme. Case study with a floating point arithmetic IP design has shown that the cost function is able to guide effectively the search procedure towards a fault-detecting test. The cost calculation time as the search overhead was also observed to be minor compared to the actual design simulation time.
基于hdl突变的故障注入和分析被认为是衡量设计仿真过程质量的重要覆盖度量[20,3,1,2]。在这项工作中,我们试图解决针对HDL突变故障的自动模拟数据生成问题。我们遵循基于搜索的方法,消除了对现有工作的符号执行和数学约束求解的需要。在测试输入空间上定义一个目标代价函数,用于指导故障检测测试数据的搜索。这是通过首先将测试下的模拟轨迹映射到从设计中提取的控制和数据流图结构来完成的。然后在此图上可以定量地度量故障检测的进度,作为成本值。通过最小化这个成本,我们可以接近目标测试数据。以邻域搜索为例,研究了代价函数的有效性。通过浮点算法IP设计的实例研究表明,代价函数能够有效地指导搜索过程走向故障检测测试。与实际设计仿真时间相比,作为搜索开销的成本计算时间也较小。
{"title":"HDL-Mutation Based Simulation Data Generation by Propagation Guided Search","authors":"Tao Xie, W. Müller, Florian Letombe","doi":"10.1109/DSD.2011.83","DOIUrl":"https://doi.org/10.1109/DSD.2011.83","url":null,"abstract":"HDL-mutation based fault injection and analysis is considered as an important coverage metric for measuring the quality of design simulation processes [20, 3, 1, 2]. In this work, we try to solve the problem of automatic simulation data generation targeting HDL mutation faults. We follow a search based approach and eliminate the need for symbolic execution and mathematical constraint solving from existing work. An objective cost function is defined on the test input space and serves the guidance of search for fault-detecting test data. This is done by first mapping the simulation traces under a test onto a control and data flow graph structure which is extracted from the design. Then the progress of fault detection can be measured quantitatively on this graph to be the cost value. By minimizing this cost we approach the target test data. The effectiveness of the cost function is investigated under an example neighborhood search scheme. Case study with a floating point arithmetic IP design has shown that the cost function is able to guide effectively the search procedure towards a fault-detecting test. The cost calculation time as the search overhead was also observed to be minor compared to the actual design simulation time.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125246818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Rapid and Accurate Leakage Power Estimation for Nano-CMOS Circuits 纳米cmos电路泄漏功率的快速准确估计
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.92
M. Bryk, L. Józwiak, W. Kuzmicz
This paper addresses the crucial problem of static power reduction for circuits implemented in nano-CMOS technologies. Its solution requires accurate and rapid power estimation, but the known power simulators are not accurate and quick at the same time. The paper proposes and discusses a new rapid and very accurate leakage power estimation method and related simulator. The maximum estimation error of the simulator is within 5%, with an average error of only 0.57%, and run-times in the range of seconds, while for the same circuits HSPICE runs for hours or days.
本文讨论了在纳米cmos技术中实现的电路的静态功耗降低的关键问题。其解决方案要求准确、快速的功率估计,但目前已知的功率模拟器并不能同时做到准确和快速。本文提出并讨论了一种新的快速、高精度的泄漏功率估计方法及其仿真器。模拟器的最大估计误差在5%以内,平均误差仅为0.57%,运行时间在秒范围内,而对于相同的电路,HSPICE的运行时间为数小时或数天。
{"title":"Rapid and Accurate Leakage Power Estimation for Nano-CMOS Circuits","authors":"M. Bryk, L. Józwiak, W. Kuzmicz","doi":"10.1109/DSD.2011.92","DOIUrl":"https://doi.org/10.1109/DSD.2011.92","url":null,"abstract":"This paper addresses the crucial problem of static power reduction for circuits implemented in nano-CMOS technologies. Its solution requires accurate and rapid power estimation, but the known power simulators are not accurate and quick at the same time. The paper proposes and discusses a new rapid and very accurate leakage power estimation method and related simulator. The maximum estimation error of the simulator is within 5%, with an average error of only 0.57%, and run-times in the range of seconds, while for the same circuits HSPICE runs for hours or days.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125261270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Architectures for Fast Modular Multiplication 快速模块化乘法的体系结构
Pub Date : 2011-08-31 DOI: 10.1109/DSD.2011.60
Ahmet Aris, S. Yalcin, G. Saldamli
Modular multiplication is the key ingredient needed to realize most public-key cryptographic primitives. In a modular setting, multiplications are carried in two steps: namely a usual integer arithmetic followed by a reduction step. Progress in any of these steps naturally improves the modular multiplication but it is not possible to interleave the best algorithms of these stages. In this study, we propose architectures for recently proposed method of interleaving the Karatsuba-Ofman multiplier and bipartite modular reduction on the upper most layer of Karatsuba-Ofman's recursion. We manage to come up with a high performance modular multiplication architecture by taking the advantage of a fast multiplication and a parallel reduction method.
模乘法是实现大多数公钥加密原语所需的关键成分。在模设置中,乘法分两步进行:即通常的整数运算,然后是约简步骤。这些步骤中的任何一步的进展都自然地改进了模乘法,但不可能将这些阶段的最佳算法交叉使用。在这项研究中,我们提出了最近提出的在Karatsuba-Ofman递归的最上层将Karatsuba-Ofman乘法器和二分模约简交织的方法的结构。我们通过利用快速乘法和并行约简方法的优势,设法提出了一个高性能的模块化乘法架构。
{"title":"Architectures for Fast Modular Multiplication","authors":"Ahmet Aris, S. Yalcin, G. Saldamli","doi":"10.1109/DSD.2011.60","DOIUrl":"https://doi.org/10.1109/DSD.2011.60","url":null,"abstract":"Modular multiplication is the key ingredient needed to realize most public-key cryptographic primitives. In a modular setting, multiplications are carried in two steps: namely a usual integer arithmetic followed by a reduction step. Progress in any of these steps naturally improves the modular multiplication but it is not possible to interleave the best algorithms of these stages. In this study, we propose architectures for recently proposed method of interleaving the Karatsuba-Ofman multiplier and bipartite modular reduction on the upper most layer of Karatsuba-Ofman's recursion. We manage to come up with a high performance modular multiplication architecture by taking the advantage of a fast multiplication and a parallel reduction method.","PeriodicalId":267187,"journal":{"name":"2011 14th Euromicro Conference on Digital System Design","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121681069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2011 14th Euromicro Conference on Digital System Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1