首页 > 最新文献

2013 International Symposium on Rapid System Prototyping (RSP)最新文献

英文 中文
An implementation of a distributed fault-tolerant mechanism for 2D mesh NoCs 二维网格noc分布式容错机制的实现
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683954
C. Marcon, Alexandre M. Amory, F. T. Bortolon, T. Webber, Thomas Volpato, Jader Munareto
Advances in design integration have enabled the integration of large Multiprocessor Systems-on-Chip (MPSoC). Such systems are prone to the execution of complex applications if high degree of parallelism is employed on the communication infrastructure. Network-on-Chip (NoC) has emerged as a new communication paradigm for large MPSoCs with advantages such as the increase of reliability on components interactions. However, device's integration may convey few shortcomings during MPSoC manufacturing and operation, for instance, the vulnerability to faults. This paper describes Phoenix, which is a direct mesh NoC with fault detection scheme. The proposed architecture explores a fault-tolerant mechanism, which is implemented in a distributed manner as a fault monitor on processors and routers. Results demonstrate that Phoenix can be scalable in view of the stabilization time regarding to faults incidence, allowing MPSoC operation even with the occurrence of a large number of faults.
设计集成的进步使大型多处理器片上系统(MPSoC)的集成成为可能。如果在通信基础设施上采用高度并行性,这样的系统容易执行复杂的应用程序。片上网络(NoC)已成为大型mpsoc的一种新的通信模式,具有提高组件交互可靠性等优点。然而,在MPSoC的制造和运行过程中,器件的集成可能会带来一些缺点,例如,容易出现故障。本文介绍了一种带故障检测的直接网格NoC方案Phoenix。所提出的体系结构探索了一种容错机制,该机制以分布式方式实现,作为处理器和路由器上的故障监视器。结果表明,鉴于故障发生的稳定时间,Phoenix具有可扩展性,即使发生大量故障,MPSoC也可以正常运行。
{"title":"An implementation of a distributed fault-tolerant mechanism for 2D mesh NoCs","authors":"C. Marcon, Alexandre M. Amory, F. T. Bortolon, T. Webber, Thomas Volpato, Jader Munareto","doi":"10.1109/RSP.2013.6683954","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683954","url":null,"abstract":"Advances in design integration have enabled the integration of large Multiprocessor Systems-on-Chip (MPSoC). Such systems are prone to the execution of complex applications if high degree of parallelism is employed on the communication infrastructure. Network-on-Chip (NoC) has emerged as a new communication paradigm for large MPSoCs with advantages such as the increase of reliability on components interactions. However, device's integration may convey few shortcomings during MPSoC manufacturing and operation, for instance, the vulnerability to faults. This paper describes Phoenix, which is a direct mesh NoC with fault detection scheme. The proposed architecture explores a fault-tolerant mechanism, which is implemented in a distributed manner as a fault monitor on processors and routers. Results demonstrate that Phoenix can be scalable in view of the stabilization time regarding to faults incidence, allowing MPSoC operation even with the occurrence of a large number of faults.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122560203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quota setting router architecture for quality of service in GALS NoC GALS NoC中服务质量的配额设置路由器架构
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683957
Kazem Cheshmi, M. Soltaniyeh, S. Mohammadi, Jelena Trajkovic
Network on Chip (NoC) is a new communication paradigm for emerging multi- and many-core architectures. Despite major benefits, like scalability and power efficiency, it suffers from lack of guaranteed bounded latency. Many contemporary applications, like multimedia and real-time applications, require such a guarantee. The growth of these applications in embedded systems emphasizes the need for guaranteed services in NoCs. Additionally, increasing numbers of cores in NoCs highlights the clock distribution issue. Globally asynchronous locally synchronous (GALS) NoC architectures propose to solve this issue through using asynchronous routers to connect synchronous blocks. This paper presents a novel approach for guaranteed service in a GALS NoC by using router with set port quota. We propose a novel router architecture which facilitates guaranteed latency for accessing shared media. Our simulations show up to 39% improvement in latency, with a negligible (up to 5%) power overhead.
片上网络(NoC)是新兴的多核和多核架构的一种新的通信范式。尽管具有可伸缩性和能效等主要优点,但它的缺点是缺乏保证的有限延迟。许多当代应用,如多媒体和实时应用,都需要这样的保证。嵌入式系统中这些应用程序的增长强调了noc中有保障服务的需求。此外,noc中内核数量的增加突出了时钟分布问题。全局异步本地同步(global asynchronous local synchronous, GALS) NoC架构通过使用异步路由器连接同步块来解决这个问题。提出了一种在GALS NoC中使用设置端口配额的路由器来保证服务的新方法。我们提出了一种新的路由器架构,以保证访问共享媒体的延迟。我们的模拟显示,延迟提高了39%,而功耗开销可以忽略不计(最多5%)。
{"title":"Quota setting router architecture for quality of service in GALS NoC","authors":"Kazem Cheshmi, M. Soltaniyeh, S. Mohammadi, Jelena Trajkovic","doi":"10.1109/RSP.2013.6683957","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683957","url":null,"abstract":"Network on Chip (NoC) is a new communication paradigm for emerging multi- and many-core architectures. Despite major benefits, like scalability and power efficiency, it suffers from lack of guaranteed bounded latency. Many contemporary applications, like multimedia and real-time applications, require such a guarantee. The growth of these applications in embedded systems emphasizes the need for guaranteed services in NoCs. Additionally, increasing numbers of cores in NoCs highlights the clock distribution issue. Globally asynchronous locally synchronous (GALS) NoC architectures propose to solve this issue through using asynchronous routers to connect synchronous blocks. This paper presents a novel approach for guaranteed service in a GALS NoC by using router with set port quota. We propose a novel router architecture which facilitates guaranteed latency for accessing shared media. Our simulations show up to 39% improvement in latency, with a negligible (up to 5%) power overhead.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115809265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Emulation-based design evaluation of reader/smart card systems 基于仿真的读卡器/智能卡系统设计评价
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683962
N. Druml, M. Menghin, Daniel Kroisleitner, C. Steger, R. Weiss, H. Bock, J. Haid
Design exploration and evaluation are essential tasks during a product's development cycle. Simulation and hardware emulation are common techniques to explore and evaluate the functionality of hardware/software designs. However, when it comes to distributed secure applications, like contactless reader/smart card systems, non-functional design properties and system aspects (e.g., conctactless power transfer, power consumption) have to be regarded too. State-of-the-art simulation-based and emulation-based design exploration tools cover these design issues and system aspects only to some extent. Here we present a design exploration framework for complete reader/smart card systems using state-of-the-art model-based emulation and estimation techniques. This novel system-based approach is of high importance because of the high availability of battery powered mobile readers (i.e. smart phones) and novel mobile application fields. Contactless power transfer and power consumption analyses of reader and smart cards can be performed for each clock cycle and in real time. Thus, novel system-level power and security optimization techniques can be evaluated considering the reader/smart card system as a whole. We demonstrate the application of our exploration framework by means of a typical Diffie-Hellman key exchange between reader and smart card and highlight power optimization possibilities.
设计探索和评估是产品开发周期中必不可少的任务。仿真和硬件仿真是探索和评估硬件/软件设计功能的常用技术。然而,当涉及到分布式安全应用程序时,如非接触式读卡器/智能卡系统,非功能设计属性和系统方面(例如,非接触式功率传输,功耗)也必须考虑。最先进的基于仿真和基于仿真的设计探索工具只在一定程度上涵盖了这些设计问题和系统方面。在这里,我们提出了一个完整的阅读器/智能卡系统的设计探索框架,使用最先进的基于模型的仿真和估计技术。这种新颖的基于系统的方法非常重要,因为电池供电的移动阅读器(即智能手机)的高可用性和新的移动应用领域。阅读器和智能卡的非接触式功率传输和功耗分析可以在每个时钟周期内实时执行。因此,可以将读卡器/智能卡系统作为一个整体来评估新的系统级功率和安全优化技术。我们通过在读卡器和智能卡之间进行典型的迪菲-赫尔曼密钥交换来演示我们的探索框架的应用,并强调功率优化的可能性。
{"title":"Emulation-based design evaluation of reader/smart card systems","authors":"N. Druml, M. Menghin, Daniel Kroisleitner, C. Steger, R. Weiss, H. Bock, J. Haid","doi":"10.1109/RSP.2013.6683962","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683962","url":null,"abstract":"Design exploration and evaluation are essential tasks during a product's development cycle. Simulation and hardware emulation are common techniques to explore and evaluate the functionality of hardware/software designs. However, when it comes to distributed secure applications, like contactless reader/smart card systems, non-functional design properties and system aspects (e.g., conctactless power transfer, power consumption) have to be regarded too. State-of-the-art simulation-based and emulation-based design exploration tools cover these design issues and system aspects only to some extent. Here we present a design exploration framework for complete reader/smart card systems using state-of-the-art model-based emulation and estimation techniques. This novel system-based approach is of high importance because of the high availability of battery powered mobile readers (i.e. smart phones) and novel mobile application fields. Contactless power transfer and power consumption analyses of reader and smart cards can be performed for each clock cycle and in real time. Thus, novel system-level power and security optimization techniques can be evaluated considering the reader/smart card system as a whole. We demonstrate the application of our exploration framework by means of a typical Diffie-Hellman key exchange between reader and smart card and highlight power optimization possibilities.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114766956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
YAPPA: A compiler-based parallelization framework for irregular applications on MPSoCs YAPPA:基于编译器的并行化框架,用于mpsoc上的不规则应用程序
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683968
S. Lovergine, Antonino Tumeo, Oreste Villa, Fabrizio Ferrandi
Modern embedded systems include hundreds of cores. Because of the difficulty in providing a fast, coherent memory architecture, these systems usually rely on noncoherent, non-uniform memory architectures with private memories for each core. However, programming these systems poses significant challenges. The developer must extract large amounts of parallelism, while orchestrating communication among cores to optimize application performance. These issues become even more significant with irregular applications, which present data sets difficult to partition, unpredictable memory accesses, unbalanced control flow and fine grained communication. Hand-optimizing every single aspect is hard and time-consuming, and it often does not lead to the expected performance. There is a growing gap between such complex and highly-parallel architectures and the high level languages used to describe the specification, which were designed for simpler systems and do not consider these new issues. In this paper we introduce YAPPA (Yet Another Parallel Programming Approach), a compilation framework for the automatic parallelization of irregular applications on modern MPSoCs based on LLVM. We start by considering an efficient parallel programming approach for irregular applications on distributed memory systems. We then propose a set of transformations that can reduce the development and optimization effort. The results of our initial prototype confirm the correctness of the proposed approach.
现代嵌入式系统包含数百个内核。由于难以提供快速、一致的内存体系结构,这些系统通常依赖于每个核心具有私有内存的非一致、非统一的内存体系结构。然而,对这些系统进行编程带来了巨大的挑战。开发人员必须提取大量的并行性,同时编排核心之间的通信以优化应用程序性能。对于不规则应用程序,这些问题变得更加严重,这些不规则应用程序呈现难以分区的数据集、不可预测的内存访问、不平衡的控制流和细粒度通信。手动优化每一个方面都是困难和耗时的,而且它通常不会带来预期的性能。这种复杂和高度并行的体系结构与用于描述规范的高级语言之间的差距越来越大,这些语言是为更简单的系统设计的,没有考虑这些新问题。在本文中,我们介绍了YAPPA (Yet Another Parallel Programming Approach),这是一个基于LLVM的现代mpsoc上不规则应用程序自动并行化的编译框架。我们首先考虑分布式内存系统上不规则应用程序的高效并行编程方法。然后我们提出一组可以减少开发和优化工作的转换。我们的初始原型的结果证实了所提出的方法的正确性。
{"title":"YAPPA: A compiler-based parallelization framework for irregular applications on MPSoCs","authors":"S. Lovergine, Antonino Tumeo, Oreste Villa, Fabrizio Ferrandi","doi":"10.1109/RSP.2013.6683968","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683968","url":null,"abstract":"Modern embedded systems include hundreds of cores. Because of the difficulty in providing a fast, coherent memory architecture, these systems usually rely on noncoherent, non-uniform memory architectures with private memories for each core. However, programming these systems poses significant challenges. The developer must extract large amounts of parallelism, while orchestrating communication among cores to optimize application performance. These issues become even more significant with irregular applications, which present data sets difficult to partition, unpredictable memory accesses, unbalanced control flow and fine grained communication. Hand-optimizing every single aspect is hard and time-consuming, and it often does not lead to the expected performance. There is a growing gap between such complex and highly-parallel architectures and the high level languages used to describe the specification, which were designed for simpler systems and do not consider these new issues. In this paper we introduce YAPPA (Yet Another Parallel Programming Approach), a compilation framework for the automatic parallelization of irregular applications on modern MPSoCs based on LLVM. We start by considering an efficient parallel programming approach for irregular applications on distributed memory systems. We then propose a set of transformations that can reduce the development and optimization effort. The results of our initial prototype confirm the correctness of the proposed approach.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127361338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Rapid safety evaluation of hardware architectural designs compliant with ISO 26262 符合ISO 26262的硬件架构设计的快速安全评估
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683960
N. Adler, S. Otten, Markus Mohrhard, K. Müller-Glaser
The international standard ISO 26262 “Road vehicles - Functional safety” claims qualitative and quantitative analysis of hardware designs at the appropriate level of abstraction. For large-scaled hardware designs, these evaluations have to be initiated early in development adequate to hardware architectural design and not delayed to hardware detailed design at the level of electronic schematics. Therefore, we describe a structural modeling and annotation of failure data for hardware architectural designs. Based on a top-down qualitative fault tree analysis, the classification of hardware failure modes in context of system behavior can be determined according to ISO 26262. Using these classifications and assumed failure rates, we facilitate a rapid quantitative safety analysis regarding evaluation of the hardware architectural metrics and evaluation of safety goal violations.
国际标准ISO 26262“道路车辆-功能安全”要求在适当的抽象级别上对硬件设计进行定性和定量分析。对于大型硬件设计,这些评估必须在硬件架构设计的开发早期开始,而不是延迟到电子原理图级别的硬件详细设计。因此,我们描述了硬件建筑设计中故障数据的结构建模和注释。基于自顶向下的定性故障树分析,可以根据ISO 26262确定系统行为背景下硬件故障模式的分类。使用这些分类和假设的故障率,我们促进了关于硬件架构度量和安全目标违反评估的快速定量安全分析。
{"title":"Rapid safety evaluation of hardware architectural designs compliant with ISO 26262","authors":"N. Adler, S. Otten, Markus Mohrhard, K. Müller-Glaser","doi":"10.1109/RSP.2013.6683960","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683960","url":null,"abstract":"The international standard ISO 26262 “Road vehicles - Functional safety” claims qualitative and quantitative analysis of hardware designs at the appropriate level of abstraction. For large-scaled hardware designs, these evaluations have to be initiated early in development adequate to hardware architectural design and not delayed to hardware detailed design at the level of electronic schematics. Therefore, we describe a structural modeling and annotation of failure data for hardware architectural designs. Based on a top-down qualitative fault tree analysis, the classification of hardware failure modes in context of system behavior can be determined according to ISO 26262. Using these classifications and assumed failure rates, we facilitate a rapid quantitative safety analysis regarding evaluation of the hardware architectural metrics and evaluation of safety goal violations.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127009960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Rapid design and prototyping of a reconfigurable decoder architecture for QC-LDPC codes QC-LDPC码可重构解码器架构的快速设计与原型设计
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683963
Purushotham Murugappa, Vianney Lapôtre, A. Baghdadi, M. Jézéquel
Many modern and emerging designs require having efficient dynamically reconfigurable and reprogrammable processors. However, when the implemented design needs an upgrade, newly added features have to be quickly supported and validated. This is clearly noticed in modern receivers of recent wireless communication standards that feature continuously different frame lengths and code rates for the channel decoder. This paper explores with an example the possibility of realizing a flexible channel decoder to implement and validate new/incremental algorithm changes with fast turnaround time in design. An application specific instruction-set processor (ASIP) is proposed as flexible core that can decode low-density parity-check (LDPC) codes with the various block sizes and code rates as specified in WiFi and WiMAX standards. Furthermore, the proposed architecture enables quick support of other Quasi-Cyclic LDPC (QC-LDPC) codes, e.g. DVB-S2, with simple incremental hardware changes at design time.
许多现代和新兴的设计需要具有高效的动态可重构和可重新编程的处理器。但是,当实现的设计需要升级时,必须快速支持和验证新添加的功能。这在最近无线通信标准的现代接收器中清楚地注意到,该标准具有连续不同的帧长度和信道解码器的码率。本文通过一个例子探讨了实现灵活通道解码器的可能性,以实现和验证设计中快速周转时间的新/增量算法更改。提出了一种应用专用指令集处理器(ASIP)作为灵活核心,可以解码WiFi和WiMAX标准中规定的具有不同块大小和码率的低密度奇偶校验(LDPC)码。此外,所提出的架构能够快速支持其他准循环LDPC (QC-LDPC)代码,例如DVB-S2,只需在设计时进行简单的增量硬件更改。
{"title":"Rapid design and prototyping of a reconfigurable decoder architecture for QC-LDPC codes","authors":"Purushotham Murugappa, Vianney Lapôtre, A. Baghdadi, M. Jézéquel","doi":"10.1109/RSP.2013.6683963","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683963","url":null,"abstract":"Many modern and emerging designs require having efficient dynamically reconfigurable and reprogrammable processors. However, when the implemented design needs an upgrade, newly added features have to be quickly supported and validated. This is clearly noticed in modern receivers of recent wireless communication standards that feature continuously different frame lengths and code rates for the channel decoder. This paper explores with an example the possibility of realizing a flexible channel decoder to implement and validate new/incremental algorithm changes with fast turnaround time in design. An application specific instruction-set processor (ASIP) is proposed as flexible core that can decode low-density parity-check (LDPC) codes with the various block sizes and code rates as specified in WiFi and WiMAX standards. Furthermore, the proposed architecture enables quick support of other Quasi-Cyclic LDPC (QC-LDPC) codes, e.g. DVB-S2, with simple incremental hardware changes at design time.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"216 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129445987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
MAMPSx: A design framework for rapid synthesis of predictable heterogeneous MPSoCs MAMPSx:快速合成可预测异构mpsoc的设计框架
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683970
Shakith Fernando, Firew Siyoum, Yifan He, Akash Kumar, H. Corporaal
Heterogeneous Multiprocessor System-on-Chips (HMPSoC) are becoming popular as a means of meeting energy efficiency requirements of modern embedded systems. However, as these HMPSoCs run multimedia applications as well, they also need to meet real-time requirements. Designing these predictable HMPSoCs is a key challenge, as the current design methods for these platforms are either semi-automated, non-predictable, or have limited heterogeneity. In this paper, we propose a design framework to generate and program HMPSoC designs in a rapid and predictable manner. It takes the application specifications and the architecture model as input and generates the entire HMPSoC, for FPGA prototyping, that meets the throughput constraints. The experimental results show that our framework can provide a conservative bound on the worst-case throughput of the FPGA implementation. We also present results of a case study that computes the area-power trade-offs of an industrial vision application. The entire design space exploration of all configurations was completed in 8 hours. A tool-chain targeting the Xilinx Zynq FPGA is also presented.
异构多处理器片上系统(HMPSoC)作为满足现代嵌入式系统能效要求的一种手段正日益流行。然而,由于这些hmpsoc也运行多媒体应用程序,它们还需要满足实时性要求。设计这些可预测的hmpsoc是一个关键挑战,因为目前这些平台的设计方法要么是半自动化的,要么是不可预测的,要么是有限的异构性。在本文中,我们提出了一个设计框架,以快速和可预测的方式生成和编程HMPSoC设计。它将应用规范和架构模型作为输入,并生成整个HMPSoC,用于FPGA原型设计,满足吞吐量限制。实验结果表明,该框架可以为FPGA实现的最坏情况吞吐量提供一个保守的边界。我们还介绍了一个案例研究的结果,该研究计算了工业视觉应用的面积-功率权衡。所有配置的整个设计空间探索在8小时内完成。提出了一种针对Xilinx Zynq FPGA的工具链。
{"title":"MAMPSx: A design framework for rapid synthesis of predictable heterogeneous MPSoCs","authors":"Shakith Fernando, Firew Siyoum, Yifan He, Akash Kumar, H. Corporaal","doi":"10.1109/RSP.2013.6683970","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683970","url":null,"abstract":"Heterogeneous Multiprocessor System-on-Chips (HMPSoC) are becoming popular as a means of meeting energy efficiency requirements of modern embedded systems. However, as these HMPSoCs run multimedia applications as well, they also need to meet real-time requirements. Designing these predictable HMPSoCs is a key challenge, as the current design methods for these platforms are either semi-automated, non-predictable, or have limited heterogeneity. In this paper, we propose a design framework to generate and program HMPSoC designs in a rapid and predictable manner. It takes the application specifications and the architecture model as input and generates the entire HMPSoC, for FPGA prototyping, that meets the throughput constraints. The experimental results show that our framework can provide a conservative bound on the worst-case throughput of the FPGA implementation. We also present results of a case study that computes the area-power trade-offs of an industrial vision application. The entire design space exploration of all configurations was completed in 8 hours. A tool-chain targeting the Xilinx Zynq FPGA is also presented.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"77 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123270953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
FlexOE: A congestion-aware routing algorithm for NoCs FlexOE:用于noc的拥塞感知路由算法
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683958
O. A. D. L. Junior, V. Fresse, F. Rousseau
Networks-on-Chip (NoCs) are currently the most appropriate communication structure for many-core embedded systems. Those networks support many real-time data flows. Their performance depends directly on the routing strategy. In this paper, we present a new congestion-aware routing algorithm (FlexOE) based on a simple and flexible scheme of prioritized sets of rules. These sets of rules are based on the Odd-Even turn model, minimal paths checking, congestion information from adjacent routers and availability of output path. The algorithm FlexOE developed is integrated on a Hermes NoC, and then implemented on an FPGA. The evaluation results point out that FlexOE has greater performances than reference algorithms for some test scenarios and similar performances for others test scenarios.
片上网络(noc)是目前最适合多核嵌入式系统的通信结构。这些网络支持许多实时数据流。它们的性能直接取决于路由策略。在本文中,我们提出了一种新的基于简单灵活的优先规则集方案的拥塞感知路由算法(FlexOE)。这些规则集基于奇偶回合模型、最小路径检查、来自相邻路由器的拥塞信息和输出路径的可用性。FlexOE开发的算法集成在Hermes NoC上,然后在FPGA上实现。评估结果表明,FlexOE在某些测试场景下的性能优于参考算法,而在其他测试场景下的性能与参考算法相似。
{"title":"FlexOE: A congestion-aware routing algorithm for NoCs","authors":"O. A. D. L. Junior, V. Fresse, F. Rousseau","doi":"10.1109/RSP.2013.6683958","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683958","url":null,"abstract":"Networks-on-Chip (NoCs) are currently the most appropriate communication structure for many-core embedded systems. Those networks support many real-time data flows. Their performance depends directly on the routing strategy. In this paper, we present a new congestion-aware routing algorithm (FlexOE) based on a simple and flexible scheme of prioritized sets of rules. These sets of rules are based on the Odd-Even turn model, minimal paths checking, congestion information from adjacent routers and availability of output path. The algorithm FlexOE developed is integrated on a Hermes NoC, and then implemented on an FPGA. The evaluation results point out that FlexOE has greater performances than reference algorithms for some test scenarios and similar performances for others test scenarios.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125774386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Performance modeling for designing NoC-based multiprocessors 基于noc的多处理器设计的性能建模
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683955
Takashi Nakada, Shinobu Miwa, K. Yano, Hiroshi Nakamura
Network-on-Chip (NoC) based multiprocessors have become popular as a scalable alternative to classical bus architectures. The performance evaluation of NoC-based multiprocessors is largely based on simulation. However, precise simulation is extremely slow. Additionally, there are many design parameters that affect the total performance. Therefore, it is practically impossible to use the precise simulation for the design space exploration purposes. To alleviate this problem, prototyping NoC systems and estimating their performances are critically important. In this paper, we present a generalized novel performance model that combined with the simulations for designing NoC-based multiprocessors. We revealed that the performance impact of cache and network latencies are dominant. Moreover, network congestion rarely happens under near appropriate configuration. Thus, the performance model is mainly constructed using the hardware parameters and the statistics that obtained from a simple cache simulation that is separated from the network behavior. The proposed performance model is used not only to obtain fast and accurate performance, but also to guide the NoC-based multiprocessor design space exploration. The accuracy of our approach and its practical use are illustrated through simulation. The results showed that proposed model can estimate performance with only 3.4% error on average and 21% at worst. We also confirmed that our evaluation framework can estimate 360 times faster than the brute force full system simulation.
基于片上网络(NoC)的多处理器已经成为经典总线架构的可扩展替代方案。基于noc的多处理器的性能评估主要基于仿真。然而,精确的模拟是非常缓慢的。此外,还有许多设计参数会影响总体性能。因此,用精确的仿真来设计空间探索实际上是不可能的。为了缓解这一问题,原型NoC系统和评估其性能是至关重要的。在本文中,我们提出了一个通用的新型性能模型,并结合仿真设计了基于noc的多处理器。我们发现缓存和网络延迟对性能的影响占主导地位。此外,在适当的配置下,网络拥塞很少发生。因此,性能模型主要使用硬件参数和从与网络行为分离的简单缓存模拟中获得的统计数据来构建。所提出的性能模型不仅可以获得快速准确的性能,而且可以指导基于noc的多处理器设计空间探索。通过仿真说明了该方法的准确性和实际应用。结果表明,该模型的平均误差为3.4%,最坏误差为21%。我们还证实,我们的评估框架的估计速度比蛮力全系统模拟快360倍。
{"title":"Performance modeling for designing NoC-based multiprocessors","authors":"Takashi Nakada, Shinobu Miwa, K. Yano, Hiroshi Nakamura","doi":"10.1109/RSP.2013.6683955","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683955","url":null,"abstract":"Network-on-Chip (NoC) based multiprocessors have become popular as a scalable alternative to classical bus architectures. The performance evaluation of NoC-based multiprocessors is largely based on simulation. However, precise simulation is extremely slow. Additionally, there are many design parameters that affect the total performance. Therefore, it is practically impossible to use the precise simulation for the design space exploration purposes. To alleviate this problem, prototyping NoC systems and estimating their performances are critically important. In this paper, we present a generalized novel performance model that combined with the simulations for designing NoC-based multiprocessors. We revealed that the performance impact of cache and network latencies are dominant. Moreover, network congestion rarely happens under near appropriate configuration. Thus, the performance model is mainly constructed using the hardware parameters and the statistics that obtained from a simple cache simulation that is separated from the network behavior. The proposed performance model is used not only to obtain fast and accurate performance, but also to guide the NoC-based multiprocessor design space exploration. The accuracy of our approach and its practical use are illustrated through simulation. The results showed that proposed model can estimate performance with only 3.4% error on average and 21% at worst. We also confirmed that our evaluation framework can estimate 360 times faster than the brute force full system simulation.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132739199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A flexible framework for modeling and simulation of multipurpose wireless networks 一种灵活的多用途无线网络建模与仿真框架
Pub Date : 2013-12-16 DOI: 10.1109/RSP.2013.6683964
Vinicius Bohrer, Ramon Fernandes, C. Marcon, T. Webber, L. Poehls, R. Czekster, Fabiano Hessel
The emergence of wireless networks has contributed to a growing number of studies and protocols regarding its performance and reliability requirements, among others. Several issues have to be considered when deploying such devices under harsh environmental conditions. These issues often force the designer to adopt decisions that are usually difficult to verify in real world settings. In order to mitigate such problems, an alternative resides in the use of simulation models for both homogeneous and heterogeneous devices. This paper describes an event-based Wireless Network Simulator (WiNeS) for devices operating in several topologies and configurations of networks. WiNeS is a Java-based framework specially built to support customized network options that offers hybrid simulation for virtual and physical nodes in the same environment. Some of WiNeS' features include the computation of maximum communication distances among devices in 2D and 3D spatial node distributions as well as pairing rules to evaluate the nodes connectivity.
无线网络的出现促进了越来越多的关于其性能和可靠性要求的研究和协议。在恶劣的环境条件下部署此类设备时,必须考虑几个问题。这些问题通常会迫使设计师做出难以在现实环境中验证的决策。为了减轻这些问题,另一种方法是对同质和异质设备使用模拟模型。本文描述了一个基于事件的无线网络模拟器(WiNeS),用于在多种拓扑和网络配置中运行的设备。WiNeS是一个基于java的框架,专门用于支持定制的网络选项,为同一环境中的虚拟和物理节点提供混合模拟。WiNeS的一些功能包括计算2D和3D空间节点分布中设备之间的最大通信距离,以及评估节点连通性的配对规则。
{"title":"A flexible framework for modeling and simulation of multipurpose wireless networks","authors":"Vinicius Bohrer, Ramon Fernandes, C. Marcon, T. Webber, L. Poehls, R. Czekster, Fabiano Hessel","doi":"10.1109/RSP.2013.6683964","DOIUrl":"https://doi.org/10.1109/RSP.2013.6683964","url":null,"abstract":"The emergence of wireless networks has contributed to a growing number of studies and protocols regarding its performance and reliability requirements, among others. Several issues have to be considered when deploying such devices under harsh environmental conditions. These issues often force the designer to adopt decisions that are usually difficult to verify in real world settings. In order to mitigate such problems, an alternative resides in the use of simulation models for both homogeneous and heterogeneous devices. This paper describes an event-based Wireless Network Simulator (WiNeS) for devices operating in several topologies and configurations of networks. WiNeS is a Java-based framework specially built to support customized network options that offers hybrid simulation for virtual and physical nodes in the same environment. Some of WiNeS' features include the computation of maximum communication distances among devices in 2D and 3D spatial node distributions as well as pairing rules to evaluate the nodes connectivity.","PeriodicalId":227927,"journal":{"name":"2013 International Symposium on Rapid System Prototyping (RSP)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130742817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2013 International Symposium on Rapid System Prototyping (RSP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1