首页 > 最新文献

M-SCOPES最新文献

英文 中文
Designer-in-the-loop recoding of ESL models using static parallel access conflict analysis 基于静态并行访问冲突分析的ESL模型设计器在环编码
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2463599
Xu Han, Weiwei Chen, R. Dömer
At the Electronic System Level (ESL), a well-defined design model enables early design space exploration and automatic synthesis on custom multiprocessor platforms. However, the initial design model is usually manually recoded from unstructured and sequential source code. To efficiently create cleanly structured and parallel models, this paper proposes a designer-in-the-loop approach on Eclipse platform where the system model is analyzed and recoded using automated functions. Particularly, advanced static analysis at compile time can guarantee that the parallelism in the model is safe and free from race conditions. Experiments using the tool with a class of graduate students show significant productivity gains and error reduction in model creation.
在电子系统级(ESL),一个定义良好的设计模型可以在定制的多处理器平台上进行早期的设计空间探索和自动合成。然而,最初的设计模型通常是从非结构化和顺序的源代码手动重新编码的。为了有效地创建结构清晰的并行模型,本文在Eclipse平台上提出了一种设计者在环的方法,该方法使用自动化功能对系统模型进行分析和重新编码。特别是,编译时的高级静态分析可以保证模型中的并行性是安全的,并且没有竞争条件。在一个研究生班级中使用该工具的实验表明,在模型创建中显著提高了生产率并减少了错误。
{"title":"Designer-in-the-loop recoding of ESL models using static parallel access conflict analysis","authors":"Xu Han, Weiwei Chen, R. Dömer","doi":"10.1145/2463596.2463599","DOIUrl":"https://doi.org/10.1145/2463596.2463599","url":null,"abstract":"At the Electronic System Level (ESL), a well-defined design model enables early design space exploration and automatic synthesis on custom multiprocessor platforms. However, the initial design model is usually manually recoded from unstructured and sequential source code. To efficiently create cleanly structured and parallel models, this paper proposes a designer-in-the-loop approach on Eclipse platform where the system model is analyzed and recoded using automated functions. Particularly, advanced static analysis at compile time can guarantee that the parallelism in the model is safe and free from race conditions. Experiments using the tool with a class of graduate students show significant productivity gains and error reduction in model creation.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115230555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Runtime resource allocation for software pipelines 软件管道的运行时资源分配
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2486156
J. Jahn, S. Kobbe, Santiago Pagani, Jian-Jia Chen, J. Henkel
Systems continue to comprise a rapidly growing number of cores on a single chip to gain performance benefits from parallel processing. A key challenge is how their computational resources can be used efficiently, which depends to a large degree on how their resources are allocated to the applications. In this paper, we describe our current research for addressing this challenge and highlight current and upcoming hurdles that need to be addressed.
系统继续在单个芯片上包含快速增长的核心数量,以从并行处理中获得性能优势。一个关键的挑战是如何有效地使用它们的计算资源,这在很大程度上取决于如何将它们的资源分配给应用程序。在本文中,我们描述了我们目前为应对这一挑战而进行的研究,并强调了当前和即将到来的需要解决的障碍。
{"title":"Runtime resource allocation for software pipelines","authors":"J. Jahn, S. Kobbe, Santiago Pagani, Jian-Jia Chen, J. Henkel","doi":"10.1145/2463596.2486156","DOIUrl":"https://doi.org/10.1145/2463596.2486156","url":null,"abstract":"Systems continue to comprise a rapidly growing number of cores on a single chip to gain performance benefits from parallel processing. A key challenge is how their computational resources can be used efficiently, which depends to a large degree on how their resources are allocated to the applications. In this paper, we describe our current research for addressing this challenge and highlight current and upcoming hurdles that need to be addressed.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114225298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reducing startup time of a deterministic virtualizing runtime environment 减少确定性虚拟化运行时环境的启动时间
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2463604
Martin Däumler, Matthias Werner
Virtualized runtime environments like Java Virtual Machine (JVM) or Microsoft .NET's Common Language Runtime (CLR) introduce additional challenges to real-time software development. Since applications for such environments are usually deployed in platform independent intermediate code, one issue is the timing of code transformation from intermediate code into native code. We have developed a solution for this problem, so that code transformation is suitable for real-time systems. It combines pre-compilation of intermediate code with the elimination of indirect references in native code. The gain of determinism comes with an increased application startup time. In this paper we present an optimization that utilizes an Ahead-of-Time compiler to reduce the startup time while keeping the real-time suitable timing behaviour. In an experiment we compare our approach with existing ones and demonstrate its benefits for certain application cases.
虚拟运行时环境,如Java虚拟机(JVM)或Microsoft . net的公共语言运行时(CLR),给实时软件开发带来了额外的挑战。由于用于此类环境的应用程序通常部署在与平台无关的中间代码中,因此一个问题是代码从中间代码转换为本机代码的时间。我们已经为这个问题开发了一个解决方案,使代码转换适用于实时系统。它结合了中间代码的预编译和消除本机代码中的间接引用。确定性的增加伴随着应用程序启动时间的增加。在本文中,我们提出了一种优化方法,利用提前编译器来减少启动时间,同时保持实时合适的定时行为。在一个实验中,我们将我们的方法与现有的方法进行了比较,并在某些应用案例中证明了它的优点。
{"title":"Reducing startup time of a deterministic virtualizing runtime environment","authors":"Martin Däumler, Matthias Werner","doi":"10.1145/2463596.2463604","DOIUrl":"https://doi.org/10.1145/2463596.2463604","url":null,"abstract":"Virtualized runtime environments like Java Virtual Machine (JVM) or Microsoft .NET's Common Language Runtime (CLR) introduce additional challenges to real-time software development. Since applications for such environments are usually deployed in platform independent intermediate code, one issue is the timing of code transformation from intermediate code into native code. We have developed a solution for this problem, so that code transformation is suitable for real-time systems. It combines pre-compilation of intermediate code with the elimination of indirect references in native code. The gain of determinism comes with an increased application startup time. In this paper we present an optimization that utilizes an Ahead-of-Time compiler to reduce the startup time while keeping the real-time suitable timing behaviour. In an experiment we compare our approach with existing ones and demonstrate its benefits for certain application cases.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"39 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125736998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimal placement of bank selection instructions in polynomial time 在多项式时间内银行选择指令的最优放置
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2463598
P. K. Krause
We present the first approach to Optimal Placement of Bank Selection Instructions in Polynomial Time; previous approaches were not optimal or did not provably run in polynomial time. Our approach requires the input program to be structured, which is automatically true for many programming languages and for others, such as C, is equivalent to a bound on the number of goto labels per function. When not restricted to structured programs, the problem is NP-hard. A prototype implementation in a mainstream compiler for embedded systems shows the practical feasibility of our approach. Our approach and implementation are easy to retarget for different optimization goals and architectures.
我们提出了在多项式时间内银行选择指令最优放置的第一种方法;以前的方法不是最优的,或者不能证明在多项式时间内运行。我们的方法要求输入程序是结构化的,这对于许多编程语言和其他语言(如C语言)来说是自动正确的,相当于每个函数的goto标签数量的限制。当不局限于结构化程序时,这个问题是np困难的。在嵌入式系统的主流编译器中的原型实现表明了我们的方法的实际可行性。我们的方法和实现很容易针对不同的优化目标和架构进行重新定位。
{"title":"Optimal placement of bank selection instructions in polynomial time","authors":"P. K. Krause","doi":"10.1145/2463596.2463598","DOIUrl":"https://doi.org/10.1145/2463596.2463598","url":null,"abstract":"We present the first approach to Optimal Placement of Bank Selection Instructions in Polynomial Time; previous approaches were not optimal or did not provably run in polynomial time. Our approach requires the input program to be structured, which is automatically true for many programming languages and for others, such as C, is equivalent to a bound on the number of goto labels per function. When not restricted to structured programs, the problem is NP-hard. A prototype implementation in a mainstream compiler for embedded systems shows the practical feasibility of our approach. Our approach and implementation are easy to retarget for different optimization goals and architectures.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127055349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Constraint-based code generation 基于约束的代码生成
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2486155
Roberto Castañeda Lozano, Gabriel Hjort Blindell, M. Carlsson, Frej Drejhammar, Christian Schulte
Compiler back-ends generate assembly code by solving three main tasks: instruction selection, register allocation and instruction scheduling. We introduce constraint models and solving techniques for these code generation tasks and describe how the models can be composed to generate code in unison. The use of constraint programming, a technique to model and solve combinatorial problems, makes code generation simple, flexible, robust and potentially optimal.
编译器后端通过解决三个主要任务来生成汇编代码:指令选择、寄存器分配和指令调度。我们为这些代码生成任务引入了约束模型和求解技术,并描述了如何将这些模型组合起来以一致地生成代码。约束编程是一种建模和解决组合问题的技术,它的使用使代码生成变得简单、灵活、健壮并且可能是最优的。
{"title":"Constraint-based code generation","authors":"Roberto Castañeda Lozano, Gabriel Hjort Blindell, M. Carlsson, Frej Drejhammar, Christian Schulte","doi":"10.1145/2463596.2486155","DOIUrl":"https://doi.org/10.1145/2463596.2486155","url":null,"abstract":"Compiler back-ends generate assembly code by solving three main tasks: instruction selection, register allocation and instruction scheduling. We introduce constraint models and solving techniques for these code generation tasks and describe how the models can be composed to generate code in unison. The use of constraint programming, a technique to model and solve combinatorial problems, makes code generation simple, flexible, robust and potentially optimal.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128440957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On the dictionary compression for Java card environment 关于Java卡环境下的字典压缩
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2463605
Massimiliano Zilli, Wolfgang Raschke, Johannes Loinig, R. Weiss, C. Steger
Java Card is a Java running environment developed for low-end embedded systems such as smart cards. In this context of scarce resources, ROM size plays a very important role and compression techniques help reducing program sizes as much as possible. Dictionary compression is the most promising technique and has been taken in consideration in this field by several authors. Java Card can adopt a dictionary compression scheme, substituting repeated sequences of bytecodes with new macros stored into a dictionary. This approach does not break the Java Card standard, but requires the use of an ad hoc Java virtual machine and an additional custom component in the converted applet (CAP) file. This paper presents two derived compaction techniques and discusses two scenarios: the first adopts an adaptive (dynamic) dictionary, while the second uses a static one. Although the base dictionary compression technique performs better with an adaptive dictionary, the two proposed techniques perform very close to the base one with a static dictionary. Moreover, we present a different compression mechanism based on re-engineering the CAP file through subroutines. This last technique achieves a higher compression rate, but it is fully compliant with the existing Java Card environments.
Java Card是为智能卡等低端嵌入式系统开发的Java运行环境。在这种资源稀缺的情况下,ROM大小起着非常重要的作用,压缩技术有助于尽可能地减小程序大小。字典压缩是最有前途的技术,已经被许多作者考虑在这个领域。Java Card可以采用字典压缩方案,用存储在字典中的新宏替换字节码的重复序列。这种方法不会破坏Java Card标准,但需要在转换的applet (CAP)文件中使用一个特别的Java虚拟机和一个额外的自定义组件。本文提出了两种派生的压缩技术,并讨论了两种场景:第一种采用自适应(动态)字典,而第二种使用静态字典。尽管基本字典压缩技术在使用自适应字典时性能更好,但所提出的两种技术的性能非常接近使用静态字典的基本字典压缩技术。此外,我们还提出了一种基于子程序重构CAP文件的不同压缩机制。最后一种技术实现了更高的压缩率,但它完全符合现有的Java Card环境。
{"title":"On the dictionary compression for Java card environment","authors":"Massimiliano Zilli, Wolfgang Raschke, Johannes Loinig, R. Weiss, C. Steger","doi":"10.1145/2463596.2463605","DOIUrl":"https://doi.org/10.1145/2463596.2463605","url":null,"abstract":"Java Card is a Java running environment developed for low-end embedded systems such as smart cards. In this context of scarce resources, ROM size plays a very important role and compression techniques help reducing program sizes as much as possible. Dictionary compression is the most promising technique and has been taken in consideration in this field by several authors.\u0000 Java Card can adopt a dictionary compression scheme, substituting repeated sequences of bytecodes with new macros stored into a dictionary. This approach does not break the Java Card standard, but requires the use of an ad hoc Java virtual machine and an additional custom component in the converted applet (CAP) file. This paper presents two derived compaction techniques and discusses two scenarios: the first adopts an adaptive (dynamic) dictionary, while the second uses a static one. Although the base dictionary compression technique performs better with an adaptive dictionary, the two proposed techniques perform very close to the base one with a static dictionary. Moreover, we present a different compression mechanism based on re-engineering the CAP file through subroutines. This last technique achieves a higher compression rate, but it is fully compliant with the existing Java Card environments.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128944672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Dataflow analysis for multiprocessor systems with non-starvation-free schedulers 具有非无饥饿调度程序的多处理器系统的数据流分析
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2463603
J. Hausmans, M. Wiggers, Stefan J. Geuns, M. Bekooij
Dataflow analysis techniques are suitable for the temporal analysis of real-time stream processing applications. However, the applicability of these models is currently limited to systems with starvation-free schedulers, such as Time-Division Multiplexing (TDM) schedulers. Removal of this limitation would broaden the application domain of dataflow analysis techniques significantly. In this paper we present a temporal analysis technique for Homogeneous Synchronous Dataflow (HSDF) graphs, that is also applicable for systems with non-starvation-free schedulers. Unlike existing dataflow analysis techniques, the proposed analysis technique makes use of an enabling-jitter characterization and iterative fixed-point computation. The presented approach is applicable for arbitrary (cyclic) graph topologies. Buffer capacity constraints are taken into account during the analysis and sufficient buffer capacities can be determined afterwards. The approach presented in this paper is the first approach that considers non-starvation-free schedulers in combination with arbitrary HSDF graphs The proposed dataflow analysis technique is implemented in a tool. This tool is used to evaluate the analysis technique using examples that illustrate some important differences with other temporal analysis methods. The case-study discusses how the method presented in this paper can be used to solve a problem with the inaccuracy of the temporal analysis results of a real-time stream processing system. This stream processing system consists of an FM receiver together with a DAB receiver application which both share a Digital Signal Processor (DSP).
数据流分析技术适用于实时流处理应用的时间分析。然而,这些模型的适用性目前仅限于具有无饥饿调度程序的系统,例如时分多路复用(TDM)调度程序。消除这一限制将大大拓宽数据流分析技术的应用领域。在本文中,我们提出了一种同构同步数据流(HSDF)图的时间分析技术,该技术也适用于具有非无饥饿调度程序的系统。与现有的数据流分析技术不同,所提出的分析技术利用了使能抖动表征和迭代不动点计算。该方法适用于任意(循环)图拓扑。在分析过程中考虑了缓冲容量的限制,之后可以确定足够的缓冲容量。本文提出的方法是第一个考虑非无饥饿调度器与任意HSDF图相结合的方法,所提出的数据流分析技术在工具中实现。该工具用于通过示例来评估分析技术,这些示例说明了与其他时间分析方法的一些重要差异。案例研究讨论了本文提出的方法如何用于解决实时流处理系统的时间分析结果不准确的问题。该流处理系统由FM接收器和DAB接收器应用程序组成,两者共享一个数字信号处理器(DSP)。
{"title":"Dataflow analysis for multiprocessor systems with non-starvation-free schedulers","authors":"J. Hausmans, M. Wiggers, Stefan J. Geuns, M. Bekooij","doi":"10.1145/2463596.2463603","DOIUrl":"https://doi.org/10.1145/2463596.2463603","url":null,"abstract":"Dataflow analysis techniques are suitable for the temporal analysis of real-time stream processing applications. However, the applicability of these models is currently limited to systems with starvation-free schedulers, such as Time-Division Multiplexing (TDM) schedulers. Removal of this limitation would broaden the application domain of dataflow analysis techniques significantly.\u0000 In this paper we present a temporal analysis technique for Homogeneous Synchronous Dataflow (HSDF) graphs, that is also applicable for systems with non-starvation-free schedulers. Unlike existing dataflow analysis techniques, the proposed analysis technique makes use of an enabling-jitter characterization and iterative fixed-point computation.\u0000 The presented approach is applicable for arbitrary (cyclic) graph topologies. Buffer capacity constraints are taken into account during the analysis and sufficient buffer capacities can be determined afterwards. The approach presented in this paper is the first approach that considers non-starvation-free schedulers in combination with arbitrary HSDF graphs\u0000 The proposed dataflow analysis technique is implemented in a tool. This tool is used to evaluate the analysis technique using examples that illustrate some important differences with other temporal analysis methods. The case-study discusses how the method presented in this paper can be used to solve a problem with the inaccuracy of the temporal analysis results of a real-time stream processing system. This stream processing system consists of an FM receiver together with a DAB receiver application which both share a Digital Signal Processor (DSP).","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114645939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Cyclo-static DataFlow phases scheduling optimization for buffer sizes minimization 循环静态数据流阶段调度优化缓冲区大小最小化
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2463602
M. Benazouz, Alix Munier Kordon
Cyclo-Static DataFlow (CSDF) is a powerful model for the specification of DSP applications. However, as in any asynchronous model, the synchronization of the different communicating tasks (processes) is made through buffers that have to be sized such that timing constraints are met. In this paper, we want to determine buffer sizes such that the throughput constraint is satisfied. This problem has been proved to be of exponential complexity. Exact techniques to solve this problem are too time and/or space consuming because of the self-timed schedule needed to evaluate the maximum throughput. Therefore, a periodic schedule is used. Each CSDF actor is associated with a period that satisfies the throughput constraint and sufficient buffer sizes are derived in polynomial time. However, within a period, an actor phases can be scheduled in different manners which impacts the evaluation of sufficient buffer sizes. This paper presents a Min-Max Linear Program that derives an optimized periodic phases scheduling per CSDF actor in order to minimize buffer sizes. It is shown through different applications that this Min-Max Linear Program allows to obtain close to optimal values while running in polynomial time.
循环静态数据流(CSDF)是一种功能强大的DSP应用规范模型。然而,与任何异步模型一样,不同通信任务(进程)的同步是通过缓冲区进行的,缓冲区必须调整大小,以满足时间约束。在本文中,我们想要确定缓冲区大小,以满足吞吐量约束。这个问题已被证明具有指数复杂度。解决此问题的精确技术过于耗费时间和/或空间,因为需要自定时计划来评估最大吞吐量。因此,使用周期调度。每个CSDF参与者都与满足吞吐量约束的周期相关联,并且在多项式时间内推导出足够的缓冲区大小。然而,在一个周期内,可以以不同的方式安排参与者阶段,这会影响对足够缓冲区大小的评估。本文提出了一个最小-最大线性规划,该规划导出了一个优化的周期阶段调度,每个CSDF参与者以最小化缓冲区大小。通过不同的应用表明,该最小-最大线性程序允许在多项式时间内运行时获得接近最优值。
{"title":"Cyclo-static DataFlow phases scheduling optimization for buffer sizes minimization","authors":"M. Benazouz, Alix Munier Kordon","doi":"10.1145/2463596.2463602","DOIUrl":"https://doi.org/10.1145/2463596.2463602","url":null,"abstract":"Cyclo-Static DataFlow (CSDF) is a powerful model for the specification of DSP applications. However, as in any asynchronous model, the synchronization of the different communicating tasks (processes) is made through buffers that have to be sized such that timing constraints are met. In this paper, we want to determine buffer sizes such that the throughput constraint is satisfied. This problem has been proved to be of exponential complexity. Exact techniques to solve this problem are too time and/or space consuming because of the self-timed schedule needed to evaluate the maximum throughput. Therefore, a periodic schedule is used. Each CSDF actor is associated with a period that satisfies the throughput constraint and sufficient buffer sizes are derived in polynomial time. However, within a period, an actor phases can be scheduled in different manners which impacts the evaluation of sufficient buffer sizes. This paper presents a Min-Max Linear Program that derives an optimized periodic phases scheduling per CSDF actor in order to minimize buffer sizes. It is shown through different applications that this Min-Max Linear Program allows to obtain close to optimal values while running in polynomial time.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132542521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
NoC simulation in heterogeneous architectures for PGAS programming model 面向PGAS编程模型的异构体系结构NoC仿真
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2463606
Sascha Roloff, A. Weichslgartner, Jan Heisswolf, Frank Hannig, J. Teich
Multi- and many-core systems become more and more mainstream and therefore new communication infrastructures like Networks-on-Chip (NoC) and new programming languages like IBM's X10 with its partitioned global address space (PGAS) are introduced. In this paper we present an X10-based simulator, which is capable to simulate the network traffic that occurs inside the X10 program. This holistic approach enables to simulate the functionality and the indicated traffic together, in contrast to pure network simulators where usually only synthetic traffic or traces are used. We explain how the communication overhead is extracted from the X10 run-time and how to simulate the NoC behavior. In experiments we show that the proposed simulator is up to 10 x faster than a comparable SystemC-based simulator and at the same time preserves high accuracy. Furthermore, we present a quality and simulation speed tradeoff by using different simulation modes for a set of real world parallel applications.
多核和多核系统变得越来越主流,因此引入了新的通信基础设施,如片上网络(NoC)和新的编程语言,如IBM的X10及其分区的全局地址空间(PGAS)。在本文中,我们提出了一个基于X10的模拟器,它能够模拟发生在X10程序内部的网络流量。这种整体方法可以同时模拟功能和指示的流量,而纯网络模拟器通常只使用合成流量或跟踪。我们将解释如何从X10运行时提取通信开销,以及如何模拟NoC行为。在实验中,我们表明所提出的模拟器比基于systemc的模拟器快10倍,同时保持了较高的精度。此外,我们提出了质量和仿真速度的权衡,通过使用不同的仿真模式的一组真实世界的并行应用。
{"title":"NoC simulation in heterogeneous architectures for PGAS programming model","authors":"Sascha Roloff, A. Weichslgartner, Jan Heisswolf, Frank Hannig, J. Teich","doi":"10.1145/2463596.2463606","DOIUrl":"https://doi.org/10.1145/2463596.2463606","url":null,"abstract":"Multi- and many-core systems become more and more mainstream and therefore new communication infrastructures like Networks-on-Chip (NoC) and new programming languages like IBM's X10 with its partitioned global address space (PGAS) are introduced. In this paper we present an X10-based simulator, which is capable to simulate the network traffic that occurs inside the X10 program. This holistic approach enables to simulate the functionality and the indicated traffic together, in contrast to pure network simulators where usually only synthetic traffic or traces are used. We explain how the communication overhead is extracted from the X10 run-time and how to simulate the NoC behavior. In experiments we show that the proposed simulator is up to 10 x faster than a comparable SystemC-based simulator and at the same time preserves high accuracy. Furthermore, we present a quality and simulation speed tradeoff by using different simulation modes for a set of real world parallel applications.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127504191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Embedded on-chip reliability: it's a thermal challenge 嵌入式芯片可靠性:这是一个热挑战
Pub Date : 2013-06-19 DOI: 10.1145/2463596.2488357
J. Henkel
For more than four decades Moore's Law has provided a steady exponential grow where each new technology node provided a win-win situation as shrinking features sizes not only led to more complex circuits but also led to faster and less expensive embedded on-chip systems. As Moore's Law approaches physical limits, though, reliability becomes a severe problem: aging effects like electro migration, NBTI, increased susceptibility against soft errors etc. increasingly jeopardize reliability. The talk starts with an overview of aging and soft error effects and deducts that many reliability-threatening effects are directly or indirectly related to thermal issues. The talk gives some background on thermal issues and also presents effective solutions that scale especially with respect to multi-core systems.
四十多年来,摩尔定律提供了一个稳定的指数增长,每个新技术节点都提供了一个双赢的局面,因为缩小的特征尺寸不仅导致了更复杂的电路,而且还导致了更快、更便宜的嵌入式片上系统。然而,随着摩尔定律接近物理极限,可靠性成为了一个严重的问题:老化效应,如电迁移、NBTI、对软错误的敏感性增加等,日益危及可靠性。该演讲首先概述了老化和软误差效应,并推断出许多威胁可靠性的效应与热问题直接或间接相关。该演讲给出了一些关于热问题的背景,并提出了有效的解决方案,特别是在多核系统方面。
{"title":"Embedded on-chip reliability: it's a thermal challenge","authors":"J. Henkel","doi":"10.1145/2463596.2488357","DOIUrl":"https://doi.org/10.1145/2463596.2488357","url":null,"abstract":"For more than four decades Moore's Law has provided a steady exponential grow where each new technology node provided a win-win situation as shrinking features sizes not only led to more complex circuits but also led to faster and less expensive embedded on-chip systems. As Moore's Law approaches physical limits, though, reliability becomes a severe problem: aging effects like electro migration, NBTI, increased susceptibility against soft errors etc. increasingly jeopardize reliability. The talk starts with an overview of aging and soft error effects and deducts that many reliability-threatening effects are directly or indirectly related to thermal issues. The talk gives some background on thermal issues and also presents effective solutions that scale especially with respect to multi-core systems.","PeriodicalId":344517,"journal":{"name":"M-SCOPES","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128201142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
M-SCOPES
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1