首页 > 最新文献

2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation最新文献

英文 中文
A Side-channel Attack Resistant Programmable PKC Coprocessor for Embedded Applications 用于嵌入式应用的抗侧信道攻击可编程PKC协处理器
N. Mentens, K. Sakiyama, L. Batina, B. Preneel, I. Verbauwhede
This paper describes the design of a programmable coprocessor for public key cryptography (PKC) on an FPGA. The implementation provides a very broad range of functions together with countermeasures against side-channel analysis (SCA) attacks. The functions are implemented in a hierarchical manner, where all levels are accessible by the user. This makes the coprocessor very flexible and particularly suitable to be used in embedded environments where the border between hardware and software needs to be decided depending on the application. Especially for RSA, the resulting implementation on an XC3S5000 FPGA, from the low-cost Spartan series of Xilinx, shows comparable performance figures compared to the state-of- the-art in PKC coprocessors.
本文介绍了一种基于FPGA的可编程公钥加密协处理器的设计。该实现提供了非常广泛的功能以及针对侧信道分析(SCA)攻击的对策。这些功能以分层方式实现,用户可以访问所有级别。这使得协处理器非常灵活,特别适合在需要根据应用决定硬件和软件之间边界的嵌入式环境中使用。特别是对于RSA,在Xilinx低成本Spartan系列的XC3S5000 FPGA上实现的结果显示,与最先进的PKC协处理器相比,其性能数据相当。
{"title":"A Side-channel Attack Resistant Programmable PKC Coprocessor for Embedded Applications","authors":"N. Mentens, K. Sakiyama, L. Batina, B. Preneel, I. Verbauwhede","doi":"10.1109/ICSAMOS.2007.4285751","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285751","url":null,"abstract":"This paper describes the design of a programmable coprocessor for public key cryptography (PKC) on an FPGA. The implementation provides a very broad range of functions together with countermeasures against side-channel analysis (SCA) attacks. The functions are implemented in a hierarchical manner, where all levels are accessible by the user. This makes the coprocessor very flexible and particularly suitable to be used in embedded environments where the border between hardware and software needs to be decided depending on the application. Especially for RSA, the resulting implementation on an XC3S5000 FPGA, from the low-cost Spartan series of Xilinx, shows comparable performance figures compared to the state-of- the-art in PKC coprocessors.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134008994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Design Space Exploration of Media Processors: A Parameterized Scheduler 媒体处理器的设计空间探索:一个参数化调度程序
G. P. Vayá, J. Martín-Langerwerf, Piriya Taptimthong, P. Pirsch
This paper describes an enhanced list scheduling algorithm used on a parameterized assembler. The assembler, which is configurable in terms of architectural parameters, is used on a new environment system for exploring and optimizing VLIW architectures for multimedia applications. A generic VLIW architecture with a novel register file structure is used as a base architecture. The proposed scheduling algorithm includes sophisticated features. A backtracking technique allows to undo inappropriate scheduling decisions, while an advanced resource conflict function allows to work with different VLIW architecture configurations. Moreover, local register allocation in conjunction with the instruction scheduling process is also implemented for obtaining better code compaction. Two different multimedia tasks are implemented to check the correctness of the generated code for different architecture configurations. The code compaction efficiency, when scheduling these applications for different VLIW architecture configurations with a partitioned register file and limited number of functional units, reaches up to 94% of the compaction efficiency for the same configuration with an unconstrained register file and unlimited number of functional units.
本文介绍了一种用于参数化汇编器的增强型列表调度算法。该汇编器可根据体系结构参数进行配置,并用于一个新的环境系统,用于探索和优化多媒体应用程序的VLIW体系结构。使用具有新颖寄存器文件结构的通用VLIW体系结构作为基本体系结构。所提出的调度算法包含复杂的特征。回溯技术允许撤消不适当的调度决策,而高级资源冲突功能允许使用不同的VLIW体系结构配置。此外,为了获得更好的代码压缩,还实现了与指令调度过程相结合的本地寄存器分配。实现了两个不同的多媒体任务来检查针对不同架构配置生成的代码的正确性。当使用分区的寄存器文件和有限数量的功能单元为不同的VLIW体系结构配置调度这些应用程序时,代码压缩效率达到具有不受约束的寄存器文件和无限数量的功能单元的相同配置的压缩效率的94%。
{"title":"Design Space Exploration of Media Processors: A Parameterized Scheduler","authors":"G. P. Vayá, J. Martín-Langerwerf, Piriya Taptimthong, P. Pirsch","doi":"10.1109/ICSAMOS.2007.4285732","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285732","url":null,"abstract":"This paper describes an enhanced list scheduling algorithm used on a parameterized assembler. The assembler, which is configurable in terms of architectural parameters, is used on a new environment system for exploring and optimizing VLIW architectures for multimedia applications. A generic VLIW architecture with a novel register file structure is used as a base architecture. The proposed scheduling algorithm includes sophisticated features. A backtracking technique allows to undo inappropriate scheduling decisions, while an advanced resource conflict function allows to work with different VLIW architecture configurations. Moreover, local register allocation in conjunction with the instruction scheduling process is also implemented for obtaining better code compaction. Two different multimedia tasks are implemented to check the correctness of the generated code for different architecture configurations. The code compaction efficiency, when scheduling these applications for different VLIW architecture configurations with a partitioned register file and limited number of functional units, reaches up to 94% of the compaction efficiency for the same configuration with an unconstrained register file and unlimited number of functional units.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129405888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
COSMOS: A System-Level Modelling and Simulation Framework for Coprocessor-Coupled Reconfigurable Systems COSMOS:协处理器耦合可重构系统的系统级建模与仿真框架
Kehuai Wu, J. Madsen
Dynamically reconfigurable systems demand complicated run-time management. Due to resource constraints and reconfiguration latencies, efficient reconfiguration strategies that can reduce the overhead cost of dynamic reconfiguration need to be studied. In this paper, we i) propose a reconfigurable task model which extends the classical real-time task model to support the additional states and latencies needed to capture dynamically reconfigurable behavior, ii) propose a coprocessor- coupled reconfigurable architecture which has hardware runtime support for task execution, task reallocation and resource management, and iii) present a SystemC based framework to model and simulate coprocessor-coupled reconfigurable systems. We illustrate how COSMOS may be used to capture the dynamic behavior of such systems and emphasize the need for capturing the system aspects of such systems in order to deal with future design challenges of dynamically reconfigurable systems.
动态可重构系统需要复杂的运行时管理。由于资源约束和重构延迟,需要研究有效的重构策略,以降低动态重构的开销成本。在本文中,我们i)提出了一个可重构任务模型,它扩展了经典的实时任务模型,以支持捕获动态可重构行为所需的额外状态和延迟;ii)提出了一个协处理器耦合的可重构架构,该架构具有硬件运行时对任务执行、任务重新分配和资源管理的支持;iii)提出了一个基于SystemC的框架来建模和模拟协处理器耦合的可重构系统。我们说明了如何使用COSMOS来捕获此类系统的动态行为,并强调需要捕获此类系统的系统方面,以便处理动态可重构系统的未来设计挑战。
{"title":"COSMOS: A System-Level Modelling and Simulation Framework for Coprocessor-Coupled Reconfigurable Systems","authors":"Kehuai Wu, J. Madsen","doi":"10.1109/ICSAMOS.2007.4285743","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285743","url":null,"abstract":"Dynamically reconfigurable systems demand complicated run-time management. Due to resource constraints and reconfiguration latencies, efficient reconfiguration strategies that can reduce the overhead cost of dynamic reconfiguration need to be studied. In this paper, we i) propose a reconfigurable task model which extends the classical real-time task model to support the additional states and latencies needed to capture dynamically reconfigurable behavior, ii) propose a coprocessor- coupled reconfigurable architecture which has hardware runtime support for task execution, task reallocation and resource management, and iii) present a SystemC based framework to model and simulate coprocessor-coupled reconfigurable systems. We illustrate how COSMOS may be used to capture the dynamic behavior of such systems and emphasize the need for capturing the system aspects of such systems in order to deal with future design challenges of dynamically reconfigurable systems.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127287865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The ARISE Reconfigurable Instruction Set Extensions Framework ARISE可重构指令集扩展框架
N. Vassiliadis, G. Theodoridis, S. Nikolaidis
In this paper, we introduce the ARISE framework for the systematic extension of typical processors with the necessary infrastructure to support arbitrary number and type of reconfigurable hardware units. ARISE extends the micro-architecture of the processor with an interface to allow the coupling of the hardware units. Furthermore, the instruction set of the processor is extended with instructions which expose to the programmer/compiler the full control of the interface. This control includes the configuration of operations on the hardware units, execution of these operations, and communication of data between the processor and the units. The new instructions are incorporated without the need to redesign the processor instruction set architecture. To evaluate our proposal a model of an ARISE extended MIPS processor has been designed. Using a turbodecoder algorithm as benchmarking application a simulation of the ARISE model has been performed. Performance results show impressive application speedups up to times7.5.
在本文中,我们介绍了用于系统扩展典型处理器的ARISE框架,该框架具有必要的基础设施,以支持任意数量和类型的可重构硬件单元。ARISE扩展了处理器的微体系结构,提供了一个接口,允许硬件单元的耦合。此外,处理器的指令集扩展了一些指令,这些指令向程序员/编译器公开了对接口的完全控制。这种控制包括硬件单元上的操作配置、这些操作的执行以及处理器和单元之间的数据通信。合并新指令无需重新设计处理器指令集体系结构。为了评估我们的建议,我们设计了一个扩展MIPS处理器的模型。采用涡轮解码器算法作为基准应用,对ARISE模型进行了仿真。性能结果显示令人印象深刻的应用程序加速高达7.5倍。
{"title":"The ARISE Reconfigurable Instruction Set Extensions Framework","authors":"N. Vassiliadis, G. Theodoridis, S. Nikolaidis","doi":"10.1109/ICSAMOS.2007.4285746","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285746","url":null,"abstract":"In this paper, we introduce the ARISE framework for the systematic extension of typical processors with the necessary infrastructure to support arbitrary number and type of reconfigurable hardware units. ARISE extends the micro-architecture of the processor with an interface to allow the coupling of the hardware units. Furthermore, the instruction set of the processor is extended with instructions which expose to the programmer/compiler the full control of the interface. This control includes the configuration of operations on the hardware units, execution of these operations, and communication of data between the processor and the units. The new instructions are incorporated without the need to redesign the processor instruction set architecture. To evaluate our proposal a model of an ARISE extended MIPS processor has been designed. Using a turbodecoder algorithm as benchmarking application a simulation of the ARISE model has been performed. Performance results show impressive application speedups up to times7.5.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126730004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Automatic Bus Matrix Synthesis based on Hardware Interface Selection for Fast Communication Design Space Exploration 基于硬件接口选择的快速通信设计空间探索中的总线矩阵自动合成
Ganghee Lee, Seokhyun Lee, Yongjin Ahn, Kiyoung Choi
In this paper, we present an automated bus matrix synthesis flow for efficient system-on-chip communication design space exploration at the transaction level. Especially, we consider hardware interface design, since it affects overall system cost and performance. Depending on the bus interface, a hardware block can be a master or a slave. We propose a method to solve such hardware interface selection problem by analyzing communication behavior statically. In addition, in order to explore communication design space fast, we automatically generate transaction level models for the hardware blocks according to the hardware interface selection. The synthesis result is verified by transaction level simulation with a commercial tool. We give experimental results with JPEG encoder and H.264 encoder to demonstrate the efficiency of the proposed method. The results show that with our automated synthesis flow, the designer can easily and quickly obtain better communication designs through fast design space exploration. More specifically, our hardware interface selection technique is successful in achieving reduction of area of bus matrix by 41.43% with 0.58% performance overhead on average compared to the case of maximum performance.
在本文中,我们提出了一个自动总线矩阵合成流程,用于在事务级进行有效的片上系统通信设计空间探索。特别是,我们考虑硬件接口设计,因为它影响整个系统的成本和性能。根据总线接口的不同,硬件块可以是主设备或从设备。本文提出了一种通过静态分析通信行为来解决硬件接口选择问题的方法。此外,为了快速探索通信设计空间,我们根据硬件接口的选择,自动生成硬件模块的事务级模型。利用商业工具对合成结果进行了事务级仿真验证。以JPEG编码器和H.264编码器为实验对象,验证了该方法的有效性。结果表明,利用我们的自动化合成流程,设计人员可以通过快速的设计空间探索,轻松快速地获得更好的通信设计。更具体地说,与性能最大化的情况相比,我们的硬件接口选择技术成功地将总线矩阵的面积减少了41.43%,平均性能开销为0.58%。
{"title":"Automatic Bus Matrix Synthesis based on Hardware Interface Selection for Fast Communication Design Space Exploration","authors":"Ganghee Lee, Seokhyun Lee, Yongjin Ahn, Kiyoung Choi","doi":"10.1109/ICSAMOS.2007.4285733","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285733","url":null,"abstract":"In this paper, we present an automated bus matrix synthesis flow for efficient system-on-chip communication design space exploration at the transaction level. Especially, we consider hardware interface design, since it affects overall system cost and performance. Depending on the bus interface, a hardware block can be a master or a slave. We propose a method to solve such hardware interface selection problem by analyzing communication behavior statically. In addition, in order to explore communication design space fast, we automatically generate transaction level models for the hardware blocks according to the hardware interface selection. The synthesis result is verified by transaction level simulation with a commercial tool. We give experimental results with JPEG encoder and H.264 encoder to demonstrate the efficiency of the proposed method. The results show that with our automated synthesis flow, the designer can easily and quickly obtain better communication designs through fast design space exploration. More specifically, our hardware interface selection technique is successful in achieving reduction of area of bus matrix by 41.43% with 0.58% performance overhead on average compared to the case of maximum performance.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125276514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
On the Problem of Minimizing Workload Execution Time in SMT Processors SMT处理器中工作负载执行时间最小化问题研究
F. Cazorla, E. Fernández, P. Knijnenburg, Alex Ramírez, R. Sakellariou, M. Valero
Most research work on (simultaneous multithreading processors) SMTs focuses on improving throughput and/or fairness, or on prioritizing some threads over others in a workload. In this paper, we discuss a new problem not previously addressed in the SMT literature. We call this problem workload execution time (WET) minimization. It consists of reducing the total execution time of all threads in a workload. This problem arises in parallel applications, where it is common for a single master thread to spawn several child jobs. The master job cannot continue until all child jobs have finished. Reducing the overall execution time is important to speedup the application. This paper is a first step in analyzing this problem. First, we analyze the WET provided by the best fetch policies turned at improving throughput/fairness. We demonstrate that these policies achieve less than optimum performance. We show that, on average, for the workloads evaluated in this paper, there is space for improvement of up to 18 percentage points. It follows that novel mechanisms trying to reduce WET are required to speedup parallel applications.
大多数关于(同步多线程处理器)smt的研究工作集中在提高吞吐量和/或公平性,或者在工作负载中优先考虑某些线程。在本文中,我们讨论了一个以前在SMT文献中没有提到的新问题。我们将此问题称为工作负载执行时间最小化。它包括减少工作负载中所有线程的总执行时间。这个问题出现在并行应用程序中,其中一个主线程通常会生成几个子作业。在所有子作业完成之前,主作业无法继续。减少总体执行时间对于加速应用程序非常重要。本文是分析这一问题的第一步。首先,我们分析了用于提高吞吐量/公平性的最佳获取策略所提供的WET。我们证明了这些策略的性能不是最优的。我们表明,平均而言,对于本文中评估的工作负载,有高达18个百分点的改进空间。因此,为了加速并行应用程序,需要尝试减少WET的新机制。
{"title":"On the Problem of Minimizing Workload Execution Time in SMT Processors","authors":"F. Cazorla, E. Fernández, P. Knijnenburg, Alex Ramírez, R. Sakellariou, M. Valero","doi":"10.1109/ICSAMOS.2007.4285735","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285735","url":null,"abstract":"Most research work on (simultaneous multithreading processors) SMTs focuses on improving throughput and/or fairness, or on prioritizing some threads over others in a workload. In this paper, we discuss a new problem not previously addressed in the SMT literature. We call this problem workload execution time (WET) minimization. It consists of reducing the total execution time of all threads in a workload. This problem arises in parallel applications, where it is common for a single master thread to spawn several child jobs. The master job cannot continue until all child jobs have finished. Reducing the overall execution time is important to speedup the application. This paper is a first step in analyzing this problem. First, we analyze the WET provided by the best fetch policies turned at improving throughput/fairness. We demonstrate that these policies achieve less than optimum performance. We show that, on average, for the workloads evaluated in this paper, there is space for improvement of up to 18 percentage points. It follows that novel mechanisms trying to reduce WET are required to speedup parallel applications.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122424586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An Interrupt Controller for FPGA-based Multiprocessors 基于fpga的多处理器中断控制器
Antonino Tumeo, Marco Branca, L. Camerini, M. Monchiero, G. Palermo, Fabrizio Ferrandi, D. Sciuto
Interrupt-based programming is widely used for interfacing a processor with peripherals and allowing software threads to interact. Many hardware/software architectures have been proposed in the past to support this kind of programming practice. In the context of FPGA-based multiprocessors this topic has not been thoroughly faced yet. This paper presents the architecture of an interrupt controller for a FPGA-based multiprocessor composed of standard off-of-the-shelf softcores. The main feature of this device is to distribute multiple interrupts across the cores of a multiprocessor. In addition, our architecture supports several advanced features like booking, broadcasting and inter-processor interrupt. On the top of this hardware layer, we provide a software library to effectively exploit this mechanism. We realized a prototype of this system. Our experiments show that our interrupt controller efficiently distributes multiple interrupts on the system.
基于中断的编程广泛用于处理器与外设之间的接口,并允许软件线程进行交互。过去已经提出了许多硬件/软件架构来支持这种编程实践。在基于fpga的多处理器的背景下,这个话题还没有被彻底地面对。本文介绍了一种基于fpga的多处理器中断控制器的体系结构,该多处理器由标准的现成软核组成。该设备的主要特点是在多处理器的核心上分布多个中断。此外,我们的架构支持几个高级功能,如预订、广播和处理器间中断。在这个硬件层的顶层,我们提供了一个软件库来有效地利用这种机制。我们实现了这个系统的原型。实验表明,该中断控制器能有效地将多个中断分配到系统中。
{"title":"An Interrupt Controller for FPGA-based Multiprocessors","authors":"Antonino Tumeo, Marco Branca, L. Camerini, M. Monchiero, G. Palermo, Fabrizio Ferrandi, D. Sciuto","doi":"10.1109/ICSAMOS.2007.4285737","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285737","url":null,"abstract":"Interrupt-based programming is widely used for interfacing a processor with peripherals and allowing software threads to interact. Many hardware/software architectures have been proposed in the past to support this kind of programming practice. In the context of FPGA-based multiprocessors this topic has not been thoroughly faced yet. This paper presents the architecture of an interrupt controller for a FPGA-based multiprocessor composed of standard off-of-the-shelf softcores. The main feature of this device is to distribute multiple interrupts across the cores of a multiprocessor. In addition, our architecture supports several advanced features like booking, broadcasting and inter-processor interrupt. On the top of this hardware layer, we provide a software library to effectively exploit this mechanism. We realized a prototype of this system. Our experiments show that our interrupt controller efficiently distributes multiple interrupts on the system.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133445421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
A Hardware/Software Architecture for Tool Path Computation. An Application to Turning Lathe Machining 刀具轨迹计算的硬件/软件体系结构。在车床加工中的应用
S. Cuenca, A. Martínez-Álvarez, A. Jimeno-Morenilla, J. L. Sánchez
Tool path generation is one of the most complex problems in computer aided manufacturing. Although some efficient strategies have been developed, most of them are only useful for standard machining. The algorithm called virtual digitizing avoids this problem by its own definition but its computing cost is high and makes it difficult for being integrated in standard machining in order to adopt the new ISO standard 14649. Presented in the paper there is a virtual digitizing hardware/software architecture that takes advantage of field programmable gate arrays (FPGAs) to improve the algorithm efficiency and to meet the actual restrictions of the traditional computer numeric control systems at the same time. In order to evaluate the architecture, a prototype was implemented using a commercial reconfigurable platform integrated within a CNC lathe for shoe last machining. The performance of the system for tool path generation was measured for different trajectory and surface precisions using a database of real shoe models. The experiments show a significant speedup for all the cases and maintaining the error of the results below the maximum allowed.
刀具轨迹生成是计算机辅助制造中最复杂的问题之一。虽然已经开发出一些有效的加工策略,但大多数策略仅适用于标准加工。虚拟数字化算法以其自身的定义避免了这一问题,但其计算成本高,难以集成到标准加工中以采用新的ISO 14649标准。本文提出了一种利用现场可编程门阵列(fpga)提高算法效率的虚拟数字化软硬件体系结构,同时满足了传统计算机数控系统的实际限制。为了评估该体系结构,使用集成在CNC车床中的商业可重构平台实现了原型,用于鞋楦加工。利用真实鞋型数据库,对不同轨迹和表面精度下的刀具轨迹生成系统的性能进行了测试。实验表明,在所有情况下都有显著的加速,并且结果的误差保持在允许的最大误差以下。
{"title":"A Hardware/Software Architecture for Tool Path Computation. An Application to Turning Lathe Machining","authors":"S. Cuenca, A. Martínez-Álvarez, A. Jimeno-Morenilla, J. L. Sánchez","doi":"10.1109/ICSAMOS.2007.4285739","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285739","url":null,"abstract":"Tool path generation is one of the most complex problems in computer aided manufacturing. Although some efficient strategies have been developed, most of them are only useful for standard machining. The algorithm called virtual digitizing avoids this problem by its own definition but its computing cost is high and makes it difficult for being integrated in standard machining in order to adopt the new ISO standard 14649. Presented in the paper there is a virtual digitizing hardware/software architecture that takes advantage of field programmable gate arrays (FPGAs) to improve the algorithm efficiency and to meet the actual restrictions of the traditional computer numeric control systems at the same time. In order to evaluate the architecture, a prototype was implemented using a commercial reconfigurable platform integrated within a CNC lathe for shoe last machining. The performance of the system for tool path generation was measured for different trajectory and surface precisions using a database of real shoe models. The experiments show a significant speedup for all the cases and maintaining the error of the results below the maximum allowed.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115806635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Systematic Data Structure Exploration of Multimedia and Network Applications realized Embedded Systems 嵌入式系统中多媒体和网络应用的系统数据结构探索
Lazaros Papadopoulos, Christos Baloukas, N. Zompakis, D. Soudris
In the last years, there is a trend towards network and multimedia applications to be implemented in portable devices. These applications usually contain complex dynamic data structures. The appropriate selection of the dynamic data type (DDT) combination of an application affects the performance and the energy consumption of the whole system. Thus, DDT exploration methodology is used to perform tradeoffs between design factors, such as performance and energy consumption. In this paper we provide a new approach to the DDT exploration procedure, based on a new library of DDTs which remedies the limitations of an existing and allows the DDT optimization of a wide range of application domains. Using the new library, we performed DDT exploration in network and multimedia benchmarks and achieved performance and energy consumption improvements up to 85% and 43% respectively.
在过去的几年里,有一种趋势是在便携式设备上实现网络和多媒体应用程序。这些应用程序通常包含复杂的动态数据结构。应用程序的动态数据类型(DDT)组合的适当选择会影响整个系统的性能和能耗。因此,DDT勘探方法用于在设计因素(如性能和能耗)之间进行权衡。在本文中,我们提供了一种基于新的DDT库的DDT探索程序的新方法,该库弥补了现有DDT库的局限性,并允许对广泛的应用领域进行DDT优化。使用新的库,我们在网络和多媒体基准测试中进行了DDT探索,分别实现了85%和43%的性能和能耗改进。
{"title":"Systematic Data Structure Exploration of Multimedia and Network Applications realized Embedded Systems","authors":"Lazaros Papadopoulos, Christos Baloukas, N. Zompakis, D. Soudris","doi":"10.1109/ICSAMOS.2007.4285734","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285734","url":null,"abstract":"In the last years, there is a trend towards network and multimedia applications to be implemented in portable devices. These applications usually contain complex dynamic data structures. The appropriate selection of the dynamic data type (DDT) combination of an application affects the performance and the energy consumption of the whole system. Thus, DDT exploration methodology is used to perform tradeoffs between design factors, such as performance and energy consumption. In this paper we provide a new approach to the DDT exploration procedure, based on a new library of DDTs which remedies the limitations of an existing and allows the DDT optimization of a wide range of application domains. Using the new library, we performed DDT exploration in network and multimedia benchmarks and achieved performance and energy consumption improvements up to 85% and 43% respectively.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127272924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Design Space Exploration of Configuration Manager for Network Processing Applications 网络处理应用配置管理器的设计空间探索
C. Kachris, S. Vassiliadis
Current FPGAs provide a powerful platform for network processing applications. The main challenge is the exploitation of the reconfiguration to increase the performance of the system. In this paper, a design space exploration framework is presented to design a reconfigurable platform for multi-service network processing applications. An integrated design flow is presented from the system level analytical design to the implementation level. Furthermore, the design of an efficient configuration manager is presented in which the platform adaptation is performed for optimum speedup with minimum overhead taking into account the reconfiguration overhead and the network characteristics (packet type distribution, network stability). Finally, a case study is presented in which the platform is used to process three network flows with different processing requirements.
目前的fpga为网络处理应用提供了一个强大的平台。主要的挑战是利用重新配置来提高系统的性能。本文提出了一个设计空间探索框架,为多业务网络处理应用设计一个可重构平台。提出了从系统级分析设计到实现级的集成设计流程。此外,提出了一种高效配置管理器的设计,其中考虑到重新配置开销和网络特性(数据包类型分布,网络稳定性),以最小的开销执行平台适应以获得最佳加速。最后,给出了一个案例研究,使用该平台对三个不同处理要求的网络流进行了处理。
{"title":"Design Space Exploration of Configuration Manager for Network Processing Applications","authors":"C. Kachris, S. Vassiliadis","doi":"10.1109/ICSAMOS.2007.4285731","DOIUrl":"https://doi.org/10.1109/ICSAMOS.2007.4285731","url":null,"abstract":"Current FPGAs provide a powerful platform for network processing applications. The main challenge is the exploitation of the reconfiguration to increase the performance of the system. In this paper, a design space exploration framework is presented to design a reconfigurable platform for multi-service network processing applications. An integrated design flow is presented from the system level analytical design to the implementation level. Furthermore, the design of an efficient configuration manager is presented in which the platform adaptation is performed for optimum speedup with minimum overhead taking into account the reconfiguration overhead and the network characteristics (packet type distribution, network stability). Finally, a case study is presented in which the platform is used to process three network flows with different processing requirements.","PeriodicalId":106933,"journal":{"name":"2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122258702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1