首页 > 最新文献

Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)最新文献

英文 中文
Pilot studies of wireless sensor networks: Practical experiences 无线传感器网络的初步研究:实践经验
Teemu Laukkarinen, J. Suhonen, T. Hämäläinen, Marko Hännikäinen
For enabling successful field pilots of Wireless Sensor Network (WSN) applications, the network reliability and prototype testing become limiting factors. Application pilot studies need to operate end-to-end, covering the physical durability of devices, embedded software, and infrastructure interfaces and data collection. This paper summarizes our pilot study experiences, and what tools and practices were required. Six lessons are proposed: a systematic pilot template results straightforward pilot completion; shared WSN infrastructure reduces labor; tailored embedded software testing tools are needed; the pilot must be prepared carefully; the WSN technology must be usable for research partners; and the pilot must be maintained and maintenance tools are required in large scale pilots. Our experiences base on over 20 pilot studies and over 1000 deployed devices. This paper describes 11 main pilots, which utilize from 10 to 377 devices per pilot.
为了实现无线传感器网络(WSN)应用的成功现场试验,网络可靠性和原型测试成为限制因素。应用程序试点研究需要端到端的操作,涵盖设备的物理耐久性、嵌入式软件、基础设施接口和数据收集。本文总结了我们的试点研究经验,以及需要哪些工具和实践。提出了六个经验教训:系统的试点模板可以直接完成试点;共享WSN基础设施减少劳动;需要定制的嵌入式软件测试工具;引航员必须认真准备;WSN技术必须可供研究伙伴使用;在大型飞行员中,必须对飞行员进行维护,并且需要维护工具。我们的经验基于20多个试点研究和1000多个已部署设备。本文描述了11个主要导频,每个导频使用10到377个器件。
{"title":"Pilot studies of wireless sensor networks: Practical experiences","authors":"Teemu Laukkarinen, J. Suhonen, T. Hämäläinen, Marko Hännikäinen","doi":"10.1109/DASIP.2011.6136867","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136867","url":null,"abstract":"For enabling successful field pilots of Wireless Sensor Network (WSN) applications, the network reliability and prototype testing become limiting factors. Application pilot studies need to operate end-to-end, covering the physical durability of devices, embedded software, and infrastructure interfaces and data collection. This paper summarizes our pilot study experiences, and what tools and practices were required. Six lessons are proposed: a systematic pilot template results straightforward pilot completion; shared WSN infrastructure reduces labor; tailored embedded software testing tools are needed; the pilot must be prepared carefully; the WSN technology must be usable for research partners; and the pilot must be maintained and maintenance tools are required in large scale pilots. Our experiences base on over 20 pilot studies and over 1000 deployed devices. This paper describes 11 main pilots, which utilize from 10 to 377 devices per pilot.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"97 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120884624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Range-free algorithm for energy-efficient indoor localization in Wireless Sensor Networks 无线传感器网络中节能室内定位的无距离算法
Ville Kaseva, T. Hämäläinen, Marko Hännikäinen
Wireless Sensor Networks (WSNs) form an attractive technology for ubiquitous indoor localization. The localized node lifetime is maximized by using energy-efficient radios and minimizing their active time. However, the most low-cost and low-power radios do not include Received Signal Strength Indicator (RSSI) functionality commonly used for RF-based localization. In this paper, we present a range-free localization algorithm for localized nodes with minimized radio communication and radios without RSSI. The low complexity of the algorithm enables implementation in resource-constrained hardware for in-network localization. We experimented the algorithm using a real WSN implementation. In room-level localization, the area was resolved correctly 96% of the time. The maximum point-based error was 8.70 m. The corresponding values for sub-room-level localization are 100% and 4.20 m. The prototype implementation consumed 1900 B of program memory. The data memory consumption varied from 18 B to 180 B, and the power consumption from 345 μW to 2.48 mW depending on the amount of localization data.
无线传感器网络(WSNs)形成了一种有吸引力的无处不在的室内定位技术。通过使用节能无线电并最小化其活动时间,使局部节点寿命最大化。然而,大多数低成本和低功耗无线电不包括通常用于基于射频定位的接收信号强度指示器(RSSI)功能。在本文中,我们提出了一种无距离定位算法,用于最小化无线电通信和无RSSI无线电的定位节点。该算法的低复杂度使其能够在资源受限的硬件上实现网络内定位。我们使用一个真实的WSN实现来实验该算法。在房间级别的定位中,96%的时间区域被正确解析。基于点的最大误差为8.70 m。子室级定位对应值为100%和4.20 m。原型实现消耗了1900b的程序内存。根据定位数据量的不同,数据存储消耗在18b ~ 180b之间,功耗在345 μW ~ 2.48 mW之间。
{"title":"Range-free algorithm for energy-efficient indoor localization in Wireless Sensor Networks","authors":"Ville Kaseva, T. Hämäläinen, Marko Hännikäinen","doi":"10.1109/DASIP.2011.6136887","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136887","url":null,"abstract":"Wireless Sensor Networks (WSNs) form an attractive technology for ubiquitous indoor localization. The localized node lifetime is maximized by using energy-efficient radios and minimizing their active time. However, the most low-cost and low-power radios do not include Received Signal Strength Indicator (RSSI) functionality commonly used for RF-based localization. In this paper, we present a range-free localization algorithm for localized nodes with minimized radio communication and radios without RSSI. The low complexity of the algorithm enables implementation in resource-constrained hardware for in-network localization. We experimented the algorithm using a real WSN implementation. In room-level localization, the area was resolved correctly 96% of the time. The maximum point-based error was 8.70 m. The corresponding values for sub-room-level localization are 100% and 4.20 m. The prototype implementation consumed 1900 B of program memory. The data memory consumption varied from 18 B to 180 B, and the power consumption from 345 μW to 2.48 mW depending on the amount of localization data.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116503240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Flexible VLIW processor based on FPGA for real-time image processing 基于FPGA的柔性VLIW处理器,用于实时图像处理
V. Brost, Charles Meunier, D. Saptono, Fan Yang
Modern FPGA chips, with their larger memory capacity and reconfigurability potential, are opening new frontiers in rapid prototyping of embedded systems. With the advent of high density FPGAs it is now possible to implement a high performance Very Long Instruction Word (VLIW) processor core in an FPGA. With VLIW architecture, the processor effectiveness depends on the ability of compilers to provide sufficient Instruction Level Parallelism (ILP) from program code. This paper describes research result about enabling the VLIW processor model for real-time processing applications by exploiting FPGA technology. Our goals are to keep the flexibility of processors in order to shorten the development cycle, and to use the powerful FPGA resources in order to increase real-time performance. We present a flexible VLIW VHDL processor model with a variable instruction set and a customizable architecture which allow exploiting intrinsic parallelism of a target application using advanced compiler technology and implementing it in an optimal manner on FPGA. Some common algorithms of image processing were tested and validated on an FPGA Virtex-6 based board using the proposed development cycle. Our approach applies some criteria for co-design tools: flexibility, modularity, performance, and reusability.
现代FPGA芯片具有更大的存储容量和可重构性潜力,正在为嵌入式系统的快速原型设计开辟新的领域。随着高密度FPGA的出现,现在可以在FPGA中实现高性能的甚长指令字(VLIW)处理器核心。对于VLIW体系结构,处理器的有效性取决于编译器从程序代码中提供足够的指令级并行性(ILP)的能力。本文介绍了利用FPGA技术实现VLIW处理器模型用于实时处理应用的研究成果。我们的目标是保持处理器的灵活性,以缩短开发周期,并使用强大的FPGA资源,以提高实时性能。我们提出了一种灵活的VLIW VHDL处理器模型,该模型具有可变指令集和可定制的架构,允许使用先进的编译器技术利用目标应用程序的内在并行性,并以最佳方式在FPGA上实现它。利用所提出的开发周期,在基于Virtex-6的FPGA板上对一些常见的图像处理算法进行了测试和验证。我们的方法应用了协同设计工具的一些标准:灵活性、模块化、性能和可重用性。
{"title":"Flexible VLIW processor based on FPGA for real-time image processing","authors":"V. Brost, Charles Meunier, D. Saptono, Fan Yang","doi":"10.1109/DASIP.2011.6136855","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136855","url":null,"abstract":"Modern FPGA chips, with their larger memory capacity and reconfigurability potential, are opening new frontiers in rapid prototyping of embedded systems. With the advent of high density FPGAs it is now possible to implement a high performance Very Long Instruction Word (VLIW) processor core in an FPGA. With VLIW architecture, the processor effectiveness depends on the ability of compilers to provide sufficient Instruction Level Parallelism (ILP) from program code. This paper describes research result about enabling the VLIW processor model for real-time processing applications by exploiting FPGA technology. Our goals are to keep the flexibility of processors in order to shorten the development cycle, and to use the powerful FPGA resources in order to increase real-time performance. We present a flexible VLIW VHDL processor model with a variable instruction set and a customizable architecture which allow exploiting intrinsic parallelism of a target application using advanced compiler technology and implementing it in an optimal manner on FPGA. Some common algorithms of image processing were tested and validated on an FPGA Virtex-6 based board using the proposed development cycle. Our approach applies some criteria for co-design tools: flexibility, modularity, performance, and reusability.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132613538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Efficient FFT pruning algorithm for non-contiguous OFDM systems 非连续OFDM系统的高效FFT剪枝算法
Roberto Airoldi, F. Garzia, J. Nurmi
This paper presents the study of an efficient trade-off between memory requirements and performance for the implementation of FFT pruning algorithm. FFT pruning algorithm is utilized in NC-OFDM systems to simplify the FFT algorithm complexity in presence of subcarrier sparseness. State-of-the-art implementations offer good performance with the drawback of high resources utilization, i.e. data memory for storage of the configuration matrix. In this work we introduce the partial pruning algorithm as an efficient way to implement FFT pruning, obtaining a balanced trade-off between performance and resources allocation. Cycle accurate simulation results showed that even in presence of low-medium input sparseness levels the proposed algorithm can reduce the computation time of at least a 20% factor, when compared to traditional FFT algorithms and, at the same time, decreases the memory utilization up to a 20% factor over state of the art pruning algorithms.
本文提出了在实现FFT剪枝算法的内存需求和性能之间进行有效权衡的研究。在NC-OFDM系统中采用FFT剪枝算法来简化存在子载波稀疏性的FFT算法复杂度。最先进的实现提供了良好的性能,但缺点是资源利用率高,即用于存储配置矩阵的数据内存。在这项工作中,我们介绍了部分剪枝算法作为实现FFT剪枝的有效方法,在性能和资源分配之间取得了平衡。周期精确的仿真结果表明,与传统的FFT算法相比,即使存在低-中等输入稀疏度水平,所提出的算法也可以将计算时间减少至少20%,同时,与最先进的修剪算法相比,将内存利用率降低高达20%。
{"title":"Efficient FFT pruning algorithm for non-contiguous OFDM systems","authors":"Roberto Airoldi, F. Garzia, J. Nurmi","doi":"10.1109/DASIP.2011.6136869","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136869","url":null,"abstract":"This paper presents the study of an efficient trade-off between memory requirements and performance for the implementation of FFT pruning algorithm. FFT pruning algorithm is utilized in NC-OFDM systems to simplify the FFT algorithm complexity in presence of subcarrier sparseness. State-of-the-art implementations offer good performance with the drawback of high resources utilization, i.e. data memory for storage of the configuration matrix. In this work we introduce the partial pruning algorithm as an efficient way to implement FFT pruning, obtaining a balanced trade-off between performance and resources allocation. Cycle accurate simulation results showed that even in presence of low-medium input sparseness levels the proposed algorithm can reduce the computation time of at least a 20% factor, when compared to traditional FFT algorithms and, at the same time, decreases the memory utilization up to a 20% factor over state of the art pruning algorithms.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131780501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
The Multi-Dataflow Composer tool: A runtime reconfigurable HDL platform composer 多数据流编写器工具:一个运行时可重构的HDL平台编写器
F. Palumbo, N. Carta, L. Raffo
Dataflow Model of Computation (D-MoC) is particularly suitable to close the gap between hardware architects and software developers. Leveraging on the combination of the D-MoC with a coarse-grained reconfigurable approach to hardware design, we propose a tool, the Multi-Dataflow Composer (MDC) tool, able to improve time-to-market of modern complex multi-purpose systems by allowing the derivation of HDL runtime reconfigurable platforms starting from the D-MoC models of the targeted set of applications. MDC tool has proven to provide a considerable on-chip area saving: the 82% of saving has been reached combining of different applications in the image processing domain, adopting a 90 nm CMOS technology. In future the MDC tool, with a very small integration effort, will also be extremely useful to create multi-standard codec platforms for MPEG RVC applications.
数据流计算模型(D-MoC)特别适合于缩小硬件架构师和软件开发人员之间的差距。利用D-MoC与粗粒度可重构硬件设计方法的结合,我们提出了一种工具,Multi-Dataflow Composer (MDC)工具,通过允许从目标应用程序集的D-MoC模型开始衍生HDL运行时可重构平台,可以缩短现代复杂多用途系统的上市时间。MDC工具已被证明可以节省相当大的片上面积:在图像处理领域,采用90纳米CMOS技术,结合不同的应用,节省了82%的面积。在未来,MDC工具,通过非常小的集成工作,也将非常有用,为MPEG RVC应用程序创建多标准编解码器平台。
{"title":"The Multi-Dataflow Composer tool: A runtime reconfigurable HDL platform composer","authors":"F. Palumbo, N. Carta, L. Raffo","doi":"10.1109/DASIP.2011.6136876","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136876","url":null,"abstract":"Dataflow Model of Computation (D-MoC) is particularly suitable to close the gap between hardware architects and software developers. Leveraging on the combination of the D-MoC with a coarse-grained reconfigurable approach to hardware design, we propose a tool, the Multi-Dataflow Composer (MDC) tool, able to improve time-to-market of modern complex multi-purpose systems by allowing the derivation of HDL runtime reconfigurable platforms starting from the D-MoC models of the targeted set of applications. MDC tool has proven to provide a considerable on-chip area saving: the 82% of saving has been reached combining of different applications in the image processing domain, adopting a 90 nm CMOS technology. In future the MDC tool, with a very small integration effort, will also be extremely useful to create multi-standard codec platforms for MPEG RVC applications.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"141 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132184834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Multiplier free filter bank based concept for blocker detection in LTE systems LTE系统中基于无乘法器滤波器组的阻塞检测概念
T. Schlechter
Power efficiency is an important issue in mobile communication systems. Especially for mobile user equipments, the energy budget, limited by a battery, has to be treated carefully. Despite this fact, quite an amount of energy is wasted in todays user equipments, as analog and digital frontend in communication systems are engineered for extracting the wanted signal from a spectral environment defined in the corresponding communication standards with their extremely tough requirements. In a real receiving process those requirements can typically be considered as dramatically less critical. Capturing the environmental transmission conditions and adapting the receiver architecture to the actual needs allows to save energy during the receiving process. An efficient architecture being able to fulfill this task for a typical Long Term Evolution scenario is desired and introduced in this paper. The development of a suitable filterchain is described and a complexity comparison to Fast Fourier Transformation based methods is given.
功率效率是移动通信系统中的一个重要问题。特别是对于移动用户设备,由于电池的限制,能量预算必须谨慎处理。尽管如此,在今天的用户设备中浪费了相当多的能量,因为通信系统中的模拟和数字前端设计用于从相应通信标准中定义的频谱环境中提取所需的信号,其要求极其苛刻。在实际的接收过程中,这些要求通常被认为不那么重要。捕获环境传输条件并使接收器架构适应实际需要,可以在接收过程中节省能源。我们需要一种高效的体系结构,能够在典型的长期演进场景中完成这项任务,本文对此进行了介绍。描述了一种合适的滤波链的发展,并与基于快速傅立叶变换的方法进行了复杂性比较。
{"title":"Multiplier free filter bank based concept for blocker detection in LTE systems","authors":"T. Schlechter","doi":"10.1109/DASIP.2011.6136893","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136893","url":null,"abstract":"Power efficiency is an important issue in mobile communication systems. Especially for mobile user equipments, the energy budget, limited by a battery, has to be treated carefully. Despite this fact, quite an amount of energy is wasted in todays user equipments, as analog and digital frontend in communication systems are engineered for extracting the wanted signal from a spectral environment defined in the corresponding communication standards with their extremely tough requirements. In a real receiving process those requirements can typically be considered as dramatically less critical. Capturing the environmental transmission conditions and adapting the receiver architecture to the actual needs allows to save energy during the receiving process. An efficient architecture being able to fulfill this task for a typical Long Term Evolution scenario is desired and introduced in this paper. The development of a suitable filterchain is described and a complexity comparison to Fast Fourier Transformation based methods is given.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124861944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An approach to self-learning multicore reconfiguration management applied on Robotic Vision 应用于机器人视觉的自学习多核重构管理方法
W. Stechele, J. Hartmann, E. Maehle
Robotic Vision combined with real-time control imposes challenging requirements on embedded computing nodes in robots, exhibiting strong variations in computational load due to dynamically changing activity profiles. Reconfigurable Multiprocessor System-on-Chip offers a solution by efficiently handling the robot's resources, but reconfiguration management seems challenging. The goal of this paper is to present first ideas on self-learning reconfiguration management for Reco nfigurable multicore computing nodes with dynamic reconfiguration of soft-core CPUs and HW accelerators, to support dynamically changing activity profiles in Robotic Vision scenarios.
机器人视觉与实时控制相结合,对机器人的嵌入式计算节点提出了具有挑战性的要求,由于动态变化的活动概况,计算负载表现出强烈的变化。可重构多处理器片上系统通过有效地处理机器人的资源提供了一种解决方案,但重新配置管理似乎具有挑战性。本文的目标是提出对具有软核cpu和硬件加速器动态重构的Reco可配置多核计算节点的自学习重构管理的初步想法,以支持机器人视觉场景中动态变化的活动概况。
{"title":"An approach to self-learning multicore reconfiguration management applied on Robotic Vision","authors":"W. Stechele, J. Hartmann, E. Maehle","doi":"10.1109/DASIP.2011.6136882","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136882","url":null,"abstract":"Robotic Vision combined with real-time control imposes challenging requirements on embedded computing nodes in robots, exhibiting strong variations in computational load due to dynamically changing activity profiles. Reconfigurable Multiprocessor System-on-Chip offers a solution by efficiently handling the robot's resources, but reconfiguration management seems challenging. The goal of this paper is to present first ideas on self-learning reconfiguration management for Reco nfigurable multicore computing nodes with dynamic reconfiguration of soft-core CPUs and HW accelerators, to support dynamically changing activity profiles in Robotic Vision scenarios.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130426670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application workload model generation methodologies for system-level design exploration 用于系统级设计探索的应用程序工作负载模型生成方法
Jukka Saastamoinen, J. Kreku
As most of the applications of embedded system products are realized in software, the performance estimation of software is crucial for successful system design. Significant part of the functionality of these applications is based on services provided by the underlying software libraries. Often used performance evaluation technique today is the system-level performance simulation of the applications and platforms using abstracted workload and execution platform models. The accuracy of the software performance results is dependent on how closely the application workload model reflects actual software as a whole. This paper presents a methodology which combines compiler based user code workload model generation with workload extraction of pre-compiled libraries, while exploiting an overall approach and execution platform model developed previously. Benefit of the proposed methodology compared to earlier solution is experimented using a set of benchmarks.
由于嵌入式系统产品的大部分应用都是在软件中实现的,因此软件的性能评估对系统的成功设计至关重要。这些应用程序的很大一部分功能是基于底层软件库提供的服务。目前常用的性能评估技术是使用抽象的工作负载和执行平台模型对应用程序和平台进行系统级性能模拟。软件性能结果的准确性依赖于应用程序工作负载模型作为一个整体反映实际软件的程度。本文提出了一种基于编译器的用户代码工作负载模型生成和预编译库工作负载提取相结合的方法,同时利用了已有的总体方法和执行平台模型。与先前的解决方案相比,所提出的方法的优点是使用一组基准进行实验。
{"title":"Application workload model generation methodologies for system-level design exploration","authors":"Jukka Saastamoinen, J. Kreku","doi":"10.1109/DASIP.2011.6136888","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136888","url":null,"abstract":"As most of the applications of embedded system products are realized in software, the performance estimation of software is crucial for successful system design. Significant part of the functionality of these applications is based on services provided by the underlying software libraries. Often used performance evaluation technique today is the system-level performance simulation of the applications and platforms using abstracted workload and execution platform models. The accuracy of the software performance results is dependent on how closely the application workload model reflects actual software as a whole. This paper presents a methodology which combines compiler based user code workload model generation with workload extraction of pre-compiled libraries, while exploiting an overall approach and execution platform model developed previously. Benefit of the proposed methodology compared to earlier solution is experimented using a set of benchmarks.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115603223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
High-level modelling and automatic generation of dynamicaly reconfigurable systems 动态可重构系统的高级建模和自动生成
G. Ochoa-Ruiz, E. Bourennane, H. Rabah, Ouassila Labbani
Dynamic Partial Reconfiguration (DPR) has been introduced in recent years as a method to increase the flexibility of FPGA designs. However, using DPR for building complex systems remains a daunting task. Recently, approaches based on MDE and UML MARTE standard have emerged which aim to simplify the design of complex SoCs. Moreover, with the recent standardization of the IP-XACT specification, there is an increasing interest to use it in MDE methodologies to ease system integration and to enable design flow automation. In this paper we propose an MARTE/MDE approach which exploits the capabilities of IP-XACT to model and automatically generate DPR SoC designs. In particular, our goal is the creation of the structural top level description of the system and to include DPR support in the used IP cores. The generated IP-XACT descriptions are transformed to obtain the files required as inputs by the EDK flow and then synthesized to generate the netlists used by the DPR flow. The methodology is demonstrated using two CODEC cores (CAVLC and VLC) into a MicroBlaze based DPR SoC.
动态部分重构(DPR)是近年来引入的一种提高FPGA设计灵活性的方法。然而,使用DPR构建复杂系统仍然是一项艰巨的任务。最近出现了基于MDE和UML MARTE标准的方法,旨在简化复杂soc的设计。此外,随着最近IP-XACT规范的标准化,在MDE方法中使用它来简化系统集成并实现设计流程自动化的兴趣越来越大。在本文中,我们提出了一种MARTE/MDE方法,该方法利用IP-XACT的功能来建模并自动生成DPR SoC设计。特别是,我们的目标是创建系统的结构化顶层描述,并在使用的IP核中包含DPR支持。生成的IP-XACT描述被转换为获得EDK流所需的输入文件,然后被合成为生成DPR流使用的网络列表。该方法使用两个CODEC核心(CAVLC和VLC)到基于MicroBlaze的DPR SoC中进行演示。
{"title":"High-level modelling and automatic generation of dynamicaly reconfigurable systems","authors":"G. Ochoa-Ruiz, E. Bourennane, H. Rabah, Ouassila Labbani","doi":"10.1109/DASIP.2011.6136900","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136900","url":null,"abstract":"Dynamic Partial Reconfiguration (DPR) has been introduced in recent years as a method to increase the flexibility of FPGA designs. However, using DPR for building complex systems remains a daunting task. Recently, approaches based on MDE and UML MARTE standard have emerged which aim to simplify the design of complex SoCs. Moreover, with the recent standardization of the IP-XACT specification, there is an increasing interest to use it in MDE methodologies to ease system integration and to enable design flow automation. In this paper we propose an MARTE/MDE approach which exploits the capabilities of IP-XACT to model and automatically generate DPR SoC designs. In particular, our goal is the creation of the structural top level description of the system and to include DPR support in the used IP cores. The generated IP-XACT descriptions are transformed to obtain the files required as inputs by the EDK flow and then synthesized to generate the netlists used by the DPR flow. The methodology is demonstrated using two CODEC cores (CAVLC and VLC) into a MicroBlaze based DPR SoC.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117163427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Hardware/software co-design of dataflow programs for reconfigurable hardware and multi-core platforms 可重构硬件和多核平台数据流程序的软硬件协同设计
Ghislain Roquier, E. Bezati, Richard Thavot, M. Mattavelli
The possibility of specifying both software and hardware components from a unified high-level description of an application is a very attractive design approach. However, despite the efforts spent for implementing such an approach using general purpose programming languages, it has not yet shown to be viable and efficient for complex designs. One of the reasons is that the sequential programming model does not naturally provide explicit and scalable parallelism and composability properties that effectively permits to build portable applications that can be efficiently mapped on different kind of heterogeneous platforms. Conversely dataflow programming is an approach that naturally provides explicit parallel programs with composability properties. This paper presents a methodology for the hardware/software co-design that enables, by direct synthesis of both hardware descriptions (HDL), software components (C/C++) and mutual interfaces, to generate an implementation of the application from an unique dataflow program, running onto heterogeneous architectures composed by reconfigurable hardware and multi-core processors. Experimental results based on the implementation of a JPEG codec onto an heterogeneous platform are also provided to show the capabilities and flexibility of the implementation approach.
从应用程序的统一高层描述中指定软件和硬件组件的可能性是一种非常有吸引力的设计方法。然而,尽管为使用通用编程语言实现这种方法付出了努力,但它尚未显示出对复杂设计的可行性和有效性。其中一个原因是,顺序编程模型不能自然地提供显式的、可伸缩的并行性和可组合性属性,这些属性有效地允许构建可在不同类型的异构平台上有效映射的可移植应用程序。相反,数据流编程是一种自然地提供具有可组合性属性的显式并行程序的方法。本文提出了一种硬件/软件协同设计的方法,通过直接综合硬件描述(HDL)、软件组件(C/ c++)和相互接口,从一个独特的数据流程序生成应用程序的实现,运行在由可重构硬件和多核处理器组成的异构体系结构上。在异构平台上实现JPEG编解码器的实验结果显示了该实现方法的能力和灵活性。
{"title":"Hardware/software co-design of dataflow programs for reconfigurable hardware and multi-core platforms","authors":"Ghislain Roquier, E. Bezati, Richard Thavot, M. Mattavelli","doi":"10.1109/DASIP.2011.6136875","DOIUrl":"https://doi.org/10.1109/DASIP.2011.6136875","url":null,"abstract":"The possibility of specifying both software and hardware components from a unified high-level description of an application is a very attractive design approach. However, despite the efforts spent for implementing such an approach using general purpose programming languages, it has not yet shown to be viable and efficient for complex designs. One of the reasons is that the sequential programming model does not naturally provide explicit and scalable parallelism and composability properties that effectively permits to build portable applications that can be efficiently mapped on different kind of heterogeneous platforms. Conversely dataflow programming is an approach that naturally provides explicit parallel programs with composability properties. This paper presents a methodology for the hardware/software co-design that enables, by direct synthesis of both hardware descriptions (HDL), software components (C/C++) and mutual interfaces, to generate an implementation of the application from an unique dataflow program, running onto heterogeneous architectures composed by reconfigurable hardware and multi-core processors. Experimental results based on the implementation of a JPEG codec onto an heterogeneous platform are also provided to show the capabilities and flexibility of the implementation approach.","PeriodicalId":199500,"journal":{"name":"Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121318094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Proceedings of the 2011 Conference on Design & Architectures for Signal & Image Processing (DASIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1