2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines最新文献

英文中文

MRAPI Implementation for Heterogeneous Reconfigurable Systems-on-Chip 异构可重构片上系统的MRAPI实现

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

Pub Date : 1900-01-01 DOI: 10.1109/FCCM.2014.74

L. Gantel, M. A. Benkhelifa, F. Verdier, F. Lemonnier

In a Reconfigurable System-on-Chip (SoC) platform, the application is divided into threads managed by an operating system, and whereas some threads are implemented as hardware threads and allocated into a partition of the chip, others run as software threads on embedded processing elements. Relying on the Multicore Resource management API (MRAPI) specification and client-server mechanisms, we propose solutions to provide a flexible access to the operating system services for every threads whatever is the core they are running on. In order to improve the application deployment process when targeting HRSoC, we realized both a hardware and a software implementation of this API. Depending on affinities of the operating system services with each core, such a solution allows a fine-grain implementation of these services over the platform.

在可重构的片上系统(SoC)平台中，应用程序被划分为由操作系统管理的线程，而一些线程作为硬件线程实现并分配到芯片的分区中，其他线程作为嵌入式处理元素上的软件线程运行。依靠多核资源管理API (MRAPI)规范和客户端-服务器机制，我们提出解决方案，为每个线程提供对操作系统服务的灵活访问，无论它们运行在哪个核心上。为了改进针对HRSoC的应用程序部署过程，我们实现了该API的硬件和软件实现。根据操作系统服务与每个核心的亲缘关系，这样的解决方案允许在平台上对这些服务进行细粒度实现。

引用次数: 1

Energy Reduction through Differential Reliability and Lightweight Checking 通过差别化可靠性和轻量化检查降低能耗

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

Pub Date : 1900-01-01 DOI: 10.1109/FCCM.2014.78

E. Kadrić, K. Mahajan, A. DeHon

As technology feature sizes shrink, aggressive voltage scaling is required to contain power density. However, this also increases the rate of transient upsets-potentially preventing us from scaling down voltage and possibly even requiring voltage increases to maintain reliability. Duplication with checking and triple-modular redundancy are traditional approaches to combat transient errors, but spending 2-3× the energy for redundant computation can diminish or reverse the benefits of voltage scaling. As an alternative, we explore the opportunity to use checking computations that are cheaper than the base computation they are guarding. We identify and evaluate the effectiveness of lightweight checks in a broad set of common FPGA tasks in scientific computing and signal and image processing. We find that the lightweight checks cost less than 14% of the base computation. Using an exponential model for the relationship between voltage and transient upset rate, we are able to show over 80% net energy reduction by aggressive voltage scaling without compromising reliability compared to operation at the nominal voltage.

随着技术特征尺寸的缩小，需要积极的电压缩放来控制功率密度。然而，这也增加了瞬态故障的发生率——潜在地阻止了我们降低电压，甚至可能需要增加电压来保持可靠性。带检查的重复和三模冗余是对抗瞬态错误的传统方法，但花费2-3倍的能量进行冗余计算可能会减少或逆转电压缩放的好处。作为一种替代方案，我们探索了使用比它们所保护的基础计算更便宜的检查计算的机会。我们在科学计算、信号和图像处理中广泛的常见FPGA任务中识别和评估轻量级检查的有效性。我们发现轻量级检查的成本不到基础计算的14%。使用电压和瞬态扰动率之间关系的指数模型，我们能够通过积极的电压缩放显示超过80%的净能量减少，而不影响可靠性与标称电压下的运行相比。

{"title":"Energy Reduction through Differential Reliability and Lightweight Checking","authors":"E. Kadrić, K. Mahajan, A. DeHon","doi":"10.1109/FCCM.2014.78","DOIUrl":"https://doi.org/10.1109/FCCM.2014.78","url":null,"abstract":"As technology feature sizes shrink, aggressive voltage scaling is required to contain power density. However, this also increases the rate of transient upsets-potentially preventing us from scaling down voltage and possibly even requiring voltage increases to maintain reliability. Duplication with checking and triple-modular redundancy are traditional approaches to combat transient errors, but spending 2-3× the energy for redundant computation can diminish or reverse the benefits of voltage scaling. As an alternative, we explore the opportunity to use checking computations that are cheaper than the base computation they are guarding. We identify and evaluate the effectiveness of lightweight checks in a broad set of common FPGA tasks in scientific computing and signal and image processing. We find that the lightweight checks cost less than 14% of the base computation. Using an exponential model for the relationship between voltage and transient upset rate, we are able to show over 80% net energy reduction by aggressive voltage scaling without compromising reliability compared to operation at the nominal voltage.","PeriodicalId":246162,"journal":{"name":"2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124339666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

UTOPIA: Generic User-Level Access to the Physical Memory Address Space for IP Core Debugging and Validation on FPGA Based PCIe Extension Cards 基于FPGA的PCIe扩展卡的IP核调试和验证的通用用户级访问物理内存地址空间

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

Pub Date : 1900-01-01 DOI: 10.1109/FCCM.2014.41

Hendrik Noll, Sebastian Siegert, Johannes Hiltscher, W. Rehm

Testing and debugging of an Field Programmable Gate Array (FPGA) based Peripheral Component Interconnect Express (PCIe) extension card require an access to its resources and the system's main memory. Both are accessible via the physical memory address space (PMAS). User-level solutions for accessing this address space exist, but are proprietary and/or limited to specific address ranges, among others. An arbitrary user-level access, e.g. for a flexible validation of an intellectual property (IP) core, is not possible. Enabling such accesses, the open source Linux tool set UTOPIA - including its concept, structure and interfaces - is presented in this paper. Further, bandwidths and latencies between user-level applications and the PMAS are measured and evaluated.

测试和调试基于现场可编程门阵列(FPGA)的PCIe扩展卡需要访问其资源和系统的主存储器。两者都可以通过物理内存地址空间(PMAS)访问。访问此地址空间的用户级解决方案是存在的，但它们是专有的和/或限于特定的地址范围。任意的用户级访问，例如对知识产权(IP)核心的灵活验证，是不可能的。为了实现这样的访问，本文介绍了开源Linux工具集UTOPIA——包括它的概念、结构和接口。此外，还测量和评估了用户级应用程序和PMAS之间的带宽和延迟。

引用次数: 0

Low Power Reconfigurable Controllers for Wireless Sensor Network Nodes 无线传感器网络节点的低功耗可重构控制器

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

Pub Date : 1900-01-01 DOI: 10.1109/FCCM.2014.68

Vivek D. Tovinakere, O. Sentieys, Steven Derrien, Christophe Huriaux

A key concern in the design of controllers in wireless sensor network (WSN) nodes is the flexibility to execute different control tasks involving sensing, communications and computational resources of the node. In this paper, low power flexible controllers for WSN nodes based on reconfigurable microtasks composed of an FSM and datapath are presented. Coarse grain power gating opportunities are exploited in FSM and datapath for low power operation in reconfigurable microtasks. Power estimation results on typical benchmark microtasks show a 2× to 5× improvement in energy efficiency w.r.t a microcontroller at a cost of 5× relative to a microtask implemented as an ASIC with higher NRE costs.

无线传感器网络(WSN)节点控制器设计的一个关键问题是如何灵活地执行涉及节点感知、通信和计算资源的不同控制任务。提出了一种基于FSM和数据路径组成的可重构微任务的WSN节点低功耗柔性控制器。在可重构微任务中利用粗粒度功率门控机会实现低功耗运行。在典型的基准微任务上的功耗估计结果显示，与作为NRE成本较高的ASIC实现的微任务相比，微控制器的能源效率提高了2到5倍，成本为5倍。

引用次数: 5

High-Throughput and Low-Cost Hardware Accelerator for Privacy Preserving Publishing 高通量和低成本的隐私保护出版硬件加速器

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

Pub Date : 1900-01-01 DOI: 10.1109/FCCM.2014.77

Fumito Yamaguchi, H. Nishi

Deep Packet Inspection (DPI) has become crucial for providing rich internet services, such as intrusion and phishing protection, but the use of DPI raises concerns for protecting the privacy of internet users. In this paper, a RAM-based hardware anonymizer is proposed for implementation on a Virtex-5 FPGA device. The results of the hardware anonymizer showed that the proposed architecture reduced circuit usage by 40%.

深度包检测(DPI)已经成为提供丰富的互联网服务的关键，如入侵和网络钓鱼保护，但DPI的使用引起了对保护互联网用户隐私的关注。本文提出了一种基于ram的硬件匿名器，并在Virtex-5 FPGA器件上实现。硬件匿名器的结果表明，所提出的架构减少了40%的电路使用。

引用次数: 0

MixFX-SCORE: Heterogeneous Fixed-Point Compilation of Dataflow Computations MixFX-SCORE:数据流计算的异构定点编译

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

Pub Date : 1900-01-01 DOI: 10.1109/FCCM.2014.64

Deheng Ye, Nachiket Kapre

Mixed-precision implementation of computation can deliver area, throughput and power improvements for dataflow computations over homogeneous fixed-precision circuits without any loss in accuracy. When designing circuits for reconfigurable hardware, we can exercise independent control over bitwidth selection of each variable in the computation. However, selecting the best precision for each variable is an NP-hard problem. While traditional solutions use automated heuristics like simulated annealing or integer linear programming, they still rely on the manual formulation of resource models, which can be tedious, and potentially inaccurate due to the unpredictable interactions between different stages of the FPGA CAD flow. We develop MixFX-SCORE, an automated tool-flow based on FX-SCORE fixed-point compilation framework and simulated annealing, to address this challenge. We outsource error analysis (Gappa++) and resource model generation (Vivado HLS, Logic Synthesis, Xilinx Place-and-Route) to external tools that offer a more accurate representation of error behavior (backed by proofs) and resource usage (based on actual utilization). We demonstrate 1.1-3.5x LUTs count savings, 1-1.8x DSP count reductions, and 1-3.9x dynamic power improvements while still satisfying the accuracy constraints when compared to homogeneous fixed-point implementations.

混合精度计算实现可以在均匀固定精度电路上为数据流计算提供面积、吞吐量和功耗方面的改进，而不会损失任何精度。在为可重构硬件设计电路时，我们可以对计算中每个变量的位宽选择进行独立控制。然而，为每个变量选择最佳精度是一个np困难问题。虽然传统的解决方案使用自动启发式方法，如模拟退火或整数线性规划，但它们仍然依赖于手动制定资源模型，这可能是繁琐的，并且由于FPGA CAD流程的不同阶段之间不可预测的相互作用而可能不准确。我们开发了MixFX-SCORE，这是一个基于FX-SCORE定点编译框架和模拟退火的自动化工具流，以解决这一挑战。我们将错误分析(gappa++)和资源模型生成(Vivado HLS、Logic Synthesis、Xilinx Place-and-Route)外包给外部工具，这些工具可以更准确地表示错误行为(有证据支持)和资源使用(基于实际利用)。我们演示了1.1-3.5倍的lut计数节省，1-1.8倍的DSP计数减少，1-3.9倍的动态功率改进，同时与同质定点实现相比仍然满足精度限制。

{"title":"MixFX-SCORE: Heterogeneous Fixed-Point Compilation of Dataflow Computations","authors":"Deheng Ye, Nachiket Kapre","doi":"10.1109/FCCM.2014.64","DOIUrl":"https://doi.org/10.1109/FCCM.2014.64","url":null,"abstract":"Mixed-precision implementation of computation can deliver area, throughput and power improvements for dataflow computations over homogeneous fixed-precision circuits without any loss in accuracy. When designing circuits for reconfigurable hardware, we can exercise independent control over bitwidth selection of each variable in the computation. However, selecting the best precision for each variable is an NP-hard problem. While traditional solutions use automated heuristics like simulated annealing or integer linear programming, they still rely on the manual formulation of resource models, which can be tedious, and potentially inaccurate due to the unpredictable interactions between different stages of the FPGA CAD flow. We develop MixFX-SCORE, an automated tool-flow based on FX-SCORE fixed-point compilation framework and simulated annealing, to address this challenge. We outsource error analysis (Gappa++) and resource model generation (Vivado HLS, Logic Synthesis, Xilinx Place-and-Route) to external tools that offer a more accurate representation of error behavior (backed by proofs) and resource usage (based on actual utilization). We demonstrate 1.1-3.5x LUTs count savings, 1-1.8x DSP count reductions, and 1-3.9x dynamic power improvements while still satisfying the accuracy constraints when compared to homogeneous fixed-point implementations.","PeriodicalId":246162,"journal":{"name":"2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines","volume":"77 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134410979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Floorplanning for Partially-Reconfigurable FPGA Systems via Mixed-Integer Linear Programming 基于混合整数线性规划的部分可重构FPGA系统平面规划

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

Pub Date : 1900-01-01 DOI: 10.1109/FCCM.2014.61

Marco Rabozzi, J. Lillis, M. Santambrogio

The aim of this paper is to show a novel floorplanner based on Mixed-Integer Linear Programming (MILP), providing a suitable formulation that makes the problem tractable using state-of-the-art solvers. The proposed method takes into account an accurate description of heterogeneous resources and partially reconfigurable constraints of recent FPGAs. A global optimum can be found for small instances in a small amount of time. For large instances, with a time limited search, a 20% average improvement can be achieved over floorplanners based on simulated annealing. Our approach allows the designer to customize the objective function to be minimized, so that different weights can be assigned to a linear combination of metrics such as total wire length, aspect ratio and area occupancy.

本文的目的是展示一种新的基于混合整数线性规划(MILP)的地板规划，提供一个合适的公式，使问题易于处理，使用最先进的求解器。该方法考虑了对异构资源的准确描述和当前fpga的部分可重构约束。可以在很短的时间内为小实例找到全局最优。对于大型实例，具有时间限制的搜索，基于模拟退火的地板规划器可以实现20%的平均改进。我们的方法允许设计师定制最小化的目标函数，这样不同的权重可以分配到一个线性组合的指标，如总线长、宽高比和面积占用。

引用次数: 30

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀