2012 13th International Workshop on Cellular Nanoscale Networks and their Applications最新文献

英文中文

Automatic generation of locally controlled arithmetic unit via floorplan based partitioning 通过基于平面图的分区自动生成本地控制的算术单元

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331442

C. Nemes, Z. Nagy, P. Szolgay

In the paper a framework for generating a locally controlled arithmetic unit is presented including graph generation from a mathematical expression, graph partitioning to determine locally controlled parts of the design and VHDL generation. The output of the framework is a pipelined architecture containing locally controlled groups of floating point units. It is demonstrated that both partitioning and placement aspects of the design have to be considered to obtain a highspeed circuit. In a well-placeable design locally controlled groups can be mapped to FPGA in such a way that only neighboring groups communicate with each other. In the presented algorithm an initial floorplan of the floating point units is produced and a novel graph partitioning representation is used for partitioning the floating point units to obtain a well-placeable design. The framework is demonstrated during the automatic circuit generation of a complex mathematical expression related to Computation Fluid Dynamics (CFD). The framework produces 15-27% faster design than the unpartitioned, globally controlled one in the price of a modest area increase. The framework automatically produces well-placeable deadlock-free partitions for complex expressions as well, while in case of traditional partitioners these objectives cannot be targeted.

本文提出了一种局部控制算法单元的生成框架，包括从数学表达式生成图形、划分图形以确定局部控制部分和生成VHDL。框架的输出是一个包含本地控制的浮点单元组的流水线架构。结果表明，为了获得高速电路，必须考虑设计的划分和放置方面。在一个放置良好的设计中，局部控制的组可以以这样一种方式映射到FPGA，即只有相邻组彼此通信。在该算法中，生成了浮点单元的初始平面图，并采用一种新的图形划分表示来划分浮点单元，以获得良好的放置设计。该框架在计算流体力学(CFD)中一个复杂数学表达式的自动电路生成过程中得到了验证。该框架的设计速度比未分割的、全球控制的设计快15-27%，在价格上有适度的增长。框架还会自动为复杂表达式生成可放置的无死锁分区，而在传统分区器的情况下，这些目标无法实现。

{"title":"Automatic generation of locally controlled arithmetic unit via floorplan based partitioning","authors":"C. Nemes, Z. Nagy, P. Szolgay","doi":"10.1109/CNNA.2012.6331442","DOIUrl":"https://doi.org/10.1109/CNNA.2012.6331442","url":null,"abstract":"In the paper a framework for generating a locally controlled arithmetic unit is presented including graph generation from a mathematical expression, graph partitioning to determine locally controlled parts of the design and VHDL generation. The output of the framework is a pipelined architecture containing locally controlled groups of floating point units. It is demonstrated that both partitioning and placement aspects of the design have to be considered to obtain a highspeed circuit. In a well-placeable design locally controlled groups can be mapped to FPGA in such a way that only neighboring groups communicate with each other. In the presented algorithm an initial floorplan of the floating point units is produced and a novel graph partitioning representation is used for partitioning the floating point units to obtain a well-placeable design. The framework is demonstrated during the automatic circuit generation of a complex mathematical expression related to Computation Fluid Dynamics (CFD). The framework produces 15-27% faster design than the unpartitioned, globally controlled one in the price of a modest area increase. The framework automatically produces well-placeable deadlock-free partitions for complex expressions as well, while in case of traditional partitioners these objectives cannot be targeted.","PeriodicalId":387536,"journal":{"name":"2012 13th International Workshop on Cellular Nanoscale Networks and their Applications","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123712528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Cellular Neural Networks with dynamic cell activity control for Hausdorff distance estimation 基于动态细胞活动控制的细胞神经网络用于Hausdorff距离估计

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331415

M. Janczyk, K. Slot

A concept of Cellular Neural Networks with dynamic cell activity control is proposed in the paper. The concept is an extension to the Fixed State Map mechanism and it assumes that cells can be disabled or enabled for processing based on assessment of current distributions of their neighboring signals. A particular case, where this assessment is made by thresholding a result of cross-correlation between feedback template and neighborhood outputs is shown to provide a simple means for efficient min/max problem handling. This idea requires introducing only minor modifications to a cell structure. As an example, application of the proposed network for fast estimation of Hausdorff distance between two sets has been considered.

提出了具有动态细胞活动控制的细胞神经网络的概念。该概念是对固定状态映射机制的扩展，它假设可以根据对相邻信号的当前分布的评估禁用或启用细胞进行处理。在一个特殊的案例中，通过对反馈模板和邻域输出之间的相互关联结果进行阈值化来进行评估，为有效地处理最小/最大问题提供了一种简单的方法。这个想法只需要对细胞结构进行微小的修改。以该网络为例，研究了该网络在两集间Hausdorff距离快速估计中的应用。

引用次数: 0

Examining the accuracy and the precision of PDEs for FPGA computations 在FPGA计算中检验偏微分方程的精度和精度

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331439

A. Kiss, Z. Nagy, Á. Csík, P. Szolgay

There are a large number of problems which can be accelerated by using architectures on Field Programmable Gate Arrays (FPGA). However sometimes the complexity of a problem does not allow to map it onto a specific FPGA. In that case analysis of precision of the arithmetic unit which may solve the computational problem can be a good attempt to fit the architecture and to accelerate its computation. Numerical algorithm can be implemented using fixed-point or floating point arithmetic (or mixed (both)) with different precision. The aim of the article is not to optimize the numerical algorithm but to find a smaller arithmetic unit precision, which results enough accuracy and fits to smaller FPGA-s. In the paper, one particular problem type is investigated, namely the accuracy of the solution of a simple Partial Differential Equation (PDE). The accuracy measurement is done on an FPGA with different bit width. The solution of the advection equation is analyzed using first and second order discretization methods. As a result we managed to find an optimal bit width for the solution on a specific FPGA.

使用现场可编程门阵列(FPGA)上的架构可以加速许多问题。然而，有时问题的复杂性不允许将其映射到特定的FPGA上。在这种情况下，对计算单元的精度进行分析可以解决计算问题，这是适应体系结构和加快计算速度的一个很好的尝试。数值算法可以使用不同精度的定点或浮点运算(或混合(两者))来实现。本文的目的不是优化数值算法，而是寻找更小的算术单元精度，从而获得足够的精度并适合更小的fpga。本文研究一类特殊的问题，即简单偏微分方程(PDE)解的精度。在不同位宽的FPGA上进行了精度测量。采用一阶和二阶离散方法对平流方程的解进行了分析。因此，我们设法在特定FPGA上找到解决方案的最佳位宽度。

引用次数: 1

Advanced memristive model of synapses with adaptive thresholds 具有自适应阈值的突触的高级记忆模型

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331459

Weiran Cai, Ronald Tetzlaff

In this paper, we propose a memristive STDP model realizing the principle of suppression of Froemke and Dan for triplet spikes. The proposed model claims compatibility with both the pair and triplet STDP rules, going beyond the limit of the basic memristive STDP model. The compatibility is realized by assuming a mechanism of variable thresholds adapting to synaptic potentiation (LTP) and depression (LTD): the preceding LTP has a negative influence on the following LTD. The corresponding dynamical process is governed by a set of ordinary differential equations. It is an equivalent model of the original suppression STDP model. A relation of the adaptive thresholds to short-term plasticity is addressed.

在本文中，我们提出了一个记忆的STDP模型，实现了对三重峰的Froemke和Dan抑制原理。提出的模型声称兼容对和三重STDP规则，超越了基本记忆性STDP模型的限制。这种相容性是通过假设一个适应突触增强(LTP)和抑制(LTD)的可变阈值机制来实现的:前一个LTP对后一个LTD有负向影响。相应的动力学过程由一组常微分方程控制。它是原始抑制STDP模型的等效模型。研究了自适应阈值与短期塑性的关系。

引用次数: 9

FPGA-based generation of autowaves in Memristive Cellular Neural Networks 记忆细胞神经网络中基于fpga的自动波生成

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331435

V. Pham, A. Buscarino, M. Frasca, L. Fortuna, T. Hoang

Cellular Neural/Nonlinear Networks (CNNs) constitute an effective approach for studying complex phenomena like autowaves, spiral waves or pattern formation either by providing a computationally efficient environment for numerical simulations or by allowing the possibility of hardware emulators of the system under study. In this work, we focus on a CNN made of memristor-based cells, namely a Memristive Cellular Neural/Nonlinear Network (MCNN). This has been recently shown to be capable of generating complex phenomena such as autowave propagation. In this work, we implement such a MCNN by using Field Programmable Gate Array (FPGA). Our system consisting of a FPGA development board connected to a monitor allows us to emulate autowave propagation in an efficient way. Experimental results show the feasibility of FPGA-based approach to implement MCNN.

细胞神经/非线性网络(cnn)是研究自动波、螺旋波或模式形成等复杂现象的有效方法，它为数值模拟提供了高效的计算环境，并允许所研究系统的硬件模拟器的可能性。在这项工作中，我们专注于由基于忆阻器的细胞组成的CNN，即忆阻细胞神经/非线性网络(MCNN)。这最近已被证明能够产生复杂的现象，如自动波传播。在这项工作中，我们使用现场可编程门阵列(FPGA)实现了这样一个MCNN。我们的系统由连接到监视器的FPGA开发板组成，使我们能够以有效的方式模拟自动波传播。实验结果表明，基于fpga实现MCNN的方法是可行的。

引用次数: 1

MRL — Memristor Ratioed Logic MRL -忆阻器比率逻辑

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331426

Shahar Kvatinsky, Nimrod Wald, Guy Satat, A. Kolodny, U. Weiser, E. Friedman

Memristive devices are novel structures, developed primarily as memory. Another interesting application for memristive devices is logic circuits. In this paper, MRL (Memristor Ratioed Logic) - a hybrid CMOS-memristive logic family - is described. In this logic family, OR and AND logic gates are based on memristive devices, and CMOS inverters are added to provide a complete logic structure and signal restoration. Unlike previously published memristive-based logic families, the MRL family is compatible with standard CMOS logic. A case study of an eight-bit full adder is presented and related design considerations are discussed.

忆阻装置是一种新颖的结构，主要是作为记忆而发展起来的。忆阻器件另一个有趣的应用是逻辑电路。本文描述了忆阻比逻辑(MRL)——一种cmos -忆阻混合逻辑族。在该逻辑家族中，OR和and逻辑门基于忆阻器件，并添加CMOS逆变器以提供完整的逻辑结构和信号恢复。与以前发布的基于记忆的逻辑系列不同，MRL系列与标准CMOS逻辑兼容。给出了一个8位全加法器的案例研究，并讨论了相关的设计注意事项。

引用次数: 230

Non-Boolean associative architectures based on nano-oscillators 基于纳米振荡器的非布尔关联体系结构

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331473

S. Levitan, Yan Fang, D. Dash, T. Shibata, D. Nikonov, G. Bourianoff

Many of the proposed and emerging nano-scale technologies simply cannot compete with CMOS in terms of energy efficiency for performing Boolean operations. However, the potential for these technologies to perform useful non-Boolean computations remains an opportunity to be explored. In this talk we examine the use of the resonance of coupled nano-scale oscillators as a primitive computational operator for associative processing and develop the architectural structures that could enable such devices to be integrated into mainstream applications.

在执行布尔运算的能源效率方面，许多提出的和新兴的纳米级技术根本无法与CMOS竞争。然而，这些技术执行有用的非布尔计算的潜力仍然是一个有待探索的机会。在这次演讲中，我们研究了耦合纳米级振荡器的共振作为联想处理的基本计算算子的使用，并开发了能够使此类设备集成到主流应用中的架构结构。

引用次数: 53

Study on application of Reference Conjugated Hologram for aberration correction of multiple object planes 参考共轭全息图在多物面像差校正中的应用研究

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331445

B. Nagy, S. Tõkés

Aberration correction using Reference Conjugated Hologram (RCH) method is investigated. However we use it not for a single but for a number of reconstructed object planes in Digital Holographic Microscopy (DHM). We build an off-axis DHM for testing the performance of the method. The limits of this method have been studied. We compare in-line with aberration compensated off-axis DHM. The in-line DHM compensates quite the same aberrations physically as the RCH method numerically.

研究了参考共轭全息(RCH)法的像差校正。然而，在数字全息显微镜(DHM)中，我们使用的不是单一的，而是多个重建的物体平面。我们建立了一个离轴DHM来测试该方法的性能。研究了这种方法的局限性。我们比较了直线和像差补偿的离轴DHM。直列DHM在物理上补偿了与RCH方法相当的像差。

引用次数: 0

An Associative Memory with oscillatory CNN arrays using spin torque oscillator cells and spin-wave interactions architecture and End-to-end Simulator 基于自旋力矩振荡器单元、自旋波相互作用结构和端到端模拟器的振荡CNN阵列联想存储器

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331463

T. Roska, A. Horváth, A. Stubendek, F. Corinto, G. Csaba, W. Porod, T. Shibata, G. Bourianoff

An Associative Memory is built by three consecutive components: (1) a CMOS preprocessing unit generating input feature vectors from picture inputs, (2) an AM cluster generating signature outputs composed of spintronic oscillator (STO) cells and local spin-wave interactions, as an oscillatory CNN (O-CNN) array unit, applied several times arranged in space, and (3) a classification unit (CMOS). The end to end design of the preprocessing unit, the interacting O-CNN arrays, and the classification unit is embedded in a learning and optimization procedure where the geometric distances between the STOs in the O-CNN arrays play a crucial role. The O-CNN array has an input vector as a 1D array of oscillator frequencies, and the synchronized O-CNN array codes the output as the phases of the output 1D array. The typical O-CNN array has 1-3 rows of STOs. Simplified STO and interaction macro models are used. A typical example is shown using an End-to-end Simulator.

联想记忆由三个连续组件组成:(1)从图像输入生成输入特征向量的CMOS预处理单元;(2)由自旋电子振荡器(STO)单元和局部自旋波相互作用组成的AM簇生成签名输出，作为振荡CNN (O-CNN)阵列单元，在空间中多次应用;(3)分类单元(CMOS)。预处理单元、相互作用的O-CNN阵列和分类单元的端到端设计嵌入到一个学习和优化过程中，其中O-CNN阵列中STOs之间的几何距离起着至关重要的作用。O-CNN阵列的输入矢量为振荡器频率的一维阵列，同步的O-CNN阵列将输出编码为输出一维阵列的相位。典型的O-CNN阵列有1-3行sto。使用简化的STO和交互宏模型。使用端到端模拟器展示了一个典型的示例。

引用次数: 18

Real-time use of GPUs in NA62 experiment gpu在NA62实验中的实时使用

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331454

G. Collazuol, V. Innocente, G. Lamanna, F. Pantaleo, M. Sozzi

We describe a pilot project for the use of GPUs in a real-time triggering application in the early trigger stages at the CERN NA62 experiment, and the results of the first field tests together with a prototype data acquisition (DAQ) system. This pilot project within NA62 aims at integrating GPUs into the central L0 trigger processor, and also to use them as fast online processors for computing trigger primitives. Several TDC-equipped sub-detectors with sub-nanosecond time resolution will participate in the first-level NA62 trigger (L0), fully integrated with the data-acquisition system, to reduce the readout rate of all sub-detectors to 1 MHz, using multiplicity information asynchronously computed over time frames of a few ns, both for positive sub-detectors and for vetos. The online use of GPUs would allow the computation of more complex trigger primitives already at this first trigger level. We describe the architectures of the proposed systems, focusing on measuring the performance (both throughput and latency) of various approaches meant to solve these high energy physics problems. The challenges and the prospects of this promising idea are discussed.

我们描述了在CERN NA62实验的早期触发阶段，在实时触发应用中使用gpu的试点项目，以及第一次现场测试的结果以及一个原型数据采集(DAQ)系统。NA62中的这个试点项目旨在将gpu集成到中央L0触发处理器中，并将它们用作计算触发原语的快速在线处理器。几个配备tdc的亚纳秒时间分辨率的子探测器将参与一级NA62触发器(L0)，与数据采集系统完全集成，使用在几ns的时间框架内异步计算的多重信息，将所有子探测器的读出率降低到1 MHz，包括正子探测器和反子探测器。gpu的在线使用将允许在第一个触发级别上计算更复杂的触发原语。我们描述了所提议系统的架构，重点是测量用于解决这些高能物理问题的各种方法的性能(吞吐量和延迟)。讨论了这一有前途的想法所面临的挑战和前景。

{"title":"Real-time use of GPUs in NA62 experiment","authors":"G. Collazuol, V. Innocente, G. Lamanna, F. Pantaleo, M. Sozzi","doi":"10.1109/CNNA.2012.6331454","DOIUrl":"https://doi.org/10.1109/CNNA.2012.6331454","url":null,"abstract":"We describe a pilot project for the use of GPUs in a real-time triggering application in the early trigger stages at the CERN NA62 experiment, and the results of the first field tests together with a prototype data acquisition (DAQ) system. This pilot project within NA62 aims at integrating GPUs into the central L0 trigger processor, and also to use them as fast online processors for computing trigger primitives. Several TDC-equipped sub-detectors with sub-nanosecond time resolution will participate in the first-level NA62 trigger (L0), fully integrated with the data-acquisition system, to reduce the readout rate of all sub-detectors to 1 MHz, using multiplicity information asynchronously computed over time frames of a few ns, both for positive sub-detectors and for vetos. The online use of GPUs would allow the computation of more complex trigger primitives already at this first trigger level. We describe the architectures of the proposed systems, focusing on measuring the performance (both throughput and latency) of various approaches meant to solve these high energy physics problems. The challenges and the prospects of this promising idea are discussed.","PeriodicalId":387536,"journal":{"name":"2012 13th International Workshop on Cellular Nanoscale Networks and their Applications","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125555697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀