2012 13th International Workshop on Cellular Nanoscale Networks and their Applications最新文献

英文中文

Nonlinear spatio-temporal wave computing for real-time applications on GPU 非线性时空波计算在GPU上的实时应用

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331419

M. Tukel, R. Yeniceri, M. Yalçin

In this work, active wave simulation on Cellular Nonlinear Network was computed for path planning on the GPU of a NVIDIA GTX275 video card. In software part, QtOpenCL, which is a wrapper library of OpenCL, was used to make code portable for systems with different GPUs. We achieved promising results comparing to results achieved by both CPU and FPGA. We have implemented different hardware and software solutions to path planning problem for 2-D media in real-time. They were almost at limit of real-time requirements because of some bottlenecks such as low communication bandwidth and low resolution of network. In this work, by utilizing GPUs, we performed 60000 iterations per second for simulation of 128×128 node network while we achieved at most 35 iterations per second with software on an Intel Core 2 Duo P8700 processor. We also achieved 36 iterations per second for 3-D active wave simulation of a 256 × 256 × 256 network on GPU.

本文在NVIDIA GTX275显卡的GPU上进行了蜂窝非线性网络有源波仿真，并进行了路径规划。在软件部分，利用OpenCL的封装库QtOpenCL实现了代码在不同gpu系统上的可移植性。与CPU和FPGA的结果相比，我们取得了很好的结果。针对二维介质的实时路径规划问题，我们实现了不同的硬件和软件解决方案。由于通信带宽低、网络分辨率低等瓶颈，它们的实时性几乎达到了极限。在这项工作中，通过使用gpu，我们每秒执行60000次迭代来模拟128×128节点网络，而我们在Intel Core 2 Duo P8700处理器上的软件每秒最多实现35次迭代。我们还在GPU上实现了256 × 256 × 256网络的三维有源波模拟每秒36次迭代。

引用次数: 1

Continuous-time neural networks without local traps for solving Boolean satisfiability 求解布尔可满足性的无局部陷阱连续时间神经网络

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331411

B. Molnár, Z. Toroczkai, M. Ercsey-Ravasz

We present a deterministic continuous-time recurrent neural network similar to CNN models, which can solve Boolean satisfiability (k-SAT) problems without getting trapped in non-solution fixed points. The model can be implemented by analog circuits, in which case the algorithm would take a single operation: the template (connection weights) is set by the k-SAT instance and starting from any initial condition the system converges to a solution. We prove that there is a one-to-one correspondence between the stable fixed points of the model and the k-SAT solutions and present numerical evidence that limit cycles may also be avoided by appropriately choosing the parameters of the model. As this study opens potentially novel technical avenues to tackle hard optimization problems, we also discuss some of the arising questions that need to be investigated in future studies.

我们提出了一种类似于CNN模型的确定性连续时间递归神经网络，它可以解决布尔可满足性(k-SAT)问题，而不会陷入非解不动点。该模型可以通过模拟电路实现，在这种情况下，算法将采取单一操作:模板(连接权重)由k-SAT实例设置，并从任何初始条件开始，系统收敛到解决方案。我们证明了模型的稳定不动点与k-SAT解之间存在一一对应关系，并给出了通过适当选择模型参数也可以避免极限环的数值证据。由于这项研究为解决困难的优化问题开辟了潜在的新技术途径，我们还讨论了一些需要在未来研究中调查的新问题。

引用次数: 8

Analysis of a GPU based CNN implementation 基于GPU的CNN实现分析

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331451

E. László, P. Szolgay, Z. Nagy

The CNN (Cellular Neural Network) is a powerful image processing architecture whose hardware implementation is extremely fast. The lack of such hardware device in a development process can be substituted by using an efficient simulator implementation. Commercially available graphics cards with high computing capabilities make this simulator feasible. The aim of this work is to present a GPU based implementation of a CNN simulator using nVidia's Fermi architecture. Different implementation approaches are considered and compared to a multi-core, multi-threaded CPU and some earlier GPU implementations. A detailed analysis of the introduced GPU implementation is presented.

CNN (Cellular Neural Network)是一种功能强大的图像处理架构，其硬件实现速度非常快。在开发过程中缺少这样的硬件设备可以通过使用有效的模拟器实现来代替。具有高计算能力的商用显卡使该模拟器可行。这项工作的目的是提出一个基于GPU的CNN模拟器的实现，使用nVidia的费米架构。考虑了不同的实现方法，并与多核、多线程CPU和一些早期的GPU实现进行了比较。对引入的GPU实现进行了详细的分析。

引用次数: 16

Integrated CMOS sub-THz imager array 集成CMOS亚太赫兹成像仪阵列

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331457

P. Foldesy, Á. Zarándy

This paper describes the of a 90 nm CMOS sub-THz detector array ASIC. The sub-THz detector array is an integrated system composed of silicon field effect plasma wave sensors, various integrated antennas, pre-amplifiers, ADCs, and digital domain lock-in amplifier detector. The peak responsivity is found 185 kV/W@365 GHz and 52 kV/W@470 GHz and at the detectivity maximum NEP ~ 20 pW/Hz-1.

本文介绍了一种90 nm CMOS亚太赫兹探测器阵列专用集成电路的设计。亚太赫兹探测器阵列是由硅场效应等离子体波传感器、各种集成天线、前置放大器、adc和数字域锁定放大器探测器组成的集成系统。峰值响应度分别为185 kV/W@365 GHz和52 kV/W@470 GHz，最大探测率NEP为20 pW/Hz-1。

引用次数: 4

Memristance and memcapacitance modeling of thin film devices showing memristive behavior 具有忆阻行为的薄膜器件的忆阻和忆电容建模

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331436

M. G. Ahmed, Kyoungrok Cho, Tae-Won Cho

In 2008, the fourth passive element “Memristor” was implemented as a device having both passivity and nonvolatile properties opening the way into new possibilities in the design and fabrication of innovative memory, arithmetic and logic architectures. Nano-features and ionic transport mechanism inherent in memristor device introduce new challenges into modeling, characterization and, in particular, in the related circuit simulation needs with system constructs. Therefore, in this paper, we analyze memristor device fundamentally to characterize the memristance paying particular attention to the hidden memcapacitance effect. Our proposed macro-model modifies takes into account some of the non ideal effects like tunneling current and the hidden memcapacitor constructed across non conducting materials. The model provides the insight for building a device as either memristive or memcapacitive system. The simulation results have been compared with HP published data which show good agreement.

2008年，第四个无源元件“忆阻器”作为一种具有无源性和非易失性的器件被实现，为创新存储器、算术和逻辑架构的设计和制造开辟了新的可能性。忆阻器器件固有的纳米特性和离子传输机制为其建模、表征，特别是与系统结构相关的电路仿真需求带来了新的挑战。因此，本文从根本上分析了忆阻器器件的特性，特别关注了隐性忆电容效应。我们提出的宏观模型修正考虑了一些非理想效应，如隧道电流和在非导电材料上构造的隐藏memcapacitor。该模型为将器件构建为忆阻系统或忆容系统提供了见解。仿真结果与HP公布的数据进行了比较，结果吻合较好。

引用次数: 2

Cellular processor array based UAV safety system 基于蜂窝式处理器阵列的无人机安全系统

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331475

Á. Zarándy, T. Zsedrovits, Zoltán Nagy, A. Kiss, P. Szolgay, T. Roska

Embedded sensor-processor system is being developed for on-board UAV (Unmanned Aerial Vehicle) safety applications. The role of the device is to detect intruder airplanes which are on or close to collision course. Due to weight, power, size, and cost requirements, the visual approach leads to feasible solution only. In our design, 5 cameras are applied to collect visual data from a large field of view. The image flows are processed by 3 different virtual cellular processor arrays, which are implemented in FPGA.

嵌入式传感器处理器系统是为机载无人机(UAV)安全应用而开发的。该装置的作用是探测处于或接近碰撞航线上的入侵飞机。由于重量、功率、尺寸和成本要求，可视化方法只能导致可行的解决方案。在我们的设计中，我们使用了5个摄像头来收集大视场的视觉数据。图像流由3种不同的虚拟蜂窝式处理器阵列处理，并在FPGA上实现。

引用次数: 1

Bio-inspired looming direction detection method 仿生若隐若现方向检测方法

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331448

T. Fulop, Á. Zarándy

The retina inspired approaching object detection algorithm - based on the recently identified Pvlab-5 ganglion cell - is a computationally easy segmentation free method. The original method can detect only the dark looming objects against bright background. This paper shows a modified algorithm, which can detect any looming and recessing objects against dark or bright background. Moreover, we show a post processing evaluation method, which can measure the lateral motion direction using the spatial-temporal activities of the ganglion cells without introducing any hard calculation.

基于最近发现的Pvlab-5神经节细胞的视网膜启发接近目标检测算法是一种计算简便的无分割方法。原来的方法只能在明亮的背景下检测到暗隐现的物体。本文提出了一种改进的算法，该算法可以在暗背景和亮背景下检测出任何隐现和隐没的物体。此外，我们还提出了一种后处理评价方法，该方法可以在不引入任何硬计算的情况下，利用神经节细胞的时空活动来测量横向运动方向。

引用次数: 7

Spin torque oscillator models for applications in associative memories 自旋力矩振荡器模型在联想记忆中的应用

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331474

G. Csaba, M. Pufall, D. Nikonov, G. Bourianoff, A. Horváth, T. Roska, W. Porod

We present physics-based models for both individual and coupled spin torque nano oscillators (STNOs). Such STNOs may become as building blocks for CNN-like dynamic computing architectures. We discuss a hierarchy of models, extending from micromagnetic models, which include the detailed geometry and physics, to compact models, which are based on parameters extracted from the underlying physical description. These simulations also include coupling between individual STNOs, both via spin waves and via electrical interconnects. Using this modeling approach, we demonstrate frequency entrainment and phase synchronization between STOs in the array, which enable computing functions.

我们提出了单个和耦合自旋力矩纳米振荡器(STNOs)的基于物理的模型。这样的stno可能会成为类似cnn的动态计算架构的构建模块。我们讨论了模型的层次结构，从包含详细几何和物理的微磁模型扩展到基于从底层物理描述中提取的参数的紧凑模型。这些模拟还包括单个STNOs之间的耦合，包括通过自旋波和通过电互连。利用这种建模方法，我们演示了阵列中sto之间的频率夹带和相位同步，从而实现了计算功能。

引用次数: 41

Many-core processors and GPU opportunities in particle detectors 粒子探测器中的多核处理器和GPU机会

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-10-18 DOI: 10.1109/CNNA.2012.6331456

N. Neufeld, X. Vilasís-Cardona

High energy physics particle detectors are large and complex devices with very demanding requirements at the level of signal to noise ratios, processing times and data throughput. The first stages of the data acquisition are hardware based while the last ones depend rather on software. Among the solutions to the problems posed by the requirements we may find the use of multi-core processors or maybe GPU's. We shall review what are the points in which these techniques could be of use and the actual proposals.

高能物理粒子探测器是一种大型而复杂的设备，在信噪比、处理时间和数据吞吐量方面有着非常苛刻的要求。数据采集的第一阶段是基于硬件的，而最后阶段则依赖于软件。在这些需求所带来的问题的解决方案中，我们可能会发现使用多核处理器或GPU。我们将审查这些技术可以使用的要点和实际建议。

引用次数: 2

Memory access optimization for computations on unstructured meshes 非结构化网格计算的内存访问优化

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

Pub Date : 2012-08-01 DOI: 10.1109/CNNA.2012.6331437

Antal Hiba, Zoltán Nagy, Miklos Ruszinko

Many real-life applications of processor-arrays suffer from memory bandwidth limitations. In many cases an unstructured mesh is given (computation on sensor data, simulations of physical systems - PDEs), where the vertices represent computations with dependencies represented by the edges. Utilization of processing elements (PEs) during these computations is mainly depends on the node indexing of the mesh. If the adjacent nodes are stored close to each other in main memory, the reloading of node data can be significantly decreased. In case of FPGA the memory accesses can be fully determined by the designer. The mesh and an ordering of its nodes, define the graph bandwidth, which determines the minimum size of on-chip memory to avoid reloading of the nodes from the off-chip memory. If the required on-chip memory size is higher than the available resources, the mesh must be divided into parts. In this paper a novel geometry-based method is presented, which constructs reordered parts from a given unstructured mesh, where each part meets some predefined constraints on graph bandwidth.

处理器阵列的许多实际应用都受到内存带宽限制的影响。在许多情况下，给出了一个非结构化网格(传感器数据的计算，物理系统的模拟- PDEs)，其中顶点表示由边缘表示的依赖关系的计算。在这些计算过程中，处理单元(PEs)的利用率主要取决于网格的节点索引。如果相邻节点在主存中存储得很近，可以显著减少节点数据的重载。对于FPGA，存储器的访问完全可以由设计者决定。网格及其节点的排序定义了图带宽，它决定了片上存储器的最小大小，以避免从片外存储器重新加载节点。如果所需的片上内存大小高于可用资源，则必须将网格划分为多个部分。本文提出了一种新的基于几何的方法，该方法从给定的非结构化网格中构造重新排序的部分，其中每个部分满足预定义的图带宽约束。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2012 13th International Workshop on Cellular Nanoscale Networks and their Applications

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀