2015 International Joint Conference on Neural Networks (IJCNN)最新文献

英文中文

Network-traffic anomaly detection with incremental majority learning 基于增量多数学习的网络流量异常检测

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280573

Shin-Ying Huang, Fang Yu, R. Tsaih, Yennun Huang

Detecting anomaly behavior in large network traffic data has presented a great challenge in designing effective intrusion detection systems. We propose an adaptive model to learn majority patterns under a dynamic changing environment. We first propose unsupervised learning on data abstraction to extract essential features of samples. We then adopt incremental majority learning with iterative evolutions on fitting envelopes to characterize the majority of samples within moving windows. A network traffic sample is considered an anomaly if its abstract feature falls on the outside of the fitting envelope. We justify the effectiveness of the presented approach against 150000+ traffic samples from the NSL-KDD dataset in training and testing, demonstrating positive promise in detecting network attacks by identifying samples that have abnormal features.

在大型网络流量数据中检测异常行为对设计有效的入侵检测系统提出了很大的挑战。我们提出了一个自适应模型来学习动态变化环境下的大多数模式。我们首先提出了数据抽象的无监督学习，以提取样本的基本特征。然后，我们采用增量多数学习和拟合包络的迭代进化来表征移动窗口内的大多数样本。如果网络流量样本的抽象特征落在拟合包络的外部，则认为它是异常的。我们在训练和测试中对来自NSL-KDD数据集的150000多个流量样本证明了所提出方法的有效性，通过识别具有异常特征的样本来检测网络攻击，展示了积极的前景。

引用次数: 10

A tree-structured representation for book author and its recommendation using multilayer SOM 基于多层SOM的图书作者树状表示及其推荐

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280530

Lu Lu, Haijun Zhang

This paper introduces a new framework for author recommending using Multi-Layer Self-Organizing Map (ML-SOM). Concretely, an author is modeled by a tree-structured representation, and an MLSOM-based system is used as an efficient solution to the content-based author recommending problem. The tree-structured representation formulates author features in a hierarchy of author biography, written books and book comments. To efficiently tackle the tree-structured representation, we use an MLSOM algorithm that serves as a clustering technique to handle authors. The effectiveness of our approach was examined in a large-scale dataset containing 7426 authors, 205805 books they wrote, and 3027502 comments that readers have provided. The experimental results corroborate that the proposed approach outperforms current algorithms and can provide a promising solution to author recommendation.

本文介绍了一个作者推荐的新框架——多层自组织映射(ML-SOM)。具体而言，采用树状结构对作者进行建模，并采用基于mlsom的系统有效地解决了基于内容的作者推荐问题。树状结构的表示在作者传记、已写的书和书评的层次结构中表述作者的特征。为了有效地处理树结构表示，我们使用MLSOM算法作为聚类技术来处理作者。我们的方法的有效性在一个包含7426位作者、他们写的205805本书和读者提供的3027502条评论的大型数据集中得到了检验。实验结果证实了该方法优于现有算法，为作者推荐提供了一个有希望的解决方案。

引用次数: 4

A novelty detection approach to identify the occurrence of leakage in smart gas and water grids 一种识别智能燃气和供水管网泄漏的新颖检测方法

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280473

Marco Fagiani, S. Squartini, M. Severini, F. Piazza

In this paper, a novelty detection algorithm for the identification of leakages in smart water/gas grid contexts is proposed. It is based on two separate stages: the first deals with the creation of the statistical leakage-free model, whereas the second evaluates the eventual occurrence of leakage on the basis of the model likelihood. Up to the authors' knowledge, this approach has never been used in the application scenario of interest. A set of several features are extracted from the Almanac of Minutely Power Dataset, and a suboptimal selection is executed to determinate the best combination. The abnormal event (leakage) is induced by manipulating the consumption in the test set. A total of 10 background models are created, by employing both Gaussian Mixture Models (GMMs) and Hidden Markov Models (HMMs) under a comparative perspective, and each of them is adopted to detect 10 leakages, with random duration, length and starting time. Finally, the performance are evaluated in terms of Area Under Curve (AUC) of the Receiver Operating Characteristic (ROC). Obtained results are more than encouraging: the best average AUCs of 85.60% and 87.97% are achieved with HMM, at 1 minute resolution, for natural gas and water, respectively. Specifically, considering true detection rates (TDRs) of 100%, the natural gas exhibits an overall false detection rate (FDR) of 17.11%, and the water achieves an overall FDR of 13.79%.

本文提出了一种用于智能水/气管网泄漏识别的新颖性检测算法。它基于两个独立的阶段:第一个阶段处理统计无泄漏模型的创建，而第二个阶段根据模型的可能性评估泄漏的最终发生。据作者所知，这种方法从未在感兴趣的应用场景中使用过。从分钟功率数据集的Almanac中提取多个特征集合，并执行次优选择以确定最佳组合。异常事件(泄漏)是通过操纵测试集中的消耗引起的。采用比较视角下的高斯混合模型(GMMs)和隐马尔可夫模型(hmm)，共建立10个背景模型，分别检测10个泄漏，泄漏持续时间、长度和开始时间随机。最后，根据接收机工作特征(ROC)的曲线下面积(AUC)对性能进行评估。所获得的结果非常令人鼓舞:在1分钟分辨率下，HMM对天然气和水的平均auc分别达到了85.60%和87.97%。具体而言，考虑到100%的真实检测率(tdr)，天然气的整体误检率(FDR)为17.11%，水的整体误检率为13.79%。

{"title":"A novelty detection approach to identify the occurrence of leakage in smart gas and water grids","authors":"Marco Fagiani, S. Squartini, M. Severini, F. Piazza","doi":"10.1109/IJCNN.2015.7280473","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280473","url":null,"abstract":"In this paper, a novelty detection algorithm for the identification of leakages in smart water/gas grid contexts is proposed. It is based on two separate stages: the first deals with the creation of the statistical leakage-free model, whereas the second evaluates the eventual occurrence of leakage on the basis of the model likelihood. Up to the authors' knowledge, this approach has never been used in the application scenario of interest. A set of several features are extracted from the Almanac of Minutely Power Dataset, and a suboptimal selection is executed to determinate the best combination. The abnormal event (leakage) is induced by manipulating the consumption in the test set. A total of 10 background models are created, by employing both Gaussian Mixture Models (GMMs) and Hidden Markov Models (HMMs) under a comparative perspective, and each of them is adopted to detect 10 leakages, with random duration, length and starting time. Finally, the performance are evaluated in terms of Area Under Curve (AUC) of the Receiver Operating Characteristic (ROC). Obtained results are more than encouraging: the best average AUCs of 85.60% and 87.97% are achieved with HMM, at 1 minute resolution, for natural gas and water, respectively. Specifically, considering true detection rates (TDRs) of 100%, the natural gas exhibits an overall false detection rate (FDR) of 17.11%, and the water achieves an overall FDR of 13.79%.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"42 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79755684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

New insights into the landscape relationships of host response to bacterial pathogens 宿主对细菌病原体反应的景观关系的新见解

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280410

Xiaoyao Yin, Lu Han, Hui Bai, Xiaochen Bo, Yun Bai, Cong Niu, Naiyang Guan, Zhigang Luo

Modern understanding of microbiology largely lays foundation in the biological characterization of microorganisms. However, the landscape relationships of host transcriptional response (HTR) to different bacterial pathogens have not yet been systematically explored. Here, we established the first generation of HTR network (HTRN) according to the HTR similarities among 21 different human pathogenic bacterial species by integrating 258 pairs of host cellular gene expression profiles upon infections. Further, the network was dissected into five bacterial communities of more consensus internal HTR. Interestingly, analysis of signature genes across different communities revealed that distinct community signatures (CS) present differential gene expression patterns. Functional annotation suggested a common feature of host cell response to bacterial infections that specific functional gene clusters (BPs and/or signaling pathways) were preferentially elicited or subverted by community bacterial pathogens. Notably, community signatures (especially key associators participating dissimilar functional profiles) were highly enriched of GWAS disease-related genes, which associated bacterial infections with common and specific non-infectious human disease(s). About 40% of the associations were confirmed by literature investigation that further indicated possible/potential association directionality. Our characterization and analysis were the first to feature differential community HTRs upon bacterial pathogen infections and suggested new perspective of understanding infection-disease associations and underlying pathogenesis.

现代对微生物学的认识在很大程度上为微生物的生物学特性奠定了基础。然而，宿主对不同病原菌的转录反应(HTR)的景观关系尚未得到系统的探讨。本研究通过整合258对感染后宿主细胞基因表达谱，根据21种不同人类致病菌HTR的相似性，建立了第一代HTR网络(HTRN)。此外，该网络被分解成五个更一致的内部HTR细菌群落。有趣的是，对不同群落特征基因的分析表明，不同的群落特征(CS)存在差异的基因表达模式。功能注释表明宿主细胞对细菌感染反应的一个共同特征是，特定的功能基因簇(bp和/或信号通路)被群落细菌病原体优先诱导或破坏。值得注意的是，社区特征(特别是参与不同功能谱的关键相关基因)高度富集了GWAS疾病相关基因，这些基因将细菌感染与常见和特定的非传染性人类疾病联系起来。约40%的关联通过文献调查得到证实，进一步表明可能/潜在的关联方向性。我们的表征和分析首次揭示了细菌病原体感染的不同群落htr，并为理解感染-疾病关联和潜在发病机制提供了新的视角。

{"title":"New insights into the landscape relationships of host response to bacterial pathogens","authors":"Xiaoyao Yin, Lu Han, Hui Bai, Xiaochen Bo, Yun Bai, Cong Niu, Naiyang Guan, Zhigang Luo","doi":"10.1109/IJCNN.2015.7280410","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280410","url":null,"abstract":"Modern understanding of microbiology largely lays foundation in the biological characterization of microorganisms. However, the landscape relationships of host transcriptional response (HTR) to different bacterial pathogens have not yet been systematically explored. Here, we established the first generation of HTR network (HTRN) according to the HTR similarities among 21 different human pathogenic bacterial species by integrating 258 pairs of host cellular gene expression profiles upon infections. Further, the network was dissected into five bacterial communities of more consensus internal HTR. Interestingly, analysis of signature genes across different communities revealed that distinct community signatures (CS) present differential gene expression patterns. Functional annotation suggested a common feature of host cell response to bacterial infections that specific functional gene clusters (BPs and/or signaling pathways) were preferentially elicited or subverted by community bacterial pathogens. Notably, community signatures (especially key associators participating dissimilar functional profiles) were highly enriched of GWAS disease-related genes, which associated bacterial infections with common and specific non-infectious human disease(s). About 40% of the associations were confirmed by literature investigation that further indicated possible/potential association directionality. Our characterization and analysis were the first to feature differential community HTRs upon bacterial pathogen infections and suggested new perspective of understanding infection-disease associations and underlying pathogenesis.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"103 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80298299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An analysis of Dynamic Cortex Memory networks 动态皮层记忆网络的分析

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280753

S. Otte, A. Zell, M. Liwicki

The recently introduced Dynamic Cortex Memory (DCM) is an extension of the Long Short Term Memory (LSTM) providing a systematic inter-gate connection infrastructure. In this paper the behavior of DCM networks is studied in more detail and their potential in the field of gradient-based sequence learning is investigated. Hereby, DCM networks are analyzed regarding particular key features of neural signal processing systems, namely, their robustness to noise and their ability of time warping. Throughout all experiments we show that DCMs converge faster and yield better results than LSTMs. Hereby, DCM networks require overall less weights than pure LSTM networks to achieve the same or even better results. Besides, a promising neurally implemented just-in-time online signal filter approach is presented, which is latency-free and still provides an accurate filtering performance much better than conventional low-pass filters. We also show that the neural networks can do explicit time warping even better than the Dynamic Time Warping (DTW) algorithm, which is a specialized method developed for this task.

最近推出的动态皮层记忆(DCM)是长短期记忆(LSTM)的扩展，提供了系统的门间连接基础设施。本文更详细地研究了DCM网络的行为，并探讨了其在基于梯度的序列学习领域的潜力。因此，针对神经信号处理系统的特定关键特征，即对噪声的鲁棒性和时间翘曲能力，对DCM网络进行了分析。在所有的实验中，我们证明了dcm比lstm收敛得更快，并且产生了更好的结果。因此，与纯LSTM网络相比，DCM网络总体上需要更少的权重来获得相同甚至更好的结果。此外，提出了一种有前途的神经实现的实时在线信号滤波方法，该方法在无延迟的情况下仍然提供比传统低通滤波器更好的精确滤波性能。我们还表明，神经网络可以做显式时间翘曲，甚至比动态时间翘曲(DTW)算法更好，动态时间翘曲(DTW)算法是为此任务开发的专门方法。

引用次数: 8

The on-line curvilinear component analysis (onCCA) for real-time data reduction 在线曲线分量分析(onCCA)用于实时数据约简

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280318

G. Cirrincione, J. Hérault, V. Randazzo

Real time pattern recognition applications often deal with high dimensional data, which require a data reduction step which is only performed offline. However, this loses the possibility of adaption to a changing environment. This is also true for other applications different from pattern recognition, like data visualization for input inspection. Only linear projections, like the principal component analysis, can work in real time by using iterative algorithms while all known nonlinear techniques cannot be implemented in such a way and actually always work on the whole database at each epoch. Among these nonlinear tools, the Curvilinear Component Analysis (CCA), which is a non-convex technique based on the preservation of the local distances into the lower dimensional space, plays an important role. This paper presents the online version of CCA. It inherits the same features of CCA, is adaptive in real time and tracks non-stationary high dimensional distributions. It is composed of neurons with two weights: one, pointing to the input space, quantizes the data distribution, and the other, pointing to the output space, represents the projection of the first weight. This on-line CCA has been conceived not only for the previously cited applications, but also as a basic tool for more complex supervised neural networks for modelling very complex high dimensional data. This algorithm is tested on 2-D and 3-D synthetic data and on an experimental database concerning the bearing faults of an electrical motor, with the goal of novelty (fault) detection.

实时模式识别应用通常处理高维数据，这需要一个数据约简步骤，而这个步骤只能离线执行。然而，这失去了适应不断变化的环境的可能性。对于与模式识别不同的其他应用程序也是如此，例如用于输入检查的数据可视化。只有线性投影，如主成分分析，可以通过迭代算法实时工作，而所有已知的非线性技术都不能以这种方式实现，并且实际上总是在每个历元的整个数据库上工作。在这些非线性工具中，曲线分量分析(CCA)是一种基于局部距离保持到低维空间的非凸技术，在非线性分析中起着重要作用。本文介绍了CCA的在线版本。它继承了CCA的特点，具有实时自适应和跟踪非平稳高维分布的能力。它由两个权重的神经元组成:一个指向输入空间，量化数据分布，另一个指向输出空间，表示第一个权重的投影。这种在线CCA不仅适用于前面提到的应用，而且还可以作为更复杂的监督神经网络的基本工具，用于模拟非常复杂的高维数据。该算法在二维和三维合成数据以及电机轴承故障的实验数据库上进行了测试，目的是进行新颖性(故障)检测。

{"title":"The on-line curvilinear component analysis (onCCA) for real-time data reduction","authors":"G. Cirrincione, J. Hérault, V. Randazzo","doi":"10.1109/IJCNN.2015.7280318","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280318","url":null,"abstract":"Real time pattern recognition applications often deal with high dimensional data, which require a data reduction step which is only performed offline. However, this loses the possibility of adaption to a changing environment. This is also true for other applications different from pattern recognition, like data visualization for input inspection. Only linear projections, like the principal component analysis, can work in real time by using iterative algorithms while all known nonlinear techniques cannot be implemented in such a way and actually always work on the whole database at each epoch. Among these nonlinear tools, the Curvilinear Component Analysis (CCA), which is a non-convex technique based on the preservation of the local distances into the lower dimensional space, plays an important role. This paper presents the online version of CCA. It inherits the same features of CCA, is adaptive in real time and tracks non-stationary high dimensional distributions. It is composed of neurons with two weights: one, pointing to the input space, quantizes the data distribution, and the other, pointing to the output space, represents the projection of the first weight. This on-line CCA has been conceived not only for the previously cited applications, but also as a basic tool for more complex supervised neural networks for modelling very complex high dimensional data. This algorithm is tested on 2-D and 3-D synthetic data and on an experimental database concerning the bearing faults of an electrical motor, with the goal of novelty (fault) detection.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"34 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81638000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Lattice point sets for efficient kernel smoothing models 高效核平滑模型的点阵集

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280469

C. Cervellera, Mauro Gaggero, Danilo Macciò, R. Marcialis

This work addresses the problem of learning an unknown function from data when local models are employed. In particular, kernel smoothing models are considered, which use kernels in a straightforward fashion by modeling the output as a weighted average of values observed in a neighborhood of the input. Such models are a popular alternative to other kernel paradigms, such as support vector machines (SVM), due to their very light computational burden. The purpose of this work is to prove that a smart deterministic selection of the observation points can be advantageous with respect to input data coming from a pure random sampling. Apart from the theoretical interest, this has a practical implication in all the cases in which one can control the generation of the input samples (e.g., in applications from robotics, dynamic programming, optimization, mechanics, etc.) To this purpose, lattice point sets (LPSs), a special kind of sampling schemes commonly employed for efficient numerical integration, are investigated. It is proved that building local kernel smoothers using LPSs guarantees universal approximation property with better rates with respect to i.i.d. sampling. Then, a rule for automatic kernel width selection, making the computational burden of building the model negligible, is introduced to show how the regular structure of the lattice can lead to practical advantages. Simulation results are also provided to test in practice the performance of the proposed methods.

这项工作解决了当使用局部模型时从数据中学习未知函数的问题。特别是，考虑了核平滑模型，它通过将输出建模为输入邻域中观察到的值的加权平均值，以一种直接的方式使用核。由于这些模型的计算负担非常轻，因此它们是其他内核范例(如支持向量机(SVM))的流行替代方案。这项工作的目的是证明，相对于来自纯随机抽样的输入数据，观察点的智能确定性选择是有利的。除了理论兴趣之外，这在所有可以控制输入样本生成的情况下(例如，在机器人，动态规划，优化，力学等应用中)具有实际意义。为此，研究了晶格点集(lps)，一种通常用于有效数值积分的特殊采样方案。证明了利用LPSs构建局部核平滑可以保证对i.i.d采样具有较好的普适性和近似率。然后，引入了一个自动核宽度选择规则，使构建模型的计算负担可以忽略不计，以显示晶格的规则结构如何导致实际优势。仿真结果验证了所提方法的实际性能。

{"title":"Lattice point sets for efficient kernel smoothing models","authors":"C. Cervellera, Mauro Gaggero, Danilo Macciò, R. Marcialis","doi":"10.1109/IJCNN.2015.7280469","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280469","url":null,"abstract":"This work addresses the problem of learning an unknown function from data when local models are employed. In particular, kernel smoothing models are considered, which use kernels in a straightforward fashion by modeling the output as a weighted average of values observed in a neighborhood of the input. Such models are a popular alternative to other kernel paradigms, such as support vector machines (SVM), due to their very light computational burden. The purpose of this work is to prove that a smart deterministic selection of the observation points can be advantageous with respect to input data coming from a pure random sampling. Apart from the theoretical interest, this has a practical implication in all the cases in which one can control the generation of the input samples (e.g., in applications from robotics, dynamic programming, optimization, mechanics, etc.) To this purpose, lattice point sets (LPSs), a special kind of sampling schemes commonly employed for efficient numerical integration, are investigated. It is proved that building local kernel smoothers using LPSs guarantees universal approximation property with better rates with respect to i.i.d. sampling. Then, a rule for automatic kernel width selection, making the computational burden of building the model negligible, is introduced to show how the regular structure of the lattice can lead to practical advantages. Simulation results are also provided to test in practice the performance of the proposed methods.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"30 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84436673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Regularity and randomness in modular network structures for neural associative memories 神经联想记忆模块网络结构的规律性和随机性

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280829

G. Tanaka, T. Yamane, D. Nakano, R. Nakane, Y. Katayama

This study explores efficient structures of artificial neural networks for associative memories. Motivated by the real brain structure and the demand of energy efficiency in hardware implementation, we consider neural networks with sparse modular structures. Numerical experiments are performed to clarify how the storage capacity of associative memory depends on regularity and randomness of the network structures. We first show that a fully regularized network, suited for design of hardware, has poor recall performance and a fully random network, undesired for hardware implementation, yields excellent recall performance. For seeking a network structure with good performance and high implementability, we consider four different modular networks constructed based on different combinations of regularity and randomness. From the results of associative memory tests for these networks, we find that the combination of random intramodule connections and regular intermodule connections works better than the other cases. Our results suggest that the parallel usage of regularity and randomness in network structures could be beneficial for developing energy-efficient neural networks.

本研究探讨了人工神经网络在联想记忆中的有效结构。考虑到真实的大脑结构和硬件实现中对能量效率的需求，我们考虑了稀疏模块化结构的神经网络。通过数值实验阐明了网络结构的规律性和随机性对联想记忆存储容量的影响。我们首先表明，适合硬件设计的完全正则化网络具有较差的召回性能，而完全随机网络(不适合硬件实现)具有出色的召回性能。为了寻求一种具有良好性能和高可实现性的网络结构，我们考虑了基于规则性和随机性的不同组合构建的四种不同的模块化网络。从这些网络的联想记忆测试结果中，我们发现随机模块内连接和规则模块间连接的组合比其他情况效果更好。我们的研究结果表明，在网络结构中并行使用规律性和随机性可能有利于开发节能的神经网络。

{"title":"Regularity and randomness in modular network structures for neural associative memories","authors":"G. Tanaka, T. Yamane, D. Nakano, R. Nakane, Y. Katayama","doi":"10.1109/IJCNN.2015.7280829","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280829","url":null,"abstract":"This study explores efficient structures of artificial neural networks for associative memories. Motivated by the real brain structure and the demand of energy efficiency in hardware implementation, we consider neural networks with sparse modular structures. Numerical experiments are performed to clarify how the storage capacity of associative memory depends on regularity and randomness of the network structures. We first show that a fully regularized network, suited for design of hardware, has poor recall performance and a fully random network, undesired for hardware implementation, yields excellent recall performance. For seeking a network structure with good performance and high implementability, we consider four different modular networks constructed based on different combinations of regularity and randomness. From the results of associative memory tests for these networks, we find that the combination of random intramodule connections and regular intermodule connections works better than the other cases. Our results suggest that the parallel usage of regularity and randomness in network structures could be beneficial for developing energy-efficient neural networks.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"89 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84475956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

A word distributed representation based framework for large-scale short text classification 基于单词分布式表示的大规模短文本分类框架

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280513

Di Yao, Jingping Bi, Jianhui Huang, Jin Zhu

With the development of internet, there are billions of short texts generated each day. However, the accuracy of large scale short text classification is poor due to the data sparseness. Traditional methods used to use external dataset to enrich the representation of document and solve the data sparsity problem. But external dataset which matches the specific short texts is hard to find. In this paper, we propose a framework to solve the data sparsity problem without using external dataset. Our framework deal with large scale short text by making the most of semantic similarity of words which learned from the training short texts. First, we learn word distributed representation and measure the word semantic similarity from the training short texts. Then, we propose a method which enrich the document representation by using the word semantic similarity information. At last, we build classifiers based on the enriched representation. We evaluate our framework on both the benchmark dataset(Standford Sentiment Treebank) and the large scale Chinese news title dataset which collected by ourselves. For the benchmark dataset, using our framework can improve 3% classification accuracy. The result we tested on the large scale Chinese news title dataset shows that our framework achieve better result with the increase of the training set size.

随着互联网的发展，每天都会产生数十亿条短信。然而，由于数据的稀疏性，大规模短文本分类的准确率较低。传统的方法是利用外部数据集来丰富文档的表示，解决数据稀疏性问题。但是很难找到与特定文本匹配的外部数据集。在本文中，我们提出了一个不使用外部数据集的框架来解决数据稀疏性问题。我们的框架通过充分利用从训练短文本中学习到的词的语义相似度来处理大规模短文本。首先，我们学习词的分布式表示，并从训练短文本中测量词的语义相似度。然后，我们提出了一种利用词的语义相似度信息丰富文档表示的方法。最后，在此基础上构建分类器。我们在基准数据集(斯坦福情感树库)和我们自己收集的大型中文新闻标题数据集上对我们的框架进行了评估。对于基准数据集，使用我们的框架可以提高3%的分类准确率。我们在大型中文新闻标题数据集上的测试结果表明，随着训练集规模的增加，我们的框架取得了更好的效果。

{"title":"A word distributed representation based framework for large-scale short text classification","authors":"Di Yao, Jingping Bi, Jianhui Huang, Jin Zhu","doi":"10.1109/IJCNN.2015.7280513","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280513","url":null,"abstract":"With the development of internet, there are billions of short texts generated each day. However, the accuracy of large scale short text classification is poor due to the data sparseness. Traditional methods used to use external dataset to enrich the representation of document and solve the data sparsity problem. But external dataset which matches the specific short texts is hard to find. In this paper, we propose a framework to solve the data sparsity problem without using external dataset. Our framework deal with large scale short text by making the most of semantic similarity of words which learned from the training short texts. First, we learn word distributed representation and measure the word semantic similarity from the training short texts. Then, we propose a method which enrich the document representation by using the word semantic similarity information. At last, we build classifiers based on the enriched representation. We evaluate our framework on both the benchmark dataset(Standford Sentiment Treebank) and the large scale Chinese news title dataset which collected by ourselves. For the benchmark dataset, using our framework can improve 3% classification accuracy. The result we tested on the large scale Chinese news title dataset shows that our framework achieve better result with the increase of the training set size.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"28 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84870870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

Controllability of multi-level states in memristive device models using a transistor as current compliance during SET operation 在SET操作期间，使用晶体管作为电流遵从性的忆阻器件模型中多级状态的可控性

2015 International Joint Conference on Neural Networks (IJCNN)

Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280745

A. Siemon, S. Menzel, R. Waser, E. Linn

Redox-based resistive switching devices are an emerging class of non-volatile ultra-scalable memory and logic devices. These devices offer complex internal device physics leading to rich dynamical behavior. Memristive device models are intended to reproduce the underlying redox-based resistive switching device behavior accurately to enable proper circuit simulations. A specific feature of resistively switching devices is the controllability of multi-level resistive states by using a current compliance during the SET operation. Here, we consider a one-transistor-one-resistive-switch circuit to study the multi-level capability of three different types of memristive models. The feasibility of current compliance induced multi-level resistance state control is a check for the accuracy of the memristive device model.

基于氧化还原的电阻开关器件是一类新兴的非易失性超可扩展存储和逻辑器件。这些设备提供复杂的内部设备物理导致丰富的动态行为。忆阻器件模型旨在准确地再现基于氧化还原的电阻开关器件的底层行为，以实现适当的电路模拟。阻性开关器件的一个特殊特性是在SET操作期间通过使用电流遵从性来控制多级阻性状态。在这里，我们考虑一晶体管一电阻开关电路来研究三种不同类型的记忆模型的多电平能力。电流顺应性诱导多级电阻状态控制的可行性是对忆阻器件模型准确性的检验。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2015 International Joint Conference on Neural Networks (IJCNN)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀