首页 > 最新文献

The 2011 International Joint Conference on Neural Networks最新文献

英文 中文
The effects of feedback and lateral connections on perceptual processing: A study using oscillatory networks 反馈和横向连接对知觉加工的影响:一项使用振荡网络的研究
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033357
A. R. Rao, G. Cecchi
We model neural dynamical behavior during object perception using the principle of sparse coding in multilayer oscillatory networks. The network model consists of units with amplitude and phase variables, and allows the propagation of higher-level information to lower levels via feedback connections. We show that this model can replicate findings in the neuroscience literature, where measurements have shown that neurons in lower level visual areas respond in a delayed fashion to missing contours of whole objects. We contrast the behavior of feedback connections with that of lateral connections by selectively disabling these in our model to examine their contributions to object perception. This paper successfully extends the previously reported capabilities of oscillatory networks by applying them to model perceptual tasks.
我们利用多层振荡网络中的稀疏编码原理对物体感知过程中的神经动力学行为进行建模。网络模型由具有振幅和相位变量的单元组成,并允许通过反馈连接将高层信息传播到低层。我们表明,这个模型可以复制神经科学文献中的发现,在这些文献中,测量表明,较低水平视觉区域的神经元对整个物体缺失的轮廓的反应延迟。我们将反馈连接的行为与横向连接的行为进行对比,在我们的模型中选择性地禁用这些连接,以检查它们对物体感知的贡献。本文通过将振荡网络应用于感知任务模型,成功地扩展了先前报道的振荡网络的能力。
{"title":"The effects of feedback and lateral connections on perceptual processing: A study using oscillatory networks","authors":"A. R. Rao, G. Cecchi","doi":"10.1109/IJCNN.2011.6033357","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033357","url":null,"abstract":"We model neural dynamical behavior during object perception using the principle of sparse coding in multilayer oscillatory networks. The network model consists of units with amplitude and phase variables, and allows the propagation of higher-level information to lower levels via feedback connections. We show that this model can replicate findings in the neuroscience literature, where measurements have shown that neurons in lower level visual areas respond in a delayed fashion to missing contours of whole objects. We contrast the behavior of feedback connections with that of lateral connections by selectively disabling these in our model to examine their contributions to object perception. This paper successfully extends the previously reported capabilities of oscillatory networks by applying them to model perceptual tasks.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124866918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
RANSAC algorithm with sequential probability ratio test for robust training of feed-forward neural networks 基于序列概率比检验的RANSAC算法用于前馈神经网络的鲁棒训练
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033653
M. El-Melegy
This paper addresses the problem of fitting a functional model to data corrupted with outliers using a multilayered feed-forward neural network (MFNN). Almost all previous efforts to solve this problem have focused on using a training algorithm that minimizes an M-estimator based error criterion. However the robustness gained from M-estimators is still low. Using a training algorithm based on the RANdom SAmple Consensus (RANSAC) framework improves significantly the robustness of the algorithm. However the algorithm typically requires prolonged period of time before a final solution is reached. In this paper, we propose a new strategy to improve the time performance of the RANSAC algorithm for training MFNNs. A statistical pre-test based on Wald's sequential probability ratio test (SPRT) is performed on each randomly generated sample to decide whether it deserves to be used for model estimation. The proposed algorithm is evaluated on synthetic data, contaminated with varying degrees of outliers, and have demonstrated faster performance compared to the original RANSAC algorithm with no significant sacrifice of the robustness.
本文利用多层前馈神经网络(MFNN)解决了对被异常值破坏的数据进行函数模型拟合的问题。几乎所有以前解决这个问题的努力都集中在使用最小化基于m估计器的误差准则的训练算法上。然而,从m估计得到的鲁棒性仍然很低。使用基于随机样本一致性(RANSAC)框架的训练算法显著提高了算法的鲁棒性。然而,该算法通常需要较长时间才能达到最终解决方案。在本文中,我们提出了一种新的策略来提高RANSAC算法在训练mfnn时的时间性能。对每个随机生成的样本进行基于Wald序列概率比检验(SPRT)的统计预检验,以确定其是否值得用于模型估计。该算法在被不同程度的异常值污染的合成数据上进行了评估,与原始RANSAC算法相比,在没有显著牺牲鲁棒性的情况下,证明了更快的性能。
{"title":"RANSAC algorithm with sequential probability ratio test for robust training of feed-forward neural networks","authors":"M. El-Melegy","doi":"10.1109/IJCNN.2011.6033653","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033653","url":null,"abstract":"This paper addresses the problem of fitting a functional model to data corrupted with outliers using a multilayered feed-forward neural network (MFNN). Almost all previous efforts to solve this problem have focused on using a training algorithm that minimizes an M-estimator based error criterion. However the robustness gained from M-estimators is still low. Using a training algorithm based on the RANdom SAmple Consensus (RANSAC) framework improves significantly the robustness of the algorithm. However the algorithm typically requires prolonged period of time before a final solution is reached. In this paper, we propose a new strategy to improve the time performance of the RANSAC algorithm for training MFNNs. A statistical pre-test based on Wald's sequential probability ratio test (SPRT) is performed on each randomly generated sample to decide whether it deserves to be used for model estimation. The proposed algorithm is evaluated on synthetic data, contaminated with varying degrees of outliers, and have demonstrated faster performance compared to the original RANSAC algorithm with no significant sacrifice of the robustness.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125009023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Ant Colony Optimization Changing the Rate of Dull Ants and its application to QAP 蚁群优化改变迟钝蚁率及其在QAP中的应用
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033592
Shozo Shimomura, H. Matsushita, Y. Nishio
In our previous study, we have proposed an Ant Colony Optimization with Intelligent and Dull Ants (IDACO) which contains two kinds of ants. We have applied IDACO to various Traveling Salesman Problems (TSPs) and confirmed its effectiveness. This study proposes an Ant Colony Optimization Changing the Rate of Dull Ants (IDACO-CR) and its Application to Quadratic Assignment Problems (QAPs). In addition to the existence of the dull ants which cannot trail the pheromone, the rate of dull ants in IDACO-CR is changed flexibly and automatically in the simulation, depending on the problem. We investigate the behavior of IDACO-CR in detail and the effect of changing the rate of dull ants. Simulation results show that IDACO-CR gets out from the local optima by changing the rate of dull ants, and we confirm that IDACO-CR obtains the effective results in solving complex optimization problems.
在我们之前的研究中,我们提出了一种包含两种蚂蚁的智能和迟钝蚂蚁的蚁群优化算法(IDACO)。我们将IDACO应用于各种旅行商问题(tsp),并证实了它的有效性。提出了一种改变迟钝蚁率的蚁群优化算法(IDACO-CR)及其在二次分配问题(qap)中的应用。除了存在不能跟踪信息素的迟钝蚂蚁外,在模拟中,IDACO-CR中迟钝蚂蚁的比率根据问题灵活自动地变化。我们详细研究了IDACO-CR的行为以及改变迟钝蚂蚁率的效果。仿真结果表明,IDACO-CR算法通过改变钝蚁的速度使算法从局部最优状态中解脱出来,验证了IDACO-CR算法在解决复杂优化问题时的有效性。
{"title":"Ant Colony Optimization Changing the Rate of Dull Ants and its application to QAP","authors":"Shozo Shimomura, H. Matsushita, Y. Nishio","doi":"10.1109/IJCNN.2011.6033592","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033592","url":null,"abstract":"In our previous study, we have proposed an Ant Colony Optimization with Intelligent and Dull Ants (IDACO) which contains two kinds of ants. We have applied IDACO to various Traveling Salesman Problems (TSPs) and confirmed its effectiveness. This study proposes an Ant Colony Optimization Changing the Rate of Dull Ants (IDACO-CR) and its Application to Quadratic Assignment Problems (QAPs). In addition to the existence of the dull ants which cannot trail the pheromone, the rate of dull ants in IDACO-CR is changed flexibly and automatically in the simulation, depending on the problem. We investigate the behavior of IDACO-CR in detail and the effect of changing the rate of dull ants. Simulation results show that IDACO-CR gets out from the local optima by changing the rate of dull ants, and we confirm that IDACO-CR obtains the effective results in solving complex optimization problems.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125064104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variations to incremental growing neural gas algorithm based on label maximization 基于标签最大化的增量增长神经气体算法的改进
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033326
Jean-Charles Lamirel, Raghvendra Mall, Pascal Cuxac, Ghada Safi
Neural clustering algorithms show high performance in the general context of the analysis of homogeneous textual dataset. This is especially true for the recent adaptive versions of these algorithms, like the incremental growing neural gas algorithm (IGNG) and the labeling maximization based incremental growing neural gas algorithm (IGNG-F). In this paper we highlight that there is a drastic decrease of performance of these algorithms, as well as the one of more classical algorithms, when a heterogeneous textual dataset is considered as an input. Specific quality measures and cluster labeling techniques that are independent of the clustering method are used for the precise performance evaluation. We provide new variations to incremental growing neural gas algorithm exploiting in an incremental way knowledge from clusters about their current labeling along with cluster distance measure data. This solution leads to significant gain in performance for all types of datasets, especially for the clustering of complex heterogeneous textual data.
神经聚类算法在分析同质文本数据集的一般情况下表现出较高的性能。对于这些算法的最新自适应版本来说尤其如此,比如增量增长神经气体算法(IGNG)和基于标记最大化的增量增长神经气体算法(IGNG- f)。在本文中,我们强调当将异构文本数据集作为输入时,这些算法以及更经典的算法的性能会急剧下降。使用独立于聚类方法的特定质量度量和聚类标记技术进行精确的性能评估。我们为增量增长神经气体算法提供了新的变化,以增量的方式利用来自聚类的关于其当前标记的知识以及聚类距离度量数据。这种解决方案可以显著提高所有类型数据集的性能,特别是对于复杂异构文本数据的聚类。
{"title":"Variations to incremental growing neural gas algorithm based on label maximization","authors":"Jean-Charles Lamirel, Raghvendra Mall, Pascal Cuxac, Ghada Safi","doi":"10.1109/IJCNN.2011.6033326","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033326","url":null,"abstract":"Neural clustering algorithms show high performance in the general context of the analysis of homogeneous textual dataset. This is especially true for the recent adaptive versions of these algorithms, like the incremental growing neural gas algorithm (IGNG) and the labeling maximization based incremental growing neural gas algorithm (IGNG-F). In this paper we highlight that there is a drastic decrease of performance of these algorithms, as well as the one of more classical algorithms, when a heterogeneous textual dataset is considered as an input. Specific quality measures and cluster labeling techniques that are independent of the clustering method are used for the precise performance evaluation. We provide new variations to incremental growing neural gas algorithm exploiting in an incremental way knowledge from clusters about their current labeling along with cluster distance measure data. This solution leads to significant gain in performance for all types of datasets, especially for the clustering of complex heterogeneous textual data.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"147 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125867977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Development of a mix-design based Rapid Chloride Permeability assessment model using neuronets 基于混合设计的神经网络氯化物渗透率快速评估模型的开发
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033580
Hakan Yasarer, Y. Najjar
Corrosion of reinforcing steel due to chloride penetration is one of the most common causes of deterioration in concrete pavement structures. On an annual basis, millions of dollars are spent on corrosion-related repairs. High incidence rates and repair costs have stimulated widespread research interests in order to properly assess the durability problem of concrete pavements. Chloride penetration of concrete pavement structures is determined through the Rapid Chloride Permeability test (RCPT), which typically measures the number of coulombs passing through a concrete sample over a period of six hours at a concrete age of 7, 28, and 56 days. In a composite material, such as concrete, the parameters of the mixture design and interaction between them determine the behavior of the material. Previous studies have shown that Artificial Neural Network (ANN) based material modeling approach has been successfully used to capture complex interactions among input and output variables. In this study, back-propagation ANN, and Regression-based permeability response prediction models were developed to assess the permeability potential of various concrete mixes using data obtained from actual Rapid Chloride Permeability tests. The back-propagation ANN learning technique proved to be an efficient method to produce relatively accurate permeability response prediction models. Comparison of the prediction accuracy of the developed ANN models and the regression model proved that the developed ANN model outperformed the regression-based model. The developed ANN models have high predictive capability to properly assess the chloride permeability of concrete mixes based on various mix-design parameters. These models can reliably be used for permeability prediction tasks in order to reduce or eliminate the duration of the testing as well as the sample preparation periods required for proper RCP testing.
氯化物渗透对钢筋的腐蚀是混凝土路面结构恶化的最常见原因之一。每年,数百万美元被花费在与腐蚀有关的维修上。混凝土路面耐久性问题的高发生率和高修复成本引起了广泛的研究兴趣。氯化物对混凝土路面结构的渗透是通过快速氯化物渗透测试(RCPT)来确定的,该测试通常测量在混凝土龄期为7天、28天和56天的6小时内通过混凝土样品的库仑数。在混凝土等复合材料中,混合设计的参数和它们之间的相互作用决定了材料的性能。以往的研究表明,基于人工神经网络(ANN)的材料建模方法已经成功地用于捕获输入和输出变量之间复杂的相互作用。在这项研究中,利用从实际快速氯离子渗透试验中获得的数据,开发了反向传播神经网络和基于回归的渗透响应预测模型,以评估各种混凝土混合料的渗透潜力。事实证明,反向传播人工神经网络学习技术是一种有效的方法,可以产生相对准确的渗透率响应预测模型。将所建立的人工神经网络模型与回归模型的预测精度进行比较,证明所建立的人工神经网络模型的预测精度优于基于回归模型的预测精度。所建立的人工神经网络模型对不同配合比设计参数下混凝土的氯离子渗透性具有较高的预测能力。这些模型可以可靠地用于渗透率预测任务,以减少或消除测试的持续时间以及适当的RCP测试所需的样品准备时间。
{"title":"Development of a mix-design based Rapid Chloride Permeability assessment model using neuronets","authors":"Hakan Yasarer, Y. Najjar","doi":"10.1109/IJCNN.2011.6033580","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033580","url":null,"abstract":"Corrosion of reinforcing steel due to chloride penetration is one of the most common causes of deterioration in concrete pavement structures. On an annual basis, millions of dollars are spent on corrosion-related repairs. High incidence rates and repair costs have stimulated widespread research interests in order to properly assess the durability problem of concrete pavements. Chloride penetration of concrete pavement structures is determined through the Rapid Chloride Permeability test (RCPT), which typically measures the number of coulombs passing through a concrete sample over a period of six hours at a concrete age of 7, 28, and 56 days. In a composite material, such as concrete, the parameters of the mixture design and interaction between them determine the behavior of the material. Previous studies have shown that Artificial Neural Network (ANN) based material modeling approach has been successfully used to capture complex interactions among input and output variables. In this study, back-propagation ANN, and Regression-based permeability response prediction models were developed to assess the permeability potential of various concrete mixes using data obtained from actual Rapid Chloride Permeability tests. The back-propagation ANN learning technique proved to be an efficient method to produce relatively accurate permeability response prediction models. Comparison of the prediction accuracy of the developed ANN models and the regression model proved that the developed ANN model outperformed the regression-based model. The developed ANN models have high predictive capability to properly assess the chloride permeability of concrete mixes based on various mix-design parameters. These models can reliably be used for permeability prediction tasks in order to reduce or eliminate the duration of the testing as well as the sample preparation periods required for proper RCP testing.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126068594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Group lasso regularized multiple kernel learning for heterogeneous feature selection 基于组lasso正则化多核学习的异构特征选择
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033554
Yi-Ren Yeh, Y. Chung, Ting-Chu Lin, Y. Wang
We propose a novel multiple kernel learning (MKL) algorithm with a group lasso regularizer, called group lasso regularized MKL (GL-MKL), for heterogeneous feature selection. We extend the existing MKL algorithm and impose a mixed ℓ1 and ℓ2 norm constraint (known as group lasso) as the regularizer. Our GL-MKL determines the optimal base kernels, including the associated weights and kernel parameters, and results in a compact set of features for comparable or improved recognition performance. The use of our GL-MKL avoids the problem of choosing the proper technique to normalize the feature attributes collected from heterogeneous domains (and thus with different properties and distribution ranges). Our approach does not need to exhaustively search for the entire feature space when performing feature selection like prior sequential-based feature selection methods did, and we do not require any prior knowledge on the optimal size of the feature subset either. Comparisons with existing MKL or sequential-based feature selection methods on a variety of datasets confirm the effectiveness of our method in selecting a compact feature subset for comparable or improved classification performance.
本文提出了一种基于群lasso正则化器的多核学习算法,称为群lasso正则化MKL (GL-MKL),用于异构特征选择。我们扩展了现有的MKL算法,并施加了一个混合的1和2范数约束(称为群lasso)作为正则化器。我们的GL-MKL确定了最优的基本内核,包括相关的权重和内核参数,并产生了一组紧凑的特征,可用于相当或改进的识别性能。使用我们的GL-MKL避免了选择合适的技术来规范化从异构域(因此具有不同的属性和分布范围)收集的特征属性的问题。在进行特征选择时,我们的方法不需要像先前基于序列的特征选择方法那样穷尽搜索整个特征空间,并且我们也不需要任何关于特征子集的最佳大小的先验知识。与现有的MKL或基于序列的特征选择方法在各种数据集上的比较证实了我们的方法在选择紧凑特征子集以获得可比或改进的分类性能方面的有效性。
{"title":"Group lasso regularized multiple kernel learning for heterogeneous feature selection","authors":"Yi-Ren Yeh, Y. Chung, Ting-Chu Lin, Y. Wang","doi":"10.1109/IJCNN.2011.6033554","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033554","url":null,"abstract":"We propose a novel multiple kernel learning (MKL) algorithm with a group lasso regularizer, called group lasso regularized MKL (GL-MKL), for heterogeneous feature selection. We extend the existing MKL algorithm and impose a mixed ℓ1 and ℓ2 norm constraint (known as group lasso) as the regularizer. Our GL-MKL determines the optimal base kernels, including the associated weights and kernel parameters, and results in a compact set of features for comparable or improved recognition performance. The use of our GL-MKL avoids the problem of choosing the proper technique to normalize the feature attributes collected from heterogeneous domains (and thus with different properties and distribution ranges). Our approach does not need to exhaustively search for the entire feature space when performing feature selection like prior sequential-based feature selection methods did, and we do not require any prior knowledge on the optimal size of the feature subset either. Comparisons with existing MKL or sequential-based feature selection methods on a variety of datasets confirm the effectiveness of our method in selecting a compact feature subset for comparable or improved classification performance.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123261262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Advances on criteria for biological plausibility in artificial neural networks: Think of learning processes 人工神经网络生物合理性标准的研究进展:以学习过程为例
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033387
Alberione Braz da Silva, J. Rosa
Artificial neural network (ANN) community is engaged in biological plausibility issues these days. Different views about this subject can lead to disagreements of classification criteria among ANN researchers. In order to contribute to this debate, two of these views are highlighted here: one is related directly to the cerebral cortex biological structure, and the other focuses the neural features and the signaling between neurons. The model proposed in this paper considers that a biologically more plausible ANN has the purpose to create a more faithful model concerning the biological structure, properties, and functionalities, including learning processes, of the cerebral cortex, not disregarding its computational efficiency. The choice of the models upon which the proposed description is based takes into account two main criteria: the fact they are considered biologically more realistic and the fact they deal with intra and inter-neuron signaling in electrical and chemical synapses. Also, the duration of action potentials is taken into account. In addition to the characteristics for encoding information regarding biological plausibility present in current spiking neuron models, a distinguishable feature is emphasized here: a combination of Hebbian learning and error-driven learning.
近年来,人工神经网络(ANN)学界一直致力于研究生物的合理性问题。对这一主题的不同看法会导致人工神经网络研究者对分类标准的分歧。为了促成这场争论,这里强调了两种观点:一种直接与大脑皮层生物结构有关,另一种关注神经特征和神经元之间的信号传导。本文提出的模型认为,一个生物学上更合理的人工神经网络的目的是创建一个关于大脑皮层的生物结构、特性和功能(包括学习过程)的更忠实的模型,而不是忽视其计算效率。所提出的描述所基于的模型的选择考虑了两个主要标准:它们被认为在生物学上更现实,并且它们处理电信号和化学突触中的神经元内和神经元间信号。同时,动作电位的持续时间也被考虑在内。除了当前尖峰神经元模型中存在的关于生物合理性的编码信息的特征外,这里还强调了一个可区分的特征:Hebbian学习和错误驱动学习的结合。
{"title":"Advances on criteria for biological plausibility in artificial neural networks: Think of learning processes","authors":"Alberione Braz da Silva, J. Rosa","doi":"10.1109/IJCNN.2011.6033387","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033387","url":null,"abstract":"Artificial neural network (ANN) community is engaged in biological plausibility issues these days. Different views about this subject can lead to disagreements of classification criteria among ANN researchers. In order to contribute to this debate, two of these views are highlighted here: one is related directly to the cerebral cortex biological structure, and the other focuses the neural features and the signaling between neurons. The model proposed in this paper considers that a biologically more plausible ANN has the purpose to create a more faithful model concerning the biological structure, properties, and functionalities, including learning processes, of the cerebral cortex, not disregarding its computational efficiency. The choice of the models upon which the proposed description is based takes into account two main criteria: the fact they are considered biologically more realistic and the fact they deal with intra and inter-neuron signaling in electrical and chemical synapses. Also, the duration of action potentials is taken into account. In addition to the characteristics for encoding information regarding biological plausibility present in current spiking neuron models, a distinguishable feature is emphasized here: a combination of Hebbian learning and error-driven learning.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123543839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Selective adjustment of rotationally-asymmetric neuron σ-widths 旋转不对称神经元σ-宽度的选择性调节
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033392
Nathan Rose
Radial Basis Networks are a reliable and efficient tool for performing classification tasks. In networks that include a Gaussian output transform within the Pattern Layer neurons, the method of setting the σ-width of the Gaussian curve is critical to obtaining accurate classification. Many existing methods perform poorly in regions of the problem space between examples of differing classes, or when there is overlap between classes in the data set. A method is proposed to produce unique σ values for each weight of every neuron, resulting in each neuron having its own Gaussian ‘coverage’ area within problem space. This method achieves better results than the alternatives on data sets with a significant amount of overlap and when the data is unscaled.
径向基网络是一种可靠、高效的分类工具。在模式层神经元中包含高斯输出变换的网络中,设置高斯曲线的σ-宽度的方法是获得准确分类的关键。许多现有的方法在不同类别的例子之间的问题空间区域或数据集中的类别之间存在重叠时表现不佳。提出了一种为每个神经元的每个权值产生唯一σ值的方法,从而使每个神经元在问题空间中具有自己的高斯“覆盖”区域。该方法在具有大量重叠的数据集和未缩放的数据集上取得了比其他方法更好的结果。
{"title":"Selective adjustment of rotationally-asymmetric neuron σ-widths","authors":"Nathan Rose","doi":"10.1109/IJCNN.2011.6033392","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033392","url":null,"abstract":"Radial Basis Networks are a reliable and efficient tool for performing classification tasks. In networks that include a Gaussian output transform within the Pattern Layer neurons, the method of setting the σ-width of the Gaussian curve is critical to obtaining accurate classification. Many existing methods perform poorly in regions of the problem space between examples of differing classes, or when there is overlap between classes in the data set. A method is proposed to produce unique σ values for each weight of every neuron, resulting in each neuron having its own Gaussian ‘coverage’ area within problem space. This method achieves better results than the alternatives on data sets with a significant amount of overlap and when the data is unscaled.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123672715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of electric power consumption for commercial buildings 商业建筑用电量预测
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033285
V. Cherkassky, S. Chowdhury, Volker Landenberger, Saurabh Tewari, P. Bursch
Currently many commercial buildings are not continuously monitored for energy consumption, especially small buildings which constitute 90% of all such buildings. However, readily available data from the electric meters can be used for monitoring and analyzing energy consumption. Efficient utilization of available historical data (from these meters) can potentially improve energy efficiency, help to identify common energy wasting problems, and, in the future, enable various Smart Grid programs, such as demand response, real-time pricing etc. This paper describes application of computational intelligence techniques for prediction of electric power consumption. The proposed approach combines regression and clustering methods, in order to improve the prediction accuracy of power consumption, as a function of time (of the day) and temperature, using real-life data from several commercial and government buildings. Empirical comparisons show that the proposed approach provides an improvement over the currently used bin-based method for modeling power consumption.
目前,许多商业建筑没有对能耗进行持续监测,尤其是小型建筑,占所有此类建筑的90%。然而,电表上现成的数据可以用来监测和分析能源消耗。有效利用可用的历史数据(来自这些电表)可以潜在地提高能源效率,帮助识别常见的能源浪费问题,并在未来实现各种智能电网计划,如需求响应、实时定价等。本文介绍了计算智能技术在电力消耗预测中的应用。该方法结合了回归和聚类方法,以提高电力消耗的预测精度,作为时间(一天)和温度的函数,使用来自几个商业和政府建筑的实际数据。实证比较表明,该方法比目前使用的基于bin的功耗建模方法有了改进。
{"title":"Prediction of electric power consumption for commercial buildings","authors":"V. Cherkassky, S. Chowdhury, Volker Landenberger, Saurabh Tewari, P. Bursch","doi":"10.1109/IJCNN.2011.6033285","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033285","url":null,"abstract":"Currently many commercial buildings are not continuously monitored for energy consumption, especially small buildings which constitute 90% of all such buildings. However, readily available data from the electric meters can be used for monitoring and analyzing energy consumption. Efficient utilization of available historical data (from these meters) can potentially improve energy efficiency, help to identify common energy wasting problems, and, in the future, enable various Smart Grid programs, such as demand response, real-time pricing etc. This paper describes application of computational intelligence techniques for prediction of electric power consumption. The proposed approach combines regression and clustering methods, in order to improve the prediction accuracy of power consumption, as a function of time (of the day) and temperature, using real-life data from several commercial and government buildings. Empirical comparisons show that the proposed approach provides an improvement over the currently used bin-based method for modeling power consumption.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123674517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Traffic sign recognition with multi-scale Convolutional Networks 基于多尺度卷积网络的交通标志识别
Pub Date : 2011-10-03 DOI: 10.1109/IJCNN.2011.6033589
P. Sermanet, Yann LeCun
We apply Convolutional Networks (ConvNets) to the task of traffic sign classification as part of the GTSRB competition. ConvNets are biologically-inspired multi-stage architectures that automatically learn hierarchies of invariant features. While many popular vision approaches use hand-crafted features such as HOG or SIFT, ConvNets learn features at every level from data that are tuned to the task at hand. The traditional ConvNet architecture was modified by feeding 1st stage features in addition to 2nd stage features to the classifier. The system yielded the 2nd-best accuracy of 98.97% during phase I of the competition (the best entry obtained 98.98%), above the human performance of 98.81%, using 32×32 color input images. Experiments conducted after phase 1 produced a new record of 99.17% by increasing the network capacity, and by using greyscale images instead of color. Interestingly, random features still yielded competitive results (97.33%).
作为GTSRB竞赛的一部分,我们将卷积网络(ConvNets)应用于交通标志分类任务。卷积神经网络是受生物学启发的多阶段架构,可以自动学习不变特征的层次结构。虽然许多流行的视觉方法使用手工制作的特征,如HOG或SIFT,但卷积神经网络从调整到手头任务的数据中学习每个级别的特征。对传统的卷积神经网络结构进行了改进,在向分类器输入第二阶段特征的基础上再输入第一阶段特征。使用32×32彩色输入图像,该系统在第一阶段的比赛中获得了98.97%的第二高准确率(最佳参赛作品获得98.98%),高于人类98.81%的表现。在第一阶段之后进行的实验通过增加网络容量,并使用灰度图像代替彩色图像,产生了99.17%的新记录。有趣的是,随机特征仍然产生竞争性结果(97.33%)。
{"title":"Traffic sign recognition with multi-scale Convolutional Networks","authors":"P. Sermanet, Yann LeCun","doi":"10.1109/IJCNN.2011.6033589","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033589","url":null,"abstract":"We apply Convolutional Networks (ConvNets) to the task of traffic sign classification as part of the GTSRB competition. ConvNets are biologically-inspired multi-stage architectures that automatically learn hierarchies of invariant features. While many popular vision approaches use hand-crafted features such as HOG or SIFT, ConvNets learn features at every level from data that are tuned to the task at hand. The traditional ConvNet architecture was modified by feeding 1st stage features in addition to 2nd stage features to the classifier. The system yielded the 2nd-best accuracy of 98.97% during phase I of the competition (the best entry obtained 98.98%), above the human performance of 98.81%, using 32×32 color input images. Experiments conducted after phase 1 produced a new record of 99.17% by increasing the network capacity, and by using greyscale images instead of color. Interestingly, random features still yielded competitive results (97.33%).","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114958133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 729
期刊
The 2011 International Joint Conference on Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1