首页 > 最新文献

2015 International Joint Conference on Neural Networks (IJCNN)最新文献

英文 中文
Deep learning using partitioned data vectors 使用分区数据向量的深度学习
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280484
B. Mitchell, H. Tosun, John W. Sheppard
Deep learning is a popular field that encompasses a range of multi-layer connectionist techniques. While these techniques have achieved great success on a number of difficult computer vision problems, the representation biases that allow this success have not been thoroughly explored. In this paper, we examine the hypothesis that one strength of many deep learning algorithms is their ability to exploit spatially local statistical information. We present a formal description of how data vectors can be partitioned into sub-vectors that preserve spatially local information. As a test case, we then use statistical models to examine how much of such structure exists in the MNIST dataset. Finally, we present experimental results from training RBMs using partitioned data, and demonstrate the advantages they have over non-partitioned RBMs. Through these results, we show how the performance advantage is reliant on spatially local structure, by demonstrating the performance impact of randomly permuting the input data to destroy local structure. Overall, our results support the hypothesis that a representation bias reliant upon spatially local statistical information can improve performance, so long as this bias is a good match for the data. We also suggest statistical tools for determining a priori whether a dataset is a good match for this bias or not.
深度学习是一个流行的领域,它包含了一系列多层次的连接论技术。虽然这些技术在许多困难的计算机视觉问题上取得了巨大的成功,但允许这一成功的表示偏差尚未得到彻底的探索。在本文中,我们检验了一个假设,即许多深度学习算法的一个优势是它们利用空间局部统计信息的能力。我们提出了一个关于如何将数据向量划分为保留空间局部信息的子向量的正式描述。作为一个测试用例,我们然后使用统计模型来检查MNIST数据集中存在多少这样的结构。最后,我们给出了使用分区数据训练rbm的实验结果,并展示了它们相对于未分区rbm的优势。通过这些结果,我们通过展示随机排列输入数据以破坏局部结构对性能的影响,展示了性能优势如何依赖于空间局部结构。总的来说,我们的结果支持这样一个假设,即依赖于空间局部统计信息的表示偏差可以提高性能,只要这种偏差与数据很好地匹配。我们还建议使用统计工具来先验地确定数据集是否与这种偏差相匹配。
{"title":"Deep learning using partitioned data vectors","authors":"B. Mitchell, H. Tosun, John W. Sheppard","doi":"10.1109/IJCNN.2015.7280484","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280484","url":null,"abstract":"Deep learning is a popular field that encompasses a range of multi-layer connectionist techniques. While these techniques have achieved great success on a number of difficult computer vision problems, the representation biases that allow this success have not been thoroughly explored. In this paper, we examine the hypothesis that one strength of many deep learning algorithms is their ability to exploit spatially local statistical information. We present a formal description of how data vectors can be partitioned into sub-vectors that preserve spatially local information. As a test case, we then use statistical models to examine how much of such structure exists in the MNIST dataset. Finally, we present experimental results from training RBMs using partitioned data, and demonstrate the advantages they have over non-partitioned RBMs. Through these results, we show how the performance advantage is reliant on spatially local structure, by demonstrating the performance impact of randomly permuting the input data to destroy local structure. Overall, our results support the hypothesis that a representation bias reliant upon spatially local statistical information can improve performance, so long as this bias is a good match for the data. We also suggest statistical tools for determining a priori whether a dataset is a good match for this bias or not.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"5 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85534956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Improved human pulse peak estimation using derivative features for noncontact pulse transit time measurements 改进人类脉冲峰值估计的导数特征,用于非接触脉冲传递时间测量
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280486
Mototaka Yoshioka, Kenta Murakami, Jun Ozawa
This paper proposes a method to estimate temporally accurate human pulse peaks for noncontact pulse transit time (PTT) measurements. The PTT is considered as a significant diagnostic index for conditions such as blood pressure and arterial stiffness; however, millisecond-order accuracy is required in the determination of each pulse peak. In this study, human pulse waveforms are obtained from wrist and ankle images taken using a webcam at 90 cm distance. In the proposed method, the waveform is smoothed using finite impulse response low-pass filtering that sustains the shape of the pulse waveform, and the phase delay is compensated. Then, features of the first-order derivative of the filtered waveform are used to estimate the pulse peaks. The interbeat intervals obtained from the peaks estimated by the proposed method closely coincided with those obtained from a contact-type photoplethysmogram sensor, yielding less absolute error than that obtained from a comparative method; thus, this confirms the improved temporal accuracy of the proposed method. The PTTs are calculated from the time differences between the estimated pulse peaks of the wrist and those of the ankle images. The benefit of accurate pulse peak estimation is demonstrated by investigating the relation between the PTT and blood pressure. The PTTs are correlated with blood pressure in ten human participants, and a high correlation coefficient of -0.88 was obtained, indicating a direct relation between these two measures.
本文提出了一种非接触脉冲传递时间(PTT)测量中人体脉冲峰值的时域精确估计方法。PTT被认为是血压和动脉僵硬等疾病的重要诊断指标;然而,在确定每个脉冲峰值时,需要毫秒级的精度。在这项研究中,使用网络摄像头在90厘米距离处拍摄的手腕和脚踝图像获得了人体脉冲波形。在该方法中,采用有限脉冲响应低通滤波对波形进行平滑,以保持脉冲波形的形状,并补偿相位延迟。然后,利用滤波后波形的一阶导数特征估计脉冲峰值。该方法估计的峰值间拍间隔与接触式光容积描记器的峰值间拍间隔非常接近,产生的绝对误差小于比较法;因此,这证实了所提出方法的时间精度的提高。ptt是根据腕部和踝关节图像估计的脉冲峰值之间的时间差来计算的。通过研究PTT和血压之间的关系,证明了准确的脉冲峰值估计的好处。在10名参与者中,ptt与血压相关,获得了-0.88的高相关系数,表明这两个指标之间存在直接关系。
{"title":"Improved human pulse peak estimation using derivative features for noncontact pulse transit time measurements","authors":"Mototaka Yoshioka, Kenta Murakami, Jun Ozawa","doi":"10.1109/IJCNN.2015.7280486","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280486","url":null,"abstract":"This paper proposes a method to estimate temporally accurate human pulse peaks for noncontact pulse transit time (PTT) measurements. The PTT is considered as a significant diagnostic index for conditions such as blood pressure and arterial stiffness; however, millisecond-order accuracy is required in the determination of each pulse peak. In this study, human pulse waveforms are obtained from wrist and ankle images taken using a webcam at 90 cm distance. In the proposed method, the waveform is smoothed using finite impulse response low-pass filtering that sustains the shape of the pulse waveform, and the phase delay is compensated. Then, features of the first-order derivative of the filtered waveform are used to estimate the pulse peaks. The interbeat intervals obtained from the peaks estimated by the proposed method closely coincided with those obtained from a contact-type photoplethysmogram sensor, yielding less absolute error than that obtained from a comparative method; thus, this confirms the improved temporal accuracy of the proposed method. The PTTs are calculated from the time differences between the estimated pulse peaks of the wrist and those of the ankle images. The benefit of accurate pulse peak estimation is demonstrated by investigating the relation between the PTT and blood pressure. The PTTs are correlated with blood pressure in ten human participants, and a high correlation coefficient of -0.88 was obtained, indicating a direct relation between these two measures.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"10 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81962549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Selective potentiality maximization for input neuron selection in self-organizing maps 自组织映射中输入神经元选择的选择电位最大化
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280541
R. Kamimura, Ryozo Kitajima
The present paper proposes a new type of information-theoretic method to enhance the potentiality of input neurons for improving the class structure of the self-organizing maps (SOM). The SOM has received much attention in neural networks, because it can be used to visualize input patterns, in particular, to clarify class structure. However, it has been observed that the good performance of visualization is limited to relatively simple data sets. To visualize more complex data sets, it is needed to develop a method to extract main characteristics of input patterns more explicitly. For this, several information-theoretic methods have been developed with some problems. One of the main problems is that the method needs much heavy computation to obtain the main features, because the computational procedures to obtain information content should be repeated many times. To simplify the procedures, a new measure called “potentiality” of input neurons is proposed. The potentiality is based on the variance of connection weights for input neurons and it can be computed without the complex computation of information content. The method was applied to the artificial and symmetric data set and the biodegradation data from the machine learning database. Experimental results showed that the method could be used to enhance a smaller number of input neurons. Those neurons were effective in intensifying class boundaries for clearer class structures. The present results show the effectiveness of the new measure of the potentiality for improved visualization and class structure.
为了改进自组织映射(SOM)的类结构,本文提出了一种新的增强输入神经元电位的信息论方法。SOM在神经网络中受到了很大的关注,因为它可以用来可视化输入模式,特别是澄清类结构。然而,据观察,可视化的良好性能仅限于相对简单的数据集。为了可视化更复杂的数据集,需要开发一种更明确地提取输入模式主要特征的方法。为此,人们发展了几种信息理论方法,但存在一些问题。该方法的主要问题之一是需要大量的计算来获得主要特征,因为获取信息内容的计算过程需要多次重复。为了简化程序,提出了一种新的测量方法,称为输入神经元的“电位”。电势是基于输入神经元连接权的方差来计算的,不需要复杂的信息量计算。将该方法应用于人工对称数据集和机器学习数据库中的生物降解数据。实验结果表明,该方法可用于增强较少数量的输入神经元。这些神经元有效地强化了阶级界限,使阶级结构更加清晰。目前的结果表明了新测量方法在改进可视化和类结构方面的有效性。
{"title":"Selective potentiality maximization for input neuron selection in self-organizing maps","authors":"R. Kamimura, Ryozo Kitajima","doi":"10.1109/IJCNN.2015.7280541","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280541","url":null,"abstract":"The present paper proposes a new type of information-theoretic method to enhance the potentiality of input neurons for improving the class structure of the self-organizing maps (SOM). The SOM has received much attention in neural networks, because it can be used to visualize input patterns, in particular, to clarify class structure. However, it has been observed that the good performance of visualization is limited to relatively simple data sets. To visualize more complex data sets, it is needed to develop a method to extract main characteristics of input patterns more explicitly. For this, several information-theoretic methods have been developed with some problems. One of the main problems is that the method needs much heavy computation to obtain the main features, because the computational procedures to obtain information content should be repeated many times. To simplify the procedures, a new measure called “potentiality” of input neurons is proposed. The potentiality is based on the variance of connection weights for input neurons and it can be computed without the complex computation of information content. The method was applied to the artificial and symmetric data set and the biodegradation data from the machine learning database. Experimental results showed that the method could be used to enhance a smaller number of input neurons. Those neurons were effective in intensifying class boundaries for clearer class structures. The present results show the effectiveness of the new measure of the potentiality for improved visualization and class structure.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"4 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81997194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Face recognition in unconstrained environments 无约束环境下的人脸识别
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280803
M. Saffar, Banafsheh Rekabdar, S. Louis, M. Nicolescu
This paper investigates three approaches to the problem of identity recognition in real-world unconstrained environments. We describe a new and challenging face recognition dataset captured in a laboratory environment with no strong constraints on lighting, motion, or subject pose, orientation, distance, or facial expression. We then evaluate three approaches to identity recognition on this new dataset. We find that a deep neural network with stacked denoising auto-encoders significantly outperforms a standard feedforward neural network and a baseline eigenfaces approach from the OpenCV library. Despite the 66 million plus parameters in the best trained deep network, it significantly outperforms the other two methods even on the relatively small number (relative to the number of deep network parameters) of 8,895 training samples. We believe our work adds to the growing empirical and theoretical evidence that deep networks provide a promising approach to unconstrained recognition problems.
本文研究了现实世界无约束环境中身份识别问题的三种方法。我们描述了一个在实验室环境中捕获的新的具有挑战性的人脸识别数据集,该数据集对光线、运动或主体姿势、方向、距离或面部表情没有很强的限制。然后,我们在这个新数据集上评估了三种身份识别方法。我们发现具有堆叠去噪自编码器的深度神经网络显著优于标准前馈神经网络和来自OpenCV库的基线特征面方法。尽管在训练最好的深度网络中有6600多万个参数,但即使在相对较少的8,895个训练样本(相对于深度网络参数的数量)上,它也明显优于其他两种方法。我们相信,我们的工作增加了越来越多的经验和理论证据,证明深度网络为无约束识别问题提供了一种有前途的方法。
{"title":"Face recognition in unconstrained environments","authors":"M. Saffar, Banafsheh Rekabdar, S. Louis, M. Nicolescu","doi":"10.1109/IJCNN.2015.7280803","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280803","url":null,"abstract":"This paper investigates three approaches to the problem of identity recognition in real-world unconstrained environments. We describe a new and challenging face recognition dataset captured in a laboratory environment with no strong constraints on lighting, motion, or subject pose, orientation, distance, or facial expression. We then evaluate three approaches to identity recognition on this new dataset. We find that a deep neural network with stacked denoising auto-encoders significantly outperforms a standard feedforward neural network and a baseline eigenfaces approach from the OpenCV library. Despite the 66 million plus parameters in the best trained deep network, it significantly outperforms the other two methods even on the relatively small number (relative to the number of deep network parameters) of 8,895 training samples. We believe our work adds to the growing empirical and theoretical evidence that deep networks provide a promising approach to unconstrained recognition problems.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"5 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82089766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
HMAX model: A survey HMAX模型:一个调查
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280677
Chang Liu, F. Sun
HMAX model is a bio-inspired feedforward architecture for object recognition, which is derived from the simple and complex cells model in cortex proposed by Hubel and Wiesel. As a hierarchical bio-based recognition model, HMAX captures the properties of primate cortex with alternated S layers and C layers, corresponding to simple cells and complex cells respectively. Although constrained by biological factors, HMAX shows satisfying performance in different fields when competing with other state-of-the-art computer vision algorithms. Insightful ideas and methods have been developed for this hierarchical model, which advances the progress of HMAX model. This paper reviews the origin of this model, as well as the improvements and modifications based on this model.
HMAX模型是由Hubel和Wiesel提出的皮层简单细胞和复杂细胞模型衍生而来的一种生物启发的前馈目标识别架构。HMAX是一种分层的基于生物的识别模型,它以交替的S层和C层捕捉灵长类动物皮层的特性,分别对应简单细胞和复杂细胞。尽管受到生物因素的限制,但在与其他最先进的计算机视觉算法竞争时,HMAX在不同领域表现出令人满意的性能。这一层次模型提出了有见地的思想和方法,推动了HMAX模型的发展。本文回顾了该模型的起源,以及在此基础上的改进和修改。
{"title":"HMAX model: A survey","authors":"Chang Liu, F. Sun","doi":"10.1109/IJCNN.2015.7280677","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280677","url":null,"abstract":"HMAX model is a bio-inspired feedforward architecture for object recognition, which is derived from the simple and complex cells model in cortex proposed by Hubel and Wiesel. As a hierarchical bio-based recognition model, HMAX captures the properties of primate cortex with alternated S layers and C layers, corresponding to simple cells and complex cells respectively. Although constrained by biological factors, HMAX shows satisfying performance in different fields when competing with other state-of-the-art computer vision algorithms. Insightful ideas and methods have been developed for this hierarchical model, which advances the progress of HMAX model. This paper reviews the origin of this model, as well as the improvements and modifications based on this model.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"346 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79660481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Interval-valued symbolic representation based method for off-line signature verification 基于区间值符号表示的离线签名验证方法
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280518
S. Pal, Alireza Alaei, U. Pal, M. Blumenstein
The objective of this investigation is to present an interval-symbolic representation based method for offline signature verification. In the feature extraction stage, Connected Components (CC), Enclosed Regions (ER), Basic Features (BF) and Curvelet Feature (CF)-based approaches are used to characterize signatures. Considering the extracted feature vectors, an interval data value is created for each feature extracted from every individual's signatures as an interval-valued symbolic data. This process results in a signature model for each individual that consists of a set of interval values. A similarity measure is proposed as the classifier in this paper. The interval-valued symbolic representation based method has never been used for signature verification considering Indian script signatures. Therefore, to evaluate the proposed method, a Hindi signature database consisting of 2400 (100×24) genuine signatures and 3000 (100×30) skilled forgeries is employed for experimentation. Concerning this large Hindi signature dataset, the highest verification accuracy of 91.83% was obtained on a joint feature set considering all four sets of features, while 2.5%, 13.84% and 8.17% of FAR (False Acceptance Rate), FRR (False Rejection Rate), and AER (Average Error Rate) were achieved, respectively.
本研究的目的是提出一种基于区间符号表示的离线签名验证方法。在特征提取阶段,采用连通成分(CC)、封闭区域(ER)、基本特征(BF)和曲波特征(CF)等方法对签名进行特征提取。考虑提取的特征向量,对从每个个体签名中提取的每个特征创建一个区间数据值作为区间值符号数据。此过程将为每个个体生成一个签名模型,该模型由一组间隔值组成。本文提出了一种相似度测度作为分类器。考虑到印度文字签名,基于区间值符号表示的方法从未用于签名验证。因此,为了评估所提出的方法,使用了一个由2400个(100×24)真实签名和3000个(100×30)熟练伪造签名组成的印地语签名数据库进行实验。在此大型印地语签名数据集上,综合考虑所有四组特征的联合特征集的验证准确率最高,达到91.83%,而FAR (False Acceptance Rate)、FRR (False Rejection Rate)和AER (Average Error Rate)的准确率分别为2.5%、13.84%和8.17%。
{"title":"Interval-valued symbolic representation based method for off-line signature verification","authors":"S. Pal, Alireza Alaei, U. Pal, M. Blumenstein","doi":"10.1109/IJCNN.2015.7280518","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280518","url":null,"abstract":"The objective of this investigation is to present an interval-symbolic representation based method for offline signature verification. In the feature extraction stage, Connected Components (CC), Enclosed Regions (ER), Basic Features (BF) and Curvelet Feature (CF)-based approaches are used to characterize signatures. Considering the extracted feature vectors, an interval data value is created for each feature extracted from every individual's signatures as an interval-valued symbolic data. This process results in a signature model for each individual that consists of a set of interval values. A similarity measure is proposed as the classifier in this paper. The interval-valued symbolic representation based method has never been used for signature verification considering Indian script signatures. Therefore, to evaluate the proposed method, a Hindi signature database consisting of 2400 (100×24) genuine signatures and 3000 (100×30) skilled forgeries is employed for experimentation. Concerning this large Hindi signature dataset, the highest verification accuracy of 91.83% was obtained on a joint feature set considering all four sets of features, while 2.5%, 13.84% and 8.17% of FAR (False Acceptance Rate), FRR (False Rejection Rate), and AER (Average Error Rate) were achieved, respectively.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"29 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84129554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
An efficient hybrid algorithm for fire flame detection 一种有效的火焰检测混合算法
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280590
Seyed Amin Khatami, S. Mirghasemi, A. Khosravi, S. Nahavandi
Proposing efficient methods for fire protection is becoming more and more important, because a small flame of fire may cause huge problems in social safety. In this paper, an effective fire flame detection method is investigated. This fire detection method includes four main stages: in the first step, a linear transformation is applied to convert red, green, and blue (RGB) color space through a 3*3 matrix to a new color space. In the next step, fuzzy c-mean clustering method (FCM) is used to distinguish between fire flame and non-fire flame pixels. Particle Swarm Optimization algorithm (PSO) is also utilized in the last step to decrease the error value measured by FCM after conversion. Finally, we apply Otsu threshold method to the new converted images to make a binary picture. Empirical results show the strength, accuracy and fast-response of the proposed algorithm in detecting fire flames in color images.
提出有效的消防方法变得越来越重要,因为一个小小的火焰可能会造成巨大的社会安全问题。本文研究了一种有效的火灾火焰检测方法。该火灾检测方法包括四个主要阶段:第一步,通过3*3矩阵,应用线性变换将红、绿、蓝(RGB)颜色空间转换为新的颜色空间。下一步,使用模糊c均值聚类方法(FCM)区分火焰和非火焰像素。最后一步采用粒子群优化算法(Particle Swarm Optimization algorithm, PSO)减小FCM变换后测量的误差值。最后,利用Otsu阈值法对转换后的图像进行二值化处理。实验结果表明,该算法在彩色图像火焰检测中具有较强的准确性和快速响应能力。
{"title":"An efficient hybrid algorithm for fire flame detection","authors":"Seyed Amin Khatami, S. Mirghasemi, A. Khosravi, S. Nahavandi","doi":"10.1109/IJCNN.2015.7280590","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280590","url":null,"abstract":"Proposing efficient methods for fire protection is becoming more and more important, because a small flame of fire may cause huge problems in social safety. In this paper, an effective fire flame detection method is investigated. This fire detection method includes four main stages: in the first step, a linear transformation is applied to convert red, green, and blue (RGB) color space through a 3*3 matrix to a new color space. In the next step, fuzzy c-mean clustering method (FCM) is used to distinguish between fire flame and non-fire flame pixels. Particle Swarm Optimization algorithm (PSO) is also utilized in the last step to decrease the error value measured by FCM after conversion. Finally, we apply Otsu threshold method to the new converted images to make a binary picture. Empirical results show the strength, accuracy and fast-response of the proposed algorithm in detecting fire flames in color images.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"29 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84135840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Using regional homogeneity from functional MRI for diagnosis of ASD among males 利用功能性MRI的区域同质性诊断男性ASD
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280562
Vigneshwaran Senthilvel, B. S. Mahanand, S. Sundaram, N. Sundararajan
This paper presents an approach for automatic diagnosis of Autism Spectrum Disorder (ASD) among males using functional Magnetic Resonance Imaging (fMRI). fMRI has the capability to identify any abnormal neural interactions that may be responsible for behavioral symptoms observed in ASD patients. In this paper, the regional homogeneity of the voxels in the 116 regions of the automated anatomical labeling (AAL) atlas of the brain are used as features which result in a large set of 54837 features. Chi-square feature selection method is then used to identify the most significant features and only these features are then used for classification with a metacognitive radial basis function classifier. Since genetic studies have indicated that ASD manifests differently in males and females, a large scale study specific to males is highlighted here using the publicly available preprocessed fMRI dataset from the Autism Brain Imaging Data Exchange (ABIDE), unlike existing studies which are either smaller in scale or consider both males and females together. Among the males, it is shown here that the classification performance can be improved (by up to 10%) by considering adults and adolescents separately. By using Chi-square algorithm the number of features was reduced drastically to lower than 200 in contrast to the thousands of features that have been used in recent studies.
本文介绍了一种应用功能磁共振成像(fMRI)自动诊断男性自闭症谱系障碍(ASD)的方法。功能磁共振成像能够识别任何可能导致ASD患者行为症状的异常神经相互作用。本文利用大脑自动解剖标记图谱(AAL)中116个区域体素的区域均匀性作为特征,得到54837个特征的大集合。然后使用卡方特征选择方法来识别最重要的特征,然后使用元认知径向基函数分类器将这些特征用于分类。由于遗传研究表明,ASD在男性和女性中的表现不同,因此本文强调了一项针对男性的大规模研究,该研究使用了来自自闭症脑成像数据交换(ABIDE)的公开的预处理功能磁共振成像数据集,而不是像现有的研究那样规模较小或同时考虑男性和女性。在雄性中,通过分别考虑成人和青少年,分类性能可以提高(高达10%)。与最近研究中使用的数千个特征相比,通过使用卡方算法,特征的数量大大减少到200个以下。
{"title":"Using regional homogeneity from functional MRI for diagnosis of ASD among males","authors":"Vigneshwaran Senthilvel, B. S. Mahanand, S. Sundaram, N. Sundararajan","doi":"10.1109/IJCNN.2015.7280562","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280562","url":null,"abstract":"This paper presents an approach for automatic diagnosis of Autism Spectrum Disorder (ASD) among males using functional Magnetic Resonance Imaging (fMRI). fMRI has the capability to identify any abnormal neural interactions that may be responsible for behavioral symptoms observed in ASD patients. In this paper, the regional homogeneity of the voxels in the 116 regions of the automated anatomical labeling (AAL) atlas of the brain are used as features which result in a large set of 54837 features. Chi-square feature selection method is then used to identify the most significant features and only these features are then used for classification with a metacognitive radial basis function classifier. Since genetic studies have indicated that ASD manifests differently in males and females, a large scale study specific to males is highlighted here using the publicly available preprocessed fMRI dataset from the Autism Brain Imaging Data Exchange (ABIDE), unlike existing studies which are either smaller in scale or consider both males and females together. Among the males, it is shown here that the classification performance can be improved (by up to 10%) by considering adults and adolescents separately. By using Chi-square algorithm the number of features was reduced drastically to lower than 200 in contrast to the thousands of features that have been used in recent studies.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"40 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84629439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Multi-frequency sinusoidal wave control in a chaotic neural network 混沌神经网络中的多频正弦波控制
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280380
Guoguang He, Chongchong Wang, Xiaoping Xie, Ping Zhu
Brain waves are classified as gamma, beta, alpha, theta, and delta waves to quantify brain activity and can be approximated as sinusoidal waves of different frequencies. In this work, we use sinusoidal waves at two different frequencies to control chaos in a chaotic neural network (CNN) to explore the effect of multi-frequency sinusoidal waves in chaos control. We propose two methods to control chaos. In one, two sinusoidal wave signals are added to different groups of neurons. In the other, a control signal with a mixture of two sinusoidal waves with different frequencies is added to all neurons. The controlling dynamics differ in these two cases. A stable output sequence of the controlled CNN contains only one type of stored pattern and its reversed pattern, which are related to the initial pattern.
脑电波分为γ、β、α、θ和δ波,以量化大脑活动,可以近似为不同频率的正弦波。在本研究中,我们使用两种不同频率的正弦波来控制混沌神经网络(CNN)中的混沌,以探索多频率正弦波在混沌控制中的作用。我们提出了两种控制混沌的方法。其中一种是将两个正弦波信号加到不同的神经元组中。在另一种方法中,将两个不同频率的正弦波混合的控制信号添加到所有神经元中。在这两种情况下,控制动力学是不同的。被控CNN的稳定输出序列只包含一种类型的存储模式及其反向模式,它们与初始模式相关。
{"title":"Multi-frequency sinusoidal wave control in a chaotic neural network","authors":"Guoguang He, Chongchong Wang, Xiaoping Xie, Ping Zhu","doi":"10.1109/IJCNN.2015.7280380","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280380","url":null,"abstract":"Brain waves are classified as gamma, beta, alpha, theta, and delta waves to quantify brain activity and can be approximated as sinusoidal waves of different frequencies. In this work, we use sinusoidal waves at two different frequencies to control chaos in a chaotic neural network (CNN) to explore the effect of multi-frequency sinusoidal waves in chaos control. We propose two methods to control chaos. In one, two sinusoidal wave signals are added to different groups of neurons. In the other, a control signal with a mixture of two sinusoidal waves with different frequencies is added to all neurons. The controlling dynamics differ in these two cases. A stable output sequence of the controlled CNN contains only one type of stored pattern and its reversed pattern, which are related to the initial pattern.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"476 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84691066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Incremental learning on a budget and a quick calculation method using a tree-search algorithm 增量学习的预算和快速计算方法,使用树搜索算法
Pub Date : 2015-07-12 DOI: 10.1109/IJCNN.2015.7280805
Akihisa Kato, Hirohito Kawahara, K. Yamauchi
In this study, a lightweight kernel regression algorithm for embedded systems is proposed. In our previous study, we proposed an online learning method with a limited number of kernels based on a kernel regression model known as a limited general regression neural network (LGRNN). The LGRNN behavior is similar to that of k-nearest neighbors except for its continual interpolation between learned samples. The output of kernel regression to an input is dominant for the closest kernel output. This is in contrast to the output of kernel perceptrons, which is determined by the combination of several nested kernels. This means that the output of a kernel regression model can be lightly weighted by omitting calculations for the other kernels. Therefore, we have to find the closest kernel and its neighbors to the current input vector quickly. To realize this, we introduce a tree-search-based calculation method for LGRNN. In the LGRNN learning method, the kernels are clustered into k groups and organized as tree-structured data for the tree-search algorithm.
本文提出了一种适用于嵌入式系统的轻量级核回归算法。在我们之前的研究中,我们提出了一种基于核回归模型的有限数量核的在线学习方法,称为有限一般回归神经网络(LGRNN)。LGRNN的行为与k近邻相似,只是它在学习样本之间进行连续插值。对于最接近的核输出,对输入的核回归的输出占主导地位。这与内核感知器的输出相反,内核感知器的输出是由几个嵌套内核的组合决定的。这意味着一个核回归模型的输出可以通过省略对其他核的计算来轻微加权。因此,我们必须快速找到距离当前输入向量最近的核及其邻居。为了实现这一点,我们引入了一种基于树搜索的LGRNN计算方法。在LGRNN学习方法中,将核聚类成k组,并组织为树状结构数据,用于树状搜索算法。
{"title":"Incremental learning on a budget and a quick calculation method using a tree-search algorithm","authors":"Akihisa Kato, Hirohito Kawahara, K. Yamauchi","doi":"10.1109/IJCNN.2015.7280805","DOIUrl":"https://doi.org/10.1109/IJCNN.2015.7280805","url":null,"abstract":"In this study, a lightweight kernel regression algorithm for embedded systems is proposed. In our previous study, we proposed an online learning method with a limited number of kernels based on a kernel regression model known as a limited general regression neural network (LGRNN). The LGRNN behavior is similar to that of k-nearest neighbors except for its continual interpolation between learned samples. The output of kernel regression to an input is dominant for the closest kernel output. This is in contrast to the output of kernel perceptrons, which is determined by the combination of several nested kernels. This means that the output of a kernel regression model can be lightly weighted by omitting calculations for the other kernels. Therefore, we have to find the closest kernel and its neighbors to the current input vector quickly. To realize this, we introduce a tree-search-based calculation method for LGRNN. In the LGRNN learning method, the kernels are clustered into k groups and organized as tree-structured data for the tree-search algorithm.","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"6 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80711597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2015 International Joint Conference on Neural Networks (IJCNN)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1