首页 > 最新文献

2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)最新文献

英文 中文
On-line Thai handwritten character recognition using hidden Markov model and fuzzy logic 基于隐马尔可夫模型和模糊逻辑的在线泰文手写字符识别
R. Budsayaplakorn, W. Asdornwised, S. Jitapunkul
This paper presents a new on-line recognition of Thai handwritten characters. Active researches in Thai handwritten character recognition are converged into two distinct methods, HMM and fuzzy logic classifier. The former showed poor recognition rate due to Thai fuzzy characters. The shortcoming of the latter is on difficulties in establishing the set of rules to cover a whole handwriting styles. Our method is proposed to exploit the better of two worlds (HMM and distinctive feature based fuzzy classifier). The experimental result was shown an average recognition rate is improved from 89.1%(using HMM) to 91.2 using our proposed method.
本文提出了一种新的泰文手写字符在线识别方法。泰文手写体识别的研究主要集中在HMM和模糊逻辑分类器两种不同的方法上。前者由于特征模糊,识别率较低。后者的缺点是难以建立一套规则来涵盖整个书写风格。我们提出的方法是利用两个世界(HMM和基于显著特征的模糊分类器)的优点。实验结果表明,采用隐马尔可夫模型的平均识别率从89.1%提高到91.2。
{"title":"On-line Thai handwritten character recognition using hidden Markov model and fuzzy logic","authors":"R. Budsayaplakorn, W. Asdornwised, S. Jitapunkul","doi":"10.1109/NNSP.2003.1318053","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318053","url":null,"abstract":"This paper presents a new on-line recognition of Thai handwritten characters. Active researches in Thai handwritten character recognition are converged into two distinct methods, HMM and fuzzy logic classifier. The former showed poor recognition rate due to Thai fuzzy characters. The shortcoming of the latter is on difficulties in establishing the set of rules to cover a whole handwriting styles. Our method is proposed to exploit the better of two worlds (HMM and distinctive feature based fuzzy classifier). The experimental result was shown an average recognition rate is improved from 89.1%(using HMM) to 91.2 using our proposed method.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124653424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Variational expectation-maximization training for Gaussian networks 高斯网络的变分期望最大化训练
N. Nasios, A. Bors
This paper introduces variational expectation-maximization (VEM) algorithm for training Gaussian networks. Hyperparameters model distributions of parameters characterizing Gaussian mixture densities. The proposed algorithm employs a hierarchical learning strategy for estimating a set of hyperparameters and the number of Gaussian mixture components. A dual EM algorithm is employed as the initialization stage in the VEM-based learning. In the first stage the EM algorithm is applied on the given data set while the second stage EM is used on distributions of parameters resulted from several runs of the first stage EM. Appropriate maximum log-likelihood estimators are considered for all the parameter distributions involved.
介绍了一种训练高斯网络的变分期望最大化(VEM)算法。表征高斯混合密度的参数的超参数模型分布。该算法采用一种分层学习策略来估计一组超参数和高斯混合分量的数量。采用双EM算法作为初始化阶段进行基于EM的学习。在第一阶段,EM算法应用于给定的数据集,而第二阶段EM用于处理由第一阶段EM多次运行产生的参数分布。对于所涉及的所有参数分布,都考虑适当的最大对数似然估计。
{"title":"Variational expectation-maximization training for Gaussian networks","authors":"N. Nasios, A. Bors","doi":"10.1109/NNSP.2003.1318033","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318033","url":null,"abstract":"This paper introduces variational expectation-maximization (VEM) algorithm for training Gaussian networks. Hyperparameters model distributions of parameters characterizing Gaussian mixture densities. The proposed algorithm employs a hierarchical learning strategy for estimating a set of hyperparameters and the number of Gaussian mixture components. A dual EM algorithm is employed as the initialization stage in the VEM-based learning. In the first stage the EM algorithm is applied on the given data set while the second stage EM is used on distributions of parameters resulted from several runs of the first stage EM. Appropriate maximum log-likelihood estimators are considered for all the parameter distributions involved.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133657732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An effective reject rule for reliability improvement in bank note neuro-classifiers 一种提高钞票神经分类器可靠性的有效拒绝规则
A. Ahmadi, S. Omatu, T. Kosaka
In this paper the reliability of bank note neuro-classifiers is investigated and a reject rule is proposed on the basis of probability density function of the input data. The reliability of classification is evaluated through two parameters, which are associated with the winning class probability and the second maximal probability. Then a threshold value is considered to reject the unreliable classifications. As for modeling the non-linear correlation among the data variables and extracting the features, a local principal components analysis (PCA) is applied. The method is tested with a learning vector quantization (LVQ) classifier using 3,600 data samples of various bills of US dollar. The results show that by taking a suitable reject threshold value and also a proper number of regions for the local PCA, the reliability of the system can be improved significantly.
本文研究了钞票神经分类器的可靠性,提出了一种基于输入数据的概率密度函数的拒绝规则。通过两个参数来评估分类的可靠性,这两个参数与获胜类别概率和第二次最大概率有关。然后考虑一个阈值来拒绝不可靠分类。为了对数据变量之间的非线性相关性进行建模并提取特征,采用了局部主成分分析(PCA)。用学习向量量化(LVQ)分类器对该方法进行了测试,使用了3600个不同面额美元的数据样本。结果表明,选取合适的拒绝阈值和适当的区域数进行局部主成分分析,可以显著提高系统的可靠性。
{"title":"An effective reject rule for reliability improvement in bank note neuro-classifiers","authors":"A. Ahmadi, S. Omatu, T. Kosaka","doi":"10.1109/NNSP.2003.1318050","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318050","url":null,"abstract":"In this paper the reliability of bank note neuro-classifiers is investigated and a reject rule is proposed on the basis of probability density function of the input data. The reliability of classification is evaluated through two parameters, which are associated with the winning class probability and the second maximal probability. Then a threshold value is considered to reject the unreliable classifications. As for modeling the non-linear correlation among the data variables and extracting the features, a local principal components analysis (PCA) is applied. The method is tested with a learning vector quantization (LVQ) classifier using 3,600 data samples of various bills of US dollar. The results show that by taking a suitable reject threshold value and also a proper number of regions for the local PCA, the reliability of the system can be improved significantly.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115815581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Subspaces of text discrimination with application to biological literature 文本子空间辨析及其在生物文献中的应用
Mahesan Suwannaroj, M. Niranjan
This paper is about the application of statistical pattern recognition techniques to the classification of text with the objective of retrieving documents relevant for the construction of gene networks. We start from the usual practice of representing a document, electronically available abstracts of scientific papers in this case, as a high dimensional vector of term of occurrences. We consider the problem of retrieving documents corresponding to the metabolic pathway of the organism yeast, Saccharomyces Cerevisiae, using a trained classifier as filter. We use support vector machines (SVMs) as classifiers and compare techniques for reducing the dimensionality of the problem: latent semantic kernels (LSK) and sequential forward selection (SFS). In order to deal with the issue of having only a small set of accurately labelled documents, we used the approach of transductive inference. In this case, LSK leads to a subspace formed as a linear combination of features (terms in the lexicon) while SFS selects a subset of the dimension. We find, for this problem, that the discriminant information appears to lie in a subspace, which is very small in dimensionality compared to that of the original formulation. By matching against the gene ontology (GO) database, we further find that the selection process (SFS) picks out the discriminant terms that are of biological significance for this problem.
本文研究了统计模式识别技术在文本分类中的应用,目的是检索与基因网络构建相关的文档。我们从表示文档的通常做法开始,在这种情况下,电子科学论文的摘要作为出现次数的高维向量。我们考虑的问题,检索文件对应的有机体酵母的代谢途径,酿酒酵母,使用训练分类器作为过滤器。我们使用支持向量机(svm)作为分类器,并比较了降低问题维数的技术:潜在语义核(LSK)和顺序前向选择(SFS)。为了处理只有一小部分准确标记的文档的问题,我们使用了转换推理的方法。在这种情况下,LSK生成的子空间是特征(词汇表中的术语)的线性组合,而SFS则选择维度的子集。我们发现,对于这个问题,判别信息似乎位于子空间中,与原始公式相比,该子空间的维数非常小。通过与基因本体(GO)数据库的匹配,我们进一步发现选择过程(SFS)挑选出对该问题具有生物学意义的判别项。
{"title":"Subspaces of text discrimination with application to biological literature","authors":"Mahesan Suwannaroj, M. Niranjan","doi":"10.1109/NNSP.2003.1317999","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1317999","url":null,"abstract":"This paper is about the application of statistical pattern recognition techniques to the classification of text with the objective of retrieving documents relevant for the construction of gene networks. We start from the usual practice of representing a document, electronically available abstracts of scientific papers in this case, as a high dimensional vector of term of occurrences. We consider the problem of retrieving documents corresponding to the metabolic pathway of the organism yeast, Saccharomyces Cerevisiae, using a trained classifier as filter. We use support vector machines (SVMs) as classifiers and compare techniques for reducing the dimensionality of the problem: latent semantic kernels (LSK) and sequential forward selection (SFS). In order to deal with the issue of having only a small set of accurately labelled documents, we used the approach of transductive inference. In this case, LSK leads to a subspace formed as a linear combination of features (terms in the lexicon) while SFS selects a subset of the dimension. We find, for this problem, that the discriminant information appears to lie in a subspace, which is very small in dimensionality compared to that of the original formulation. By matching against the gene ontology (GO) database, we further find that the selection process (SFS) picks out the discriminant terms that are of biological significance for this problem.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125181206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Information cut and information forces for clustering 聚类的信息切割和信息力
R. Jenssen, J. Príncipe, T. Eltoft
We define an information-theoretic divergence measure between probability density functions (pdfs) that has a deep connection to the cut in graph-theory. This connection is revealed when the pdfs are estimated by the Parzen method with a Gaussian kernel. We refer to our divergence measure as the information cut. The information cut provides us with a theoretically sound criterion for cluster evaluation. In this paper we show that it can be used to merge clusters. The initial clusters are obtained based on the related concept of information forces. We create directed trees by selecting the predecessor of a node (pattern) according to the direction of the information force acting on the pattern. Each directed tree corresponds to a cluster, hence enabling us to obtain an initial partitioning of the data set. Subsequently, we utilize the information cut as a cluster evaluation function to merge clusters until the predefined number of clusters is reached. We demonstrate the performance of our novel information-theoretic clustering method when applied to both artificially created data and real data, with encouraging results.
我们定义了一个与图论中的切有密切联系的概率密度函数(pdf)之间的信息理论散度度量。当使用带有高斯核的Parzen方法对pdf进行估计时,就会发现这种联系。我们把散度度量称为信息切割。信息切割为聚类评价提供了理论上合理的标准。在本文中,我们证明了它可以用于聚类合并。初始聚类是基于信息力的相关概念得到的。我们通过根据作用在模式上的信息力的方向选择节点(模式)的前身来创建有向树。每个有向树对应一个簇,从而使我们能够获得数据集的初始分区。随后,我们利用信息切割作为聚类评估函数来合并聚类,直到达到预定义的聚类数量。我们展示了我们的新型信息论聚类方法在应用于人工创建的数据和真实数据时的性能,并取得了令人鼓舞的结果。
{"title":"Information cut and information forces for clustering","authors":"R. Jenssen, J. Príncipe, T. Eltoft","doi":"10.1109/NNSP.2003.1318045","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318045","url":null,"abstract":"We define an information-theoretic divergence measure between probability density functions (pdfs) that has a deep connection to the cut in graph-theory. This connection is revealed when the pdfs are estimated by the Parzen method with a Gaussian kernel. We refer to our divergence measure as the information cut. The information cut provides us with a theoretically sound criterion for cluster evaluation. In this paper we show that it can be used to merge clusters. The initial clusters are obtained based on the related concept of information forces. We create directed trees by selecting the predecessor of a node (pattern) according to the direction of the information force acting on the pattern. Each directed tree corresponds to a cluster, hence enabling us to obtain an initial partitioning of the data set. Subsequently, we utilize the information cut as a cluster evaluation function to merge clusters until the predefined number of clusters is reached. We demonstrate the performance of our novel information-theoretic clustering method when applied to both artificially created data and real data, with encouraging results.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"11 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126319684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Loopy belief propagation and probabilistic image processing 循环信念传播与概率图像处理
Kazuyuki Tanaka, Jun-ichi Inoue, D. Titterington
Estimation of hyperparameters by maximization of the marginal likelihood in probabilistic image processing is investigated by using the cluster variation method. The algorithms are substantially equivalent to generalized loopy belief propagation.
利用聚类变分法研究了概率图像处理中边际似然最大化的超参数估计问题。该算法实质上等价于广义循环信念传播。
{"title":"Loopy belief propagation and probabilistic image processing","authors":"Kazuyuki Tanaka, Jun-ichi Inoue, D. Titterington","doi":"10.1109/NNSP.2003.1318032","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318032","url":null,"abstract":"Estimation of hyperparameters by maximization of the marginal likelihood in probabilistic image processing is investigated by using the cluster variation method. The algorithms are substantially equivalent to generalized loopy belief propagation.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125967696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Separate-variable adaptive combination of LMS adaptive filters for plant identification LMS自适应滤波器在植物识别中的分离变量自适应组合
J. Arenas-García, V. Gómez-Verdejo, M. Martínez‐Ramón, A. Figueiras-Vidal
The Least Mean Square (LMS) algorithm has become a very popular algorithm for adaptive filtering due to its robustness and simplicity. An adaptive convex combination of one fast a one slow LMS filters has been previously proposed for plant identification, as a way to break the speed vs precision compromise inherent to LMS filters. In this paper, an improved version of this combination method is presented. Instead of using a global mixing parameter, the new algorithm uses a different combination parameter for each weight of the adaptive filter, what gives some advantage when identifying varying plants where some of the coefficients remain unaltered, or when the input process is colored. Some simulation examples show the validity of this approach when compared with the one-parameter combination scheme and with a different multi-step approach.
最小均方算法(LMS)以其鲁棒性和简单性成为一种非常流行的自适应滤波算法。一快一慢LMS滤波器的自适应凸组合已经被提出用于植物识别,作为一种打破LMS滤波器固有的速度与精度折衷的方法。本文提出了这种组合方法的改进版本。新算法没有使用全局混合参数,而是为自适应滤波器的每个权重使用不同的组合参数,这在识别某些系数保持不变的不同植物或输入过程被着色时提供了一些优势。仿真实例表明,该方法与单参数组合方案和不同的多步骤组合方案相比是有效的。
{"title":"Separate-variable adaptive combination of LMS adaptive filters for plant identification","authors":"J. Arenas-García, V. Gómez-Verdejo, M. Martínez‐Ramón, A. Figueiras-Vidal","doi":"10.1109/NNSP.2003.1318023","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318023","url":null,"abstract":"The Least Mean Square (LMS) algorithm has become a very popular algorithm for adaptive filtering due to its robustness and simplicity. An adaptive convex combination of one fast a one slow LMS filters has been previously proposed for plant identification, as a way to break the speed vs precision compromise inherent to LMS filters. In this paper, an improved version of this combination method is presented. Instead of using a global mixing parameter, the new algorithm uses a different combination parameter for each weight of the adaptive filter, what gives some advantage when identifying varying plants where some of the coefficients remain unaltered, or when the input process is colored. Some simulation examples show the validity of this approach when compared with the one-parameter combination scheme and with a different multi-step approach.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"28 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132118552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
An on-line algorithm for blind source extraction based on nonlinear prediction approach 基于非线性预测方法的在线盲源提取算法
D. Mandic, A. Cichocki, U. Manmontri
A gradient descent based on-line algorithm for blind source extraction (BSE) of instantaneous signal mixtures is proposed. This algorithm is derived by utilising a nonlinear adaptive filter in a structure that consists of an extraction and prediction module. By exploiting the predictability property of a signal from the mixture, source signals are extracted based on the order of the nonlinear adaptive predictor. To improve the convergence of the basic algorithm, it is further globally normalised based on the minimisation of the a posteriori prediction error. Next, the algorithm is made fully adaptive to compensate for the independence and other assumptions in its derivation. Two examples are presented to illustrate the performance of the algorithms.
提出了一种基于梯度下降的瞬时混合信号在线盲源提取算法。该算法是在由提取和预测模块组成的结构中利用非线性自适应滤波器推导出来的。利用混合信号的可预测性,根据非线性自适应预测器的阶数提取源信号。为了提高基本算法的收敛性,在最小化后验预测误差的基础上进一步进行全局归一化。其次,对该算法进行了充分的自适应,以补偿其推导过程中的独立性和其他假设。给出了两个实例来说明算法的性能。
{"title":"An on-line algorithm for blind source extraction based on nonlinear prediction approach","authors":"D. Mandic, A. Cichocki, U. Manmontri","doi":"10.1109/NNSP.2003.1318042","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318042","url":null,"abstract":"A gradient descent based on-line algorithm for blind source extraction (BSE) of instantaneous signal mixtures is proposed. This algorithm is derived by utilising a nonlinear adaptive filter in a structure that consists of an extraction and prediction module. By exploiting the predictability property of a signal from the mixture, source signals are extracted based on the order of the nonlinear adaptive predictor. To improve the convergence of the basic algorithm, it is further globally normalised based on the minimisation of the a posteriori prediction error. Next, the algorithm is made fully adaptive to compensate for the independence and other assumptions in its derivation. Two examples are presented to illustrate the performance of the algorithms.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116579181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A neural network method to improve prediction of protein-protein interaction sites in heterocomplexes 一种改进异质复合物中蛋白质-蛋白质相互作用位点预测的神经网络方法
P. Fariselli, A. Zauli, I. Rossi, M. Finelli, P. Martelli, R. Casadio
In this paper we describe an algorithm, based on neural networks that adds to the previously published results (ISPRED, www.biocomp.unibo.it) and increases the predictive performance of protein-protein interaction sites in protein structures. The goal is to reduce the number of spurious assignment and developing knowledge based computational approach to focus on clusters of predicted residues on the protein surface. The algorithm is based on neural networks and can be used to highlight putative interacting patches with high reliability, as indicated when tested on known complexes in the PDB. When a smoothing algorithm correlates the network outputs, the accuracy in identifying the interaction patches increases from 73% up 76%. The reliability of the prediction is also increased by the application the smoothing procedure.
在本文中,我们描述了一种基于神经网络的算法,该算法增加了先前发表的结果(ISPRED, www.biocomp.unibo.it),并提高了蛋白质结构中蛋白质-蛋白质相互作用位点的预测性能。目标是减少虚假分配的数量,并开发基于知识的计算方法来关注蛋白质表面预测残基的簇。该算法基于神经网络,可用于突出具有高可靠性的假定相互作用斑块,如在PDB中已知复合物上测试时所示。当一个平滑算法将网络输出关联起来时,识别交互补丁的准确率从73%提高到76%。采用平滑处理,提高了预测的可靠性。
{"title":"A neural network method to improve prediction of protein-protein interaction sites in heterocomplexes","authors":"P. Fariselli, A. Zauli, I. Rossi, M. Finelli, P. Martelli, R. Casadio","doi":"10.1109/NNSP.2003.1318002","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318002","url":null,"abstract":"In this paper we describe an algorithm, based on neural networks that adds to the previously published results (ISPRED, www.biocomp.unibo.it) and increases the predictive performance of protein-protein interaction sites in protein structures. The goal is to reduce the number of spurious assignment and developing knowledge based computational approach to focus on clusters of predicted residues on the protein surface. The algorithm is based on neural networks and can be used to highlight putative interacting patches with high reliability, as indicated when tested on known complexes in the PDB. When a smoothing algorithm correlates the network outputs, the accuracy in identifying the interaction patches increases from 73% up 76%. The reliability of the prediction is also increased by the application the smoothing procedure.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114237138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Protein secondary structure prediction with ICA feature extraction 基于ICA特征提取的蛋白质二级结构预测
J. Melo, George D. C. Cavalcanti, K. Guimaraes
An original application of the independent component analysis (ICA) is presented in this work. This linear transformation method is used for feature extraction for a machine learning approach to the protein secondary structure prediction problem. PSI-blast profiles, built on NCBI's nonredundant protein database, have their dimensionality reduced through ICA method. The resulting components are used as input data to three artificial neural networks with 30, 35 or 40 nodes in the hidden layer. Those classifiers are trained with the RPROP algorithm and five rules are used for the combination of their outputs. The results achieved are compared with the best ones recently obtained in similar conditions, including experiments using principal component analysis (PCA) as feature extraction method, presenting the best result. The performance of each network individually achieved a Q/sub 3/ accuracy of 74.1% on average, using only 120 independent components. When the networks are combined with the product rule the performance achieved is 75.2%. This result is overcome only when the raw data are informed to the networks, when an accuracy of 75.9% is achieved.
本文提出了独立分量分析(ICA)的一个原始应用。该线性变换方法用于蛋白质二级结构预测问题的机器学习方法的特征提取。在NCBI的非冗余蛋白数据库中建立PSI-blast序列,通过ICA方法对其进行降维。所得到的组件被用作三个人工神经网络的输入数据,这些网络在隐藏层中有30、35或40个节点。这些分类器使用RPROP算法进行训练,并使用五个规则来组合它们的输出。将所获得的结果与近年来在相似条件下的最佳结果进行了比较,包括采用主成分分析(PCA)作为特征提取方法的实验,得到了最佳结果。在仅使用120个独立组件的情况下,每个网络的性能达到了平均74.1%的Q/sub /精度。当网络与乘积法则相结合时,性能达到75.2%。只有当原始数据被告知网络时,这个结果才会被克服,准确率达到75.9%。
{"title":"Protein secondary structure prediction with ICA feature extraction","authors":"J. Melo, George D. C. Cavalcanti, K. Guimaraes","doi":"10.1109/NNSP.2003.1318000","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318000","url":null,"abstract":"An original application of the independent component analysis (ICA) is presented in this work. This linear transformation method is used for feature extraction for a machine learning approach to the protein secondary structure prediction problem. PSI-blast profiles, built on NCBI's nonredundant protein database, have their dimensionality reduced through ICA method. The resulting components are used as input data to three artificial neural networks with 30, 35 or 40 nodes in the hidden layer. Those classifiers are trained with the RPROP algorithm and five rules are used for the combination of their outputs. The results achieved are compared with the best ones recently obtained in similar conditions, including experiments using principal component analysis (PCA) as feature extraction method, presenting the best result. The performance of each network individually achieved a Q/sub 3/ accuracy of 74.1% on average, using only 120 independent components. When the networks are combined with the product rule the performance achieved is 75.2%. This result is overcome only when the raw data are informed to the networks, when an accuracy of 75.9% is achieved.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117165181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1