首页 > 最新文献

2011 10th International Conference on Machine Learning and Applications and Workshops最新文献

英文 中文
Mood Classfication from Musical Audio Using User Group-Dependent Models 使用用户组相关模型对音乐音频进行情绪分类
Kyogu Lee, Minsuk Cho
In this paper, we propose a music mood classification system that reflects a user's profile based on a belief that music mood perception is subjective and can vary depending on the user's profile such as age or gender. To this end, we first define a set of generic mood descriptors. Secondly, we make up several user profiles according to the age and gender. We then obtain musical items, for each group, to separately train the statistical models. Using the two different user models, we verify our hypothesis that the user profiles play an important role in mood perception by showing that both models achieve higher classification accuracy when the test data and the mood model are of the same kind. Applying our system to automatic play list generation, we also demonstrate that considering the difference between the user groups in mood perception has a significant effect in computing music similarity.
在本文中,我们提出了一个音乐情绪分类系统,该系统反映了用户的个人资料,基于这样一种信念,即音乐情绪感知是主观的,可以根据用户的个人资料(如年龄或性别)而变化。为此,我们首先定义一组通用的情绪描述符。其次,我们根据用户的年龄和性别,建立了多个用户档案。然后,我们为每个组获取音乐项目,分别训练统计模型。使用两种不同的用户模型,我们验证了我们的假设,即用户档案在情绪感知中发挥重要作用,表明当测试数据和情绪模型相同时,两种模型都具有更高的分类精度。将我们的系统应用于自动播放列表生成,我们还证明了考虑用户群体在情绪感知方面的差异对计算音乐相似度有显着影响。
{"title":"Mood Classfication from Musical Audio Using User Group-Dependent Models","authors":"Kyogu Lee, Minsuk Cho","doi":"10.1109/ICMLA.2011.96","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.96","url":null,"abstract":"In this paper, we propose a music mood classification system that reflects a user's profile based on a belief that music mood perception is subjective and can vary depending on the user's profile such as age or gender. To this end, we first define a set of generic mood descriptors. Secondly, we make up several user profiles according to the age and gender. We then obtain musical items, for each group, to separately train the statistical models. Using the two different user models, we verify our hypothesis that the user profiles play an important role in mood perception by showing that both models achieve higher classification accuracy when the test data and the mood model are of the same kind. Applying our system to automatic play list generation, we also demonstrate that considering the difference between the user groups in mood perception has a significant effect in computing music similarity.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115165129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Building an Ensemble of Probabilistic Classifiers for Lung Nodule Interpretation 构建肺结节解释的概率分类器集合
D. Zinovev, J. Furst, D. Raicu
When examining Computed Tomography (CT) scans of lungs for potential abnormalities, radiologists make use of lung nodule's semantic characteristics during the analysis. Computer-Aided Diagnostic Characterization (CADc) systems can act as an aid - predicting ratings of these semantic characteristics to aid radiologists in evaluating the nodule and potentially improve the quality and consistency of diagnosis. In our work, we propose a system for predicting the distribution of radiologists' opinions using a probabilistic multi-class classification approach based on combination of belief decision trees and ADABoost ensemble learning approach. To train and test our system we use the National Cancer Institute (NCI) Lung Image Database Consortium (LIDC) dataset, which includes semantic annotations by up to four radiologists for each one of the 914 nodules. Furthermore, we evaluate our probabilistic multi-class classifications using a novel distance-threshold curve technique intended for assessing the performance of uncertain classification systems. We conclude that for the majority of semantic characteristics there exists a set of parameters that significantly improves the performance of the ensemble over the single classifier.
当检查计算机断层扫描(CT)肺部的潜在异常时,放射科医生在分析过程中使用肺结节的语义特征。计算机辅助诊断表征(CADc)系统可以作为辅助预测这些语义特征的评级,以帮助放射科医生评估结节,并有可能提高诊断的质量和一致性。在我们的工作中,我们提出了一个基于信念决策树和ADABoost集成学习方法相结合的概率多类分类方法来预测放射科医生意见分布的系统。为了训练和测试我们的系统,我们使用了国家癌症研究所(NCI)肺图像数据库联盟(LIDC)数据集,其中包括多达四名放射科医生对914个结节中的每个结节的语义注释。此外,我们使用一种新的距离阈值曲线技术来评估我们的概率多类分类,该技术旨在评估不确定分类系统的性能。我们得出的结论是,对于大多数语义特征,存在一组参数,这些参数可以显着提高集成在单个分类器上的性能。
{"title":"Building an Ensemble of Probabilistic Classifiers for Lung Nodule Interpretation","authors":"D. Zinovev, J. Furst, D. Raicu","doi":"10.1109/ICMLA.2011.44","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.44","url":null,"abstract":"When examining Computed Tomography (CT) scans of lungs for potential abnormalities, radiologists make use of lung nodule's semantic characteristics during the analysis. Computer-Aided Diagnostic Characterization (CADc) systems can act as an aid - predicting ratings of these semantic characteristics to aid radiologists in evaluating the nodule and potentially improve the quality and consistency of diagnosis. In our work, we propose a system for predicting the distribution of radiologists' opinions using a probabilistic multi-class classification approach based on combination of belief decision trees and ADABoost ensemble learning approach. To train and test our system we use the National Cancer Institute (NCI) Lung Image Database Consortium (LIDC) dataset, which includes semantic annotations by up to four radiologists for each one of the 914 nodules. Furthermore, we evaluate our probabilistic multi-class classifications using a novel distance-threshold curve technique intended for assessing the performance of uncertain classification systems. We conclude that for the majority of semantic characteristics there exists a set of parameters that significantly improves the performance of the ensemble over the single classifier.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117113950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Mobile Robot Self-Localization Based on Omnidirectional Vision and Gaussian Models 基于全向视觉和高斯模型的移动机器人自定位
Diego Campos-Sobrino, Francisco Coral-Sabido, Martha Varguez-Moo, Víctor Uc Cetina, A. Espinosa-Romero
We present preliminary results derived from our project on robot self localization using omni directional images. In our approach, features are generated through the computation of covariance matrices that capture important patterns that relates changes in pixel intensities. The learning models used are Mixture of Gaussians and Gaussian Discriminant Analysis. The first method is used initially to test the viability of our feature vectors, and at the same time provides useful information about a natural way of clustering the images in our traning set. Once we determined a reliable set of features, we generated the Gaussian discriminant functions. We show promising experimental results obtained with the Pioneer P3-DX robot in the hallways of the School of Mathematics at the Yucatan Autonomous University in Mexico.
我们提出了从我们的项目中获得的初步结果,该项目使用全方位图像进行机器人自我定位。在我们的方法中,特征是通过计算协方差矩阵生成的,协方差矩阵捕获与像素强度变化相关的重要模式。使用的学习模型是混合高斯和高斯判别分析。第一种方法最初用于测试我们的特征向量的可行性,同时提供有关训练集中图像聚类的自然方法的有用信息。一旦我们确定了一组可靠的特征,我们就生成了高斯判别函数。我们展示了先锋P3-DX机器人在墨西哥尤卡坦自治大学数学学院的走廊上获得的有希望的实验结果。
{"title":"Mobile Robot Self-Localization Based on Omnidirectional Vision and Gaussian Models","authors":"Diego Campos-Sobrino, Francisco Coral-Sabido, Martha Varguez-Moo, Víctor Uc Cetina, A. Espinosa-Romero","doi":"10.1109/ICMLA.2011.95","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.95","url":null,"abstract":"We present preliminary results derived from our project on robot self localization using omni directional images. In our approach, features are generated through the computation of covariance matrices that capture important patterns that relates changes in pixel intensities. The learning models used are Mixture of Gaussians and Gaussian Discriminant Analysis. The first method is used initially to test the viability of our feature vectors, and at the same time provides useful information about a natural way of clustering the images in our traning set. Once we determined a reliable set of features, we generated the Gaussian discriminant functions. We show promising experimental results obtained with the Pioneer P3-DX robot in the hallways of the School of Mathematics at the Yucatan Autonomous University in Mexico.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125883801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning to Rank Using Markov Random Fields 使用马尔可夫随机场学习排序
Antonino Freno, Tiziano Papini, Michelangelo Diligenti
Learning to rank from examples is an important task in modern Information Retrieval systems like Web search engines, where the large number of available features makes hard to manually devise high-performing ranking functions. This paper presents a novel approach to learning-to-rank, which can natively integrate any target metric with no modifications. The target metric is optimized via maximum-likelihood estimation of a probability distribution over the ranks, which are assumed to follow a Boltzmann distribution. Unlike other approaches in the literature like BoltzRank, this approach does not rely on maximizing the expected value of the target score as a proxy of the optimization of target metric. This has both theoretical and performance advantages as the expected value can not be computed both accurately and efficiently. Furthermore, our model employs the pseudo-likelihood as an accurate surrogate of the likelihood to avoid to explicitly compute the normalization factor of the Boltzmann distribution, which is intractable in this context. The experimental results show that the approach provides state-of-the-art results on various benchmarks and on a dataset built from the logs of a commercial search engine.
在现代信息检索系统(如Web搜索引擎)中,从示例中学习排序是一项重要任务,其中大量可用的特性使得很难手动设计高性能的排序功能。本文提出了一种新的学习排序方法,该方法可以在不修改的情况下对任意目标度量进行自然积分。目标度量通过对秩的概率分布的最大似然估计来优化,假设秩遵循玻尔兹曼分布。与BoltzRank等文献中的其他方法不同,该方法不依赖于将目标分数的期望值最大化作为目标指标优化的代理。由于期望值不能准确有效地计算,因此具有理论和性能上的优点。此外,我们的模型采用伪似然作为似然的精确代理,避免了在这种情况下难以明确计算玻尔兹曼分布的归一化因子。实验结果表明,该方法在各种基准测试和基于商业搜索引擎日志构建的数据集上提供了最先进的结果。
{"title":"Learning to Rank Using Markov Random Fields","authors":"Antonino Freno, Tiziano Papini, Michelangelo Diligenti","doi":"10.1109/ICMLA.2011.157","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.157","url":null,"abstract":"Learning to rank from examples is an important task in modern Information Retrieval systems like Web search engines, where the large number of available features makes hard to manually devise high-performing ranking functions. This paper presents a novel approach to learning-to-rank, which can natively integrate any target metric with no modifications. The target metric is optimized via maximum-likelihood estimation of a probability distribution over the ranks, which are assumed to follow a Boltzmann distribution. Unlike other approaches in the literature like BoltzRank, this approach does not rely on maximizing the expected value of the target score as a proxy of the optimization of target metric. This has both theoretical and performance advantages as the expected value can not be computed both accurately and efficiently. Furthermore, our model employs the pseudo-likelihood as an accurate surrogate of the likelihood to avoid to explicitly compute the normalization factor of the Boltzmann distribution, which is intractable in this context. The experimental results show that the approach provides state-of-the-art results on various benchmarks and on a dataset built from the logs of a commercial search engine.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122330806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Large Margin Classifier Based on Hyperdisks 基于hyperdisk的大间距分类器
Hakan Cevikalp
This paper introduces a binary large margin classifier that approximates each class with an hyper disk constructed from its training samples. For any pair of classes approximated with hyper disks, there is a corresponding linear separating hyper plane that maximizes the margin between them, and this can be found by solving a convex program that finds the closest pair of points on the hyper disks. More precisely, the best separating hyper plane is chosen to be the one that is orthogonal to the line segment connecting the closest points on the hyper disks and at the same time bisects the line. The method is extended to the nonlinear case by using the kernel trick, and the multi-class classification problems are dealt with constructing and combining several binary classifiers as in Support Vector Machine (SVM) classifier. The experiments on several databases show that the proposed method compares favorably to other popular large margin classifiers.
本文介绍了一种二元大边距分类器,该分类器用每个类的训练样本构造一个超磁盘来逼近每个类。对于任何用超磁盘近似的类对,都存在一个相应的线性分离超平面,使它们之间的边界最大化,这可以通过求解一个凸程序来找到超磁盘上最近的点对来找到。更准确地说,最佳分离超平面是与连接超圆盘上最近点的线段正交并同时与直线平分的平面。利用核技巧将该方法扩展到非线性情况,并像支持向量机(SVM)分类器那样构造和组合多个二分类器来处理多类分类问题。在多个数据库上的实验表明,该方法优于其他常用的大余量分类器。
{"title":"Large Margin Classifier Based on Hyperdisks","authors":"Hakan Cevikalp","doi":"10.1109/ICMLA.2011.86","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.86","url":null,"abstract":"This paper introduces a binary large margin classifier that approximates each class with an hyper disk constructed from its training samples. For any pair of classes approximated with hyper disks, there is a corresponding linear separating hyper plane that maximizes the margin between them, and this can be found by solving a convex program that finds the closest pair of points on the hyper disks. More precisely, the best separating hyper plane is chosen to be the one that is orthogonal to the line segment connecting the closest points on the hyper disks and at the same time bisects the line. The method is extended to the nonlinear case by using the kernel trick, and the multi-class classification problems are dealt with constructing and combining several binary classifiers as in Support Vector Machine (SVM) classifier. The experiments on several databases show that the proposed method compares favorably to other popular large margin classifiers.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128334773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Introducing Flow Field Forecasting 流场预测简介
Michael Frey, Kyle A. Caudle
A machine learning methodology, called flow field forecasting, is proposed for statistically predicting the future of a univariate time series. Flow field forecasting draws information from the interpolated flow field of an observed time series to build a forecast step-by-step. Flow field forecasting is presented with examples, a discussion of its properties relative to other common forecasting techniques, and a statistical error analysis.
提出了一种称为流场预测的机器学习方法,用于统计预测单变量时间序列的未来。流场预测从观测时间序列的插值流场中提取信息,逐步建立预测。给出了流场预测的实例,讨论了流场预测与其他常用预测技术的特性,并进行了统计误差分析。
{"title":"Introducing Flow Field Forecasting","authors":"Michael Frey, Kyle A. Caudle","doi":"10.1109/ICMLA.2011.82","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.82","url":null,"abstract":"A machine learning methodology, called flow field forecasting, is proposed for statistically predicting the future of a univariate time series. Flow field forecasting draws information from the interpolated flow field of an observed time series to build a forecast step-by-step. Flow field forecasting is presented with examples, a discussion of its properties relative to other common forecasting techniques, and a statistical error analysis.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"332 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124675691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Charge Prediction of Lipid Fragments in Mass Spectrometry 质谱法中脂质碎片电荷预测
B. Schrom, L. Kangas, Bojana Ginovska-Pangovska, T. Metz, John H. Miller
An artificial neural network is developed for predicting which fragment is charged and which fragment is neutral for lipid fragment pairs produced from a liquid chromatography tandem mass spectrometry simulation process. This charge predictor is integrated into software developed at PNNL for in silico spectra generation and identification of metabolites known as Met ISIS. To test the effect of including charge prediction in Met ISIS, 46 lipids are used which show a reduction in false positive identifications when the charge predictor is utilized.
针对液相色谱串联质谱模拟过程中产生的脂质片段对,建立了一种人工神经网络,用于预测哪个片段带电,哪个片段中性。该电荷预测器集成到PNNL开发的软件中,用于生成硅光谱和鉴定称为Met ISIS的代谢物。为了测试在Met ISIS中包括电荷预测的效果,使用了46种脂质,当使用电荷预测器时,显示假阳性识别的减少。
{"title":"Charge Prediction of Lipid Fragments in Mass Spectrometry","authors":"B. Schrom, L. Kangas, Bojana Ginovska-Pangovska, T. Metz, John H. Miller","doi":"10.1109/ICMLA.2011.45","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.45","url":null,"abstract":"An artificial neural network is developed for predicting which fragment is charged and which fragment is neutral for lipid fragment pairs produced from a liquid chromatography tandem mass spectrometry simulation process. This charge predictor is integrated into software developed at PNNL for in silico spectra generation and identification of metabolites known as Met ISIS. To test the effect of including charge prediction in Met ISIS, 46 lipids are used which show a reduction in false positive identifications when the charge predictor is utilized.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129869140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Infinite Decision Agent Ensemble Learning System for Credit Risk Analysis 信用风险分析的无限决策代理集成学习系统
Shukai Li, I. Tsang, N. Chaudhari
Considering the special needs of credit risk analysis, the Infinite DEcision Agent ensemble Learning (IDEAL) system is proposed. In the first level of our model, we adopt soft margin boosting to overcome over fitting. In the second level, the RVM algorithm is revised for boosting so that different RVM agents can be generated from the updated instance space of the data. In the third level, the perceptron kernel is employed in RVM to generate infinite subagents. Our IDEAL system also shares some good properties, such as good generalization performance, immunity to over fitting and predicting the distance to default. According to the experimental results, our proposed system can achieve better performance in term of sensitivity, specificity and overall accuracy.
考虑到信用风险分析的特殊需要,提出了无限决策代理集成学习(IDEAL)系统。在模型的第一级,我们采用软边际增强来克服过拟合。在第二级,修改RVM算法以进行提升,以便可以从更新的数据实例空间生成不同的RVM代理。第三层,在RVM中使用感知机内核生成无限个子代理。我们的IDEAL系统还具有良好的泛化性能、抗过拟合能力和预测违约距离等特性。实验结果表明,本文提出的系统在灵敏度、特异度和整体准确度方面都取得了较好的性能。
{"title":"Infinite Decision Agent Ensemble Learning System for Credit Risk Analysis","authors":"Shukai Li, I. Tsang, N. Chaudhari","doi":"10.1109/ICMLA.2011.80","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.80","url":null,"abstract":"Considering the special needs of credit risk analysis, the Infinite DEcision Agent ensemble Learning (IDEAL) system is proposed. In the first level of our model, we adopt soft margin boosting to overcome over fitting. In the second level, the RVM algorithm is revised for boosting so that different RVM agents can be generated from the updated instance space of the data. In the third level, the perceptron kernel is employed in RVM to generate infinite subagents. Our IDEAL system also shares some good properties, such as good generalization performance, immunity to over fitting and predicting the distance to default. According to the experimental results, our proposed system can achieve better performance in term of sensitivity, specificity and overall accuracy.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130571511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
L0-Regularized Parametric Non-negative Factorization for Analyzing Composite Signals l0 -正则化参数非负分解分析复合信号
Takumi Kobayashi, Kenji Watanabe, N. Otsu
Signal sequences are practically observed as composites in which a few number of factor signals are linearly combined with non-negative weights. Based on prior physical knowledge about the target, the factors can be modeled as parametric functions, and their parameter values benefit further analyses. In this paper, we present a novel factorization method for the composite signals in terms of parametric factor functions. The method optimizes both the factor weights and the parameter values in the factor functions. While the parameter values are simply optimized by gradient descent, we propose L0-regularized non-negative least squares (L0-NNLS) for optimizing the factor weights. In L0-NNLS, both L0 regularization and non-negativity constraint are imposed on the weights in the least squares to enhance the sparsity as much as possible. Since so regularized least squares is NPhard, we propose a stepwise forward/backward optimization to efficiently solve it in an approximated manner. Due to the sparsity by the L0-NNLS, the proposed factorization method can automatically discover the inherent number of factor functions as well as the parametric functions themselves by estimating their parameter values. In the experiments on factorization of simulated signals and practical biological signals, the proposed method exhibits favorable performances.
实际上,信号序列是由若干因子信号以非负权重线性组合而成的复合信号。基于对目标的先验物理知识,可以将这些因素建模为参数函数,其参数值有利于进一步分析。本文提出了一种用参数因子函数分解复合信号的新方法。该方法对因子函数中的因子权重和参数值进行了优化。虽然参数值是简单的梯度下降优化,但我们提出了l0正则化非负最小二乘(L0-NNLS)来优化因子权重。在L0- nnls中,对最小二乘中的权值进行L0正则化和非负性约束,以尽可能地增强稀疏性。由于正则化最小二乘是NPhard,我们提出了一个逐步向前/向后优化,以近似的方式有效地解决它。由于L0-NNLS的稀疏性,本文提出的分解方法可以通过估计因子函数的参数值来自动发现因子函数的固有数量以及参数函数本身。在模拟信号和实际生物信号的分解实验中,该方法均表现出良好的性能。
{"title":"L0-Regularized Parametric Non-negative Factorization for Analyzing Composite Signals","authors":"Takumi Kobayashi, Kenji Watanabe, N. Otsu","doi":"10.1109/ICMLA.2011.84","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.84","url":null,"abstract":"Signal sequences are practically observed as composites in which a few number of factor signals are linearly combined with non-negative weights. Based on prior physical knowledge about the target, the factors can be modeled as parametric functions, and their parameter values benefit further analyses. In this paper, we present a novel factorization method for the composite signals in terms of parametric factor functions. The method optimizes both the factor weights and the parameter values in the factor functions. While the parameter values are simply optimized by gradient descent, we propose L0-regularized non-negative least squares (L0-NNLS) for optimizing the factor weights. In L0-NNLS, both L0 regularization and non-negativity constraint are imposed on the weights in the least squares to enhance the sparsity as much as possible. Since so regularized least squares is NPhard, we propose a stepwise forward/backward optimization to efficiently solve it in an approximated manner. Due to the sparsity by the L0-NNLS, the proposed factorization method can automatically discover the inherent number of factor functions as well as the parametric functions themselves by estimating their parameter values. In the experiments on factorization of simulated signals and practical biological signals, the proposed method exhibits favorable performances.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130593115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonlinear RANSAC Optimization for Parameter Estimation with Applications to Phagocyte Transmigration 非线性RANSAC优化参数估计及其在吞噬细胞迁移中的应用
Mingon Kang, Jean X. Gao, Liping Tang
Developing vigorous mathematical models and estimating accurate parameters within feasible computational time are two indispensable parts to build reliable system models for representing biological properties of the system and for producing reliable simulation. For a complex biological system with limited observations, one of the daunting tasks is the large number of unknown parameters in the mathematical modeling whose values directly determine the performance of computational modeling. To tackle this problem, we have developed a data-driven global optimization method, nonlinear RANSAC, based on Random Sample Consensus (a.k.a. RANSAC) method, for parameter estimation of nonlinear system models. Conventional RANSAC method is sound and simple, but it is oriented for linear system models. We not only adopt the strengths of RANSAC, but also extend the method for nonlinear systems with outstanding performance. As a specific application example, we have targeted understanding phagocyte transmigration which is involved in the fibrosis process for biomedical device implantation. With well-defined mathematical nonlinear equations of the system, nonlinear RANSAC is performed for the parameter estimation. Moreover, simulations of the system for propagation prediction over the time are conducted under both normal conditions and knock-out conditions. In order to evaluate the general performance of the method, we also applied the method to signalling pathways where mathematical equations which are representing interaction of proteins are generated using ordinary differential equations as a general format, and public data sets for nonlinear regression evaluation are used to assess its performance.
建立有力的数学模型和在可行的计算时间内估计准确的参数是建立可靠的系统模型以表示系统的生物学特性和进行可靠的仿真所不可缺少的两个部分。对于观测值有限的复杂生物系统,数学建模中存在大量未知参数,这些参数的取值直接决定了计算建模的性能,这是一项艰巨的任务。为了解决这一问题,我们开发了一种数据驱动的全局优化方法——非线性RANSAC,该方法基于随机样本一致性(Random Sample Consensus,又名RANSAC)方法,用于非线性系统模型的参数估计。传统的RANSAC方法简单可靠,但主要面向线性系统模型。我们不仅吸收了RANSAC的优点,而且将该方法推广到具有优异性能的非线性系统。作为一个具体的应用实例,我们有针对性地了解了生物医学器械植入过程中参与纤维化过程的吞噬细胞迁移。根据系统的非线性数学方程,对系统参数进行非线性RANSAC估计。此外,在正常条件和淘汰条件下对系统进行了随时间的传播预测模拟。为了评估该方法的一般性能,我们还将该方法应用于信号通路,其中使用常微分方程作为一般格式生成表示蛋白质相互作用的数学方程,并使用用于非线性回归评估的公共数据集来评估其性能。
{"title":"Nonlinear RANSAC Optimization for Parameter Estimation with Applications to Phagocyte Transmigration","authors":"Mingon Kang, Jean X. Gao, Liping Tang","doi":"10.1109/ICMLA.2011.104","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.104","url":null,"abstract":"Developing vigorous mathematical models and estimating accurate parameters within feasible computational time are two indispensable parts to build reliable system models for representing biological properties of the system and for producing reliable simulation. For a complex biological system with limited observations, one of the daunting tasks is the large number of unknown parameters in the mathematical modeling whose values directly determine the performance of computational modeling. To tackle this problem, we have developed a data-driven global optimization method, nonlinear RANSAC, based on Random Sample Consensus (a.k.a. RANSAC) method, for parameter estimation of nonlinear system models. Conventional RANSAC method is sound and simple, but it is oriented for linear system models. We not only adopt the strengths of RANSAC, but also extend the method for nonlinear systems with outstanding performance. As a specific application example, we have targeted understanding phagocyte transmigration which is involved in the fibrosis process for biomedical device implantation. With well-defined mathematical nonlinear equations of the system, nonlinear RANSAC is performed for the parameter estimation. Moreover, simulations of the system for propagation prediction over the time are conducted under both normal conditions and knock-out conditions. In order to evaluate the general performance of the method, we also applied the method to signalling pathways where mathematical equations which are representing interaction of proteins are generated using ordinary differential equations as a general format, and public data sets for nonlinear regression evaluation are used to assess its performance.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"25 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114114388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
2011 10th International Conference on Machine Learning and Applications and Workshops
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1