首页 > 最新文献

2012 11th International Conference on Machine Learning and Applications最新文献

英文 中文
Prognosis Based on Handling Drifts in Dynamical Environments: Application to a Wind Turbine Benchmark 动态环境中基于漂移处理的预测:在风力发电机基准测试中的应用
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.131
Antoine Chammas, E. Duviella, S. Lecoeuche
In this paper, we present a prognosis architecture that allows the computation of the Remaining Useful Life (RUL) of a failing process. A process subject to an incipient fault experiments slowly developing degradation. Sensor measurements and Condition Monitoring (CM) data extracted from the system allow to follow up the process drift. The prognosis architecture we propose makes use of a dynamical clustering algorithm to model the data in a feature space. This algorithm uses a sliding window scheme on which the model is iteratively updated. Metrics applied on the parameters of this model are used to compute a drift severity indicator, which is also an indicator of the health of the system. The architecture for prognosis is applied on a benchmark of wind turbine. The used benchmark has been constructed to serve as a realistic wind turbine model. It was used in the context of a global scale fault diagnosis and fault tolerant control competition. The benchmark also proposed a drifting fault scenario that we used to test our approach.
在本文中,我们提出了一个预测体系结构,允许计算失效过程的剩余使用寿命(RUL)。受初期故障影响的过程会逐渐退化。从系统中提取的传感器测量和状态监测(CM)数据允许跟踪过程漂移。我们提出的预测架构利用动态聚类算法在特征空间中对数据建模。该算法采用滑动窗口模式,迭代更新模型。应用于该模型参数的度量用于计算漂移严重性指标,该指标也是系统健康状况的指标。将该预测体系应用于某风力发电机组的基准试验。所使用的基准已被构建为一个现实的风力涡轮机模型。将其应用于全局范围的故障诊断和容错控制竞争。基准测试还提出了一个漂移故障场景,我们用它来测试我们的方法。
{"title":"Prognosis Based on Handling Drifts in Dynamical Environments: Application to a Wind Turbine Benchmark","authors":"Antoine Chammas, E. Duviella, S. Lecoeuche","doi":"10.1109/ICMLA.2012.131","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.131","url":null,"abstract":"In this paper, we present a prognosis architecture that allows the computation of the Remaining Useful Life (RUL) of a failing process. A process subject to an incipient fault experiments slowly developing degradation. Sensor measurements and Condition Monitoring (CM) data extracted from the system allow to follow up the process drift. The prognosis architecture we propose makes use of a dynamical clustering algorithm to model the data in a feature space. This algorithm uses a sliding window scheme on which the model is iteratively updated. Metrics applied on the parameters of this model are used to compute a drift severity indicator, which is also an indicator of the health of the system. The architecture for prognosis is applied on a benchmark of wind turbine. The used benchmark has been constructed to serve as a realistic wind turbine model. It was used in the context of a global scale fault diagnosis and fault tolerant control competition. The benchmark also proposed a drifting fault scenario that we used to test our approach.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128094781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An Efficient Algorithm for top-k Queries on Uncertain Data Streams 不确定数据流上top-k查询的一种高效算法
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.57
Caiyan Dai, Ling Chen, Yixin Chen, Keming Tang
We tackle the problem of answering maximum probabilistic top-k tuple set queries. We use a sliding-window model on uncertain data streams and present an efficient algorithm for processing sliding-window queries on uncertain streams. In each sliding window, the algorithm selects the k tuples with the highest probabilities from sets of different numbers of the tuples with the highest scores. Then, the algorithm computes existential probability of the top-k tuples, and chooses the set with the highest probability as the top-k query result. We theoretically prove the correctness of the algorithm. Our experimental results show that our algorithm requires lower time and space complexity than other existing algorithms.
我们解决了回答最大概率top-k元组集查询的问题。在不确定数据流上使用滑动窗口模型,提出了一种处理不确定数据流上滑动窗口查询的有效算法。在每个滑动窗口中,算法从得分最高的不同数量的元组中选择概率最高的k个元组。然后,算法计算top-k元组的存在概率,选择概率最高的集合作为top-k查询结果。从理论上证明了算法的正确性。实验结果表明,该算法所需的时间和空间复杂度较现有算法低。
{"title":"An Efficient Algorithm for top-k Queries on Uncertain Data Streams","authors":"Caiyan Dai, Ling Chen, Yixin Chen, Keming Tang","doi":"10.1109/ICMLA.2012.57","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.57","url":null,"abstract":"We tackle the problem of answering maximum probabilistic top-k tuple set queries. We use a sliding-window model on uncertain data streams and present an efficient algorithm for processing sliding-window queries on uncertain streams. In each sliding window, the algorithm selects the k tuples with the highest probabilities from sets of different numbers of the tuples with the highest scores. Then, the algorithm computes existential probability of the top-k tuples, and chooses the set with the highest probability as the top-k query result. We theoretically prove the correctness of the algorithm. Our experimental results show that our algorithm requires lower time and space complexity than other existing algorithms.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133318543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multi-atlas Based Image Selection with Label Image Constraint 基于标签图像约束的多图集图像选择
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.232
Yihui Cao, Xuelong Li, Pingkun Yan
Atlas selection plays an important role in multiatlas based image segmentation. In atlas selection methods, manifold learning based techniques have recently emerged as very promisingly. However, due to the complexity of anatomical structures in raw images, it is difficult to get accurate atlas selection results by measuring only the distance between raw images on the manifolds. In this paper, we tackle this problem by proposing a label image constrained atlas selection (LICAS) method to exploit the shape and size information of the regions to be segmented from the label images. Constrained by the label images, a new manifold projection method is developed to help uncover the intrinsic similarity between the regions of interest across images. Compared with other existing methods, the experimental results of segmentation on 60 Magnetic Resonance (MR) images showed that the selected atlases are closer to the target structure and more accurate segmentation can be obtained by using the proposed method.
地图集选择在基于多地图集的图像分割中起着重要的作用。在地图集选择方法中,基于流形学习的技术最近被认为是非常有前途的。然而,由于原始图像中解剖结构的复杂性,仅通过测量原始图像在流形上的距离难以获得准确的图谱选择结果。在本文中,我们通过提出一种标签图像约束图谱选择(LICAS)方法来解决这个问题,该方法利用标签图像中待分割区域的形状和大小信息。在标签图像的约束下,开发了一种新的流形投影方法来帮助揭示图像中感兴趣区域之间的内在相似性。通过对60幅磁共振图像的分割实验,与已有的分割方法进行了比较,结果表明,所选择的地图集更接近目标结构,分割精度更高。
{"title":"Multi-atlas Based Image Selection with Label Image Constraint","authors":"Yihui Cao, Xuelong Li, Pingkun Yan","doi":"10.1109/ICMLA.2012.232","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.232","url":null,"abstract":"Atlas selection plays an important role in multiatlas based image segmentation. In atlas selection methods, manifold learning based techniques have recently emerged as very promisingly. However, due to the complexity of anatomical structures in raw images, it is difficult to get accurate atlas selection results by measuring only the distance between raw images on the manifolds. In this paper, we tackle this problem by proposing a label image constrained atlas selection (LICAS) method to exploit the shape and size information of the regions to be segmented from the label images. Constrained by the label images, a new manifold projection method is developed to help uncover the intrinsic similarity between the regions of interest across images. Compared with other existing methods, the experimental results of segmentation on 60 Magnetic Resonance (MR) images showed that the selected atlases are closer to the target structure and more accurate segmentation can be obtained by using the proposed method.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122631087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Augmented Coupled Dictionary Learning for Image Super-Resolution 图像超分辨率增强耦合字典学习
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.52
M. Rushdi, J. Ho
Recent approaches in image super-resolution suggest learning dictionary pairs to model the relationship between low-resolution and high-resolution image patches with sparsity constraints on the patch representation. Most of the previous approaches in this direction assume for simplicity that the sparse codes for a low-resolution patch are equal to those of the corresponding high-resolution patch. However, this invariance assumption is not quite accurate especially for large scaling factors where the optimal weights and indices of representative features are not fixed across the scaling transformation. In this paper, we propose an augmented coupled dictionary learning scheme that compensates for the inaccuracy of the invariance assumption. First, we learn a dictionary for the low-resolution image space. Then, we compute an augmented dictionary in the high-resolution image space where novel augmented dictionary atoms are inferred from the training error of the low-resolution dictionary. For a low-resolution test image, the sparse codes of the low-resolution patches and the lowresolution dictionary training error are combined with the trained high-resolution dictionary to produce a high-resolution image. Our experimental results compare favourably with the non-augmented scheme.
最近的图像超分辨率方法建议学习字典对来模拟低分辨率和高分辨率图像补丁之间的关系,并对补丁表示进行稀疏性约束。为了简单起见,之前的大多数方法都假设低分辨率补丁的稀疏编码等于相应的高分辨率补丁的稀疏编码。然而,这种不变性假设并不十分准确,特别是对于大型缩放因子,其中代表性特征的最优权重和指标在整个缩放变换中不是固定的。在本文中,我们提出了一种增强型耦合字典学习方案来补偿不变性假设的不准确性。首先,我们学习低分辨率图像空间的字典。然后,我们在高分辨率图像空间中计算增广字典,从低分辨率字典的训练误差中推断出新的增广字典原子。对于低分辨率测试图像,将低分辨率补丁的稀疏编码和低分辨率字典训练误差与训练好的高分辨率字典相结合,生成高分辨率图像。我们的实验结果与非增广方案比较好。
{"title":"Augmented Coupled Dictionary Learning for Image Super-Resolution","authors":"M. Rushdi, J. Ho","doi":"10.1109/ICMLA.2012.52","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.52","url":null,"abstract":"Recent approaches in image super-resolution suggest learning dictionary pairs to model the relationship between low-resolution and high-resolution image patches with sparsity constraints on the patch representation. Most of the previous approaches in this direction assume for simplicity that the sparse codes for a low-resolution patch are equal to those of the corresponding high-resolution patch. However, this invariance assumption is not quite accurate especially for large scaling factors where the optimal weights and indices of representative features are not fixed across the scaling transformation. In this paper, we propose an augmented coupled dictionary learning scheme that compensates for the inaccuracy of the invariance assumption. First, we learn a dictionary for the low-resolution image space. Then, we compute an augmented dictionary in the high-resolution image space where novel augmented dictionary atoms are inferred from the training error of the low-resolution dictionary. For a low-resolution test image, the sparse codes of the low-resolution patches and the lowresolution dictionary training error are combined with the trained high-resolution dictionary to produce a high-resolution image. Our experimental results compare favourably with the non-augmented scheme.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"201 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123017726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Integrating Machine Learning Into a Medical Decision Support System to Address the Problem of Missing Patient Data 将机器学习集成到医疗决策支持系统中,以解决丢失患者数据的问题
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.82
Atif Khan, J. Doucette, R. Cohen, D. Lizotte
In this paper, we present a framework which enables medical decision making in the presence of partial information. At its core is ontology-based automated reasoning, machine learning techniques are integrated to enhance existing patient datasets in order to address the issue of missing data. Our approach supports interoperability between different health information systems. This is clarified in a sample implementation that combines three separate datasets (patient data, drug-drug interactions and drug prescription rules) to demonstrate the effectiveness of our algorithms in producing effective medical decisions. In short, we demonstrate the potential for machine learning to support a task where there is a critical need from medical professionals by coping with missing or noisy patient data and enabling the use of multiple medical datasets.
在本文中,我们提出了一个框架,使医疗决策在部分信息的存在。其核心是基于本体的自动推理,集成了机器学习技术来增强现有的患者数据集,以解决数据缺失的问题。我们的方法支持不同卫生信息系统之间的互操作性。这在一个结合了三个独立数据集(患者数据、药物-药物相互作用和药物处方规则)的示例实现中得到了澄清,以证明我们的算法在产生有效医疗决策方面的有效性。简而言之,我们展示了机器学习的潜力,通过处理缺失或嘈杂的患者数据并启用多个医疗数据集,来支持医疗专业人员迫切需要的任务。
{"title":"Integrating Machine Learning Into a Medical Decision Support System to Address the Problem of Missing Patient Data","authors":"Atif Khan, J. Doucette, R. Cohen, D. Lizotte","doi":"10.1109/ICMLA.2012.82","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.82","url":null,"abstract":"In this paper, we present a framework which enables medical decision making in the presence of partial information. At its core is ontology-based automated reasoning, machine learning techniques are integrated to enhance existing patient datasets in order to address the issue of missing data. Our approach supports interoperability between different health information systems. This is clarified in a sample implementation that combines three separate datasets (patient data, drug-drug interactions and drug prescription rules) to demonstrate the effectiveness of our algorithms in producing effective medical decisions. In short, we demonstrate the potential for machine learning to support a task where there is a critical need from medical professionals by coping with missing or noisy patient data and enabling the use of multiple medical datasets.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127940325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Comparing Two New Gene Selection Ensemble Approaches with the Commonly-Used Approach 两种新的基因选择集成方法与常用方法的比较
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.175
D. Dittman, T. Khoshgoftaar, Randall Wald, Amri Napolitano
Ensemble feature selection has recently become a topic of interest for researchers, especially in the area of bioinformatics. The benefits of ensemble feature selection include increased feature (gene) subset stability and usefulness as well as comparable (or better) classification performance compared to using a single feature selection method. However, existing work on ensemble feature selection has concentrated on data diversity (using a single feature selection method on multiple datasets or sampled data from a single dataset), neglecting two other potential sources of diversity. We present these two new approaches for gene selection, functional diversity (using multiple feature selection technique on a single dataset) and hybrid (a combination of data and functional diversity). To demonstrate the value of these new approaches, we measure the similarity between the feature subsets created by each of the three approaches across twenty-six datasets and ten feature selection techniques (or an ensemble of these techniques as appropriate). We also compare the classification performance of models built using each of the three ensembles. Our results show that the similarity between the functional diversity and hybrid approaches is much higher than the similarity between either of those and data diversity, with the distinction between data diversity and our new approaches being particularly strong for hard-to-learn datasets. In addition to having the highest similarity, functional and hybrid diversity generally show greater classification performance than data diversity, especially when selecting small feature subsets. These results demonstrate that these new approaches can both provide a different feature subset than the existing approach and that the resulting novel feature subset is potentially of interest to researchers. To our knowledge there has been no study which explores these new approaches to ensemble feature selection within the domain of bioinformatics.
近年来,集成特征选择已成为研究人员感兴趣的话题,特别是在生物信息学领域。与使用单一特征选择方法相比,集成特征选择的好处包括增加特征(基因)子集的稳定性和有用性,以及可比较(或更好)的分类性能。然而,现有的集成特征选择工作主要集中在数据多样性上(对多个数据集或单个数据集的采样数据使用单一特征选择方法),而忽略了其他两个潜在的多样性来源。我们提出了两种新的基因选择方法,功能多样性(在单个数据集上使用多特征选择技术)和杂交(数据和功能多样性的结合)。为了证明这些新方法的价值,我们测量了这三种方法在26个数据集和10种特征选择技术(或适当的这些技术的集合)中创建的特征子集之间的相似性。我们还比较了使用这三种集成方法构建的模型的分类性能。我们的研究结果表明,功能多样性和混合方法之间的相似性远远高于它们与数据多样性之间的相似性,对于难以学习的数据集,数据多样性和我们的新方法之间的区别尤其明显。除了具有最高的相似度外,功能多样性和混合多样性通常比数据多样性表现出更高的分类性能,特别是在选择小特征子集时。这些结果表明,这些新方法可以提供与现有方法不同的特征子集,并且由此产生的新特征子集可能会引起研究人员的兴趣。据我们所知,在生物信息学领域还没有研究探索这些集成特征选择的新方法。
{"title":"Comparing Two New Gene Selection Ensemble Approaches with the Commonly-Used Approach","authors":"D. Dittman, T. Khoshgoftaar, Randall Wald, Amri Napolitano","doi":"10.1109/ICMLA.2012.175","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.175","url":null,"abstract":"Ensemble feature selection has recently become a topic of interest for researchers, especially in the area of bioinformatics. The benefits of ensemble feature selection include increased feature (gene) subset stability and usefulness as well as comparable (or better) classification performance compared to using a single feature selection method. However, existing work on ensemble feature selection has concentrated on data diversity (using a single feature selection method on multiple datasets or sampled data from a single dataset), neglecting two other potential sources of diversity. We present these two new approaches for gene selection, functional diversity (using multiple feature selection technique on a single dataset) and hybrid (a combination of data and functional diversity). To demonstrate the value of these new approaches, we measure the similarity between the feature subsets created by each of the three approaches across twenty-six datasets and ten feature selection techniques (or an ensemble of these techniques as appropriate). We also compare the classification performance of models built using each of the three ensembles. Our results show that the similarity between the functional diversity and hybrid approaches is much higher than the similarity between either of those and data diversity, with the distinction between data diversity and our new approaches being particularly strong for hard-to-learn datasets. In addition to having the highest similarity, functional and hybrid diversity generally show greater classification performance than data diversity, especially when selecting small feature subsets. These results demonstrate that these new approaches can both provide a different feature subset than the existing approach and that the resulting novel feature subset is potentially of interest to researchers. To our knowledge there has been no study which explores these new approaches to ensemble feature selection within the domain of bioinformatics.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129193549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Estimating Software Effort Using an ANN Model Based on Use Case Points 使用基于用例点的人工神经网络模型估算软件工作量
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.138
A. B. Nassif, Luiz Fernando Capretz, D. Ho
In this paper, we propose a novel Artificial Neural Network (ANN) to predict software effort from use case diagrams based on the Use Case Point (UCP) model. The inputs of this model are software size, productivity and complexity, while the output is the predicted software effort. A multiple linear regression model with three independent variables (same inputs of the ANN) and one dependent variable (effort) is also introduced. Our data repository contains 240 data points in which, 214 are industrial and 26 are educational projects. Both the regression and ANN models were trained using 168 data points and tested using 72 data points. The ANN model was evaluated using the MMER and PRED criteria against the regression model, as well as the UCP model that estimates effort from use cases. Results show that the ANN model is a competitive model with respect to other regression models and can be used as an alternative to predict software effort based on the UCP method.
在本文中,我们提出了一种新的人工神经网络(ANN)来预测基于用例点(UCP)模型的用例图的软件工作。该模型的输入是软件的大小、生产力和复杂性,而输出是预测的软件工作量。本文还介绍了一个具有三个自变量(人工神经网络相同输入)和一个因变量(努力)的多元线性回归模型。我们的数据存储库包含240个数据点,其中214个是工业项目,26个是教育项目。回归模型和人工神经网络模型均使用168个数据点进行训练,并使用72个数据点进行测试。ANN模型使用MMER和PRED标准对回归模型进行评估,以及使用UCP模型对用例的工作量进行评估。结果表明,相对于其他回归模型,人工神经网络模型是一个有竞争力的模型,可以作为基于UCP方法预测软件工作量的替代方法。
{"title":"Estimating Software Effort Using an ANN Model Based on Use Case Points","authors":"A. B. Nassif, Luiz Fernando Capretz, D. Ho","doi":"10.1109/ICMLA.2012.138","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.138","url":null,"abstract":"In this paper, we propose a novel Artificial Neural Network (ANN) to predict software effort from use case diagrams based on the Use Case Point (UCP) model. The inputs of this model are software size, productivity and complexity, while the output is the predicted software effort. A multiple linear regression model with three independent variables (same inputs of the ANN) and one dependent variable (effort) is also introduced. Our data repository contains 240 data points in which, 214 are industrial and 26 are educational projects. Both the regression and ANN models were trained using 168 data points and tested using 72 data points. The ANN model was evaluated using the MMER and PRED criteria against the regression model, as well as the UCP model that estimates effort from use cases. Results show that the ANN model is a competitive model with respect to other regression models and can be used as an alternative to predict software effort based on the UCP method.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116681097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Adaptive Selection of Helper-Objectives with Reinforcement Learning 基于强化学习的辅助目标自适应选择
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.159
Arina Buzdalova, M. Buzdalov
In this paper a previously proposed method of choosing auxiliary fitness functions is applied to adaptive selection of helper-objectives. Helper-objectives are used in evolutionary computation to enhance the optimization of the primary objective. The method based on choosing between objectives of a single-objective evolutionary algorithm with reinforcement learning is briefly described. It is tested on a model problem. From the results of the experiment, it can be concluded that the method allows to automatically select the most effective helper-objectives and ignore the ineffective ones. It is also shown that the proposed method outperforms multi-objective evolutionary algorithms, that were used with helper-objectives originally.
本文将先前提出的一种选择辅助适应度函数的方法应用于辅助目标的自适应选择。在进化计算中使用辅助目标来增强对主要目标的优化。简要介绍了一种基于目标选择的单目标强化学习进化算法。它在一个模型问题上进行了测试。实验结果表明,该方法可以自动选择最有效的辅助目标,忽略无效的辅助目标。结果表明,该方法优于原先使用辅助目标的多目标进化算法。
{"title":"Adaptive Selection of Helper-Objectives with Reinforcement Learning","authors":"Arina Buzdalova, M. Buzdalov","doi":"10.1109/ICMLA.2012.159","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.159","url":null,"abstract":"In this paper a previously proposed method of choosing auxiliary fitness functions is applied to adaptive selection of helper-objectives. Helper-objectives are used in evolutionary computation to enhance the optimization of the primary objective. The method based on choosing between objectives of a single-objective evolutionary algorithm with reinforcement learning is briefly described. It is tested on a model problem. From the results of the experiment, it can be concluded that the method allows to automatically select the most effective helper-objectives and ignore the ineffective ones. It is also shown that the proposed method outperforms multi-objective evolutionary algorithms, that were used with helper-objectives originally.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115475869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
O-linked Glycosylation Site Prediction Using Ensemble of Graphical Models 使用图形模型集合预测o链糖基化位点
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.210
A. Sriram, Feng Luo
Prediction of O-linked glycosylation sites in proteins is a challenging problem. In this paper, we introduced a new method to predict glycosylation sites in proteins. First, we built a Markov random field (MRF) to represent the sequence position relationship and model the underlying distribution of glycosylation sites. We then considered glycosylation site prediction as a class imbalance problem and employed the AdaBoost algorithm to improve the predictive performance of the classifier. We applied our method to two types of proteins: the transmembrane (TM) proteins and the non-transmembrane (non-TM) proteins. We showed that for both datasets, our methods outperform existing methods. We also showed that the performance of the system was improved significantly with the help of AdaBoost.
预测蛋白质中o链糖基化位点是一个具有挑战性的问题。本文介绍了一种预测蛋白质糖基化位点的新方法。首先,我们建立了一个马尔可夫随机场(MRF)来表示序列位置关系,并对糖基化位点的潜在分布进行建模。然后,我们将糖基化位点预测视为一个类不平衡问题,并采用AdaBoost算法来提高分类器的预测性能。我们将我们的方法应用于两种类型的蛋白质:跨膜(TM)蛋白质和非跨膜(non-TM)蛋白质。我们表明,对于这两个数据集,我们的方法优于现有的方法。我们还表明,在AdaBoost的帮助下,系统的性能得到了显着提高。
{"title":"O-linked Glycosylation Site Prediction Using Ensemble of Graphical Models","authors":"A. Sriram, Feng Luo","doi":"10.1109/ICMLA.2012.210","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.210","url":null,"abstract":"Prediction of O-linked glycosylation sites in proteins is a challenging problem. In this paper, we introduced a new method to predict glycosylation sites in proteins. First, we built a Markov random field (MRF) to represent the sequence position relationship and model the underlying distribution of glycosylation sites. We then considered glycosylation site prediction as a class imbalance problem and employed the AdaBoost algorithm to improve the predictive performance of the classifier. We applied our method to two types of proteins: the transmembrane (TM) proteins and the non-transmembrane (non-TM) proteins. We showed that for both datasets, our methods outperform existing methods. We also showed that the performance of the system was improved significantly with the help of AdaBoost.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114620230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining Parameter Space Search and Meta-learning for Data-Dependent Computational Agent Recommendation 基于参数空间搜索和元学习的数据依赖计算智能体推荐
Pub Date : 2012-12-12 DOI: 10.1109/ICMLA.2012.137
O. Kazík, K. Pesková, M. Pilát, Roman Neruda
The goal of our data-mining multi-agent system is to facilitate data-mining experiments without the necessary knowledge of the most suitable machine learning method and its parameters to the data. In order to replace the experts knowledge, the meta-learning subsystems are proposed including the parameter-space search and method recommendation based on previous experiments. In this paper we show the results of the parameter-space search with several search algorithms - tabulation, random search, simmulated annealing, and genetic algorithm.
我们的数据挖掘多智能体系统的目标是在不需要最合适的机器学习方法及其数据参数的必要知识的情况下促进数据挖掘实验。为了取代专家知识,在前人实验的基础上提出了参数空间搜索和方法推荐等元学习子系统。本文给出了用几种搜索算法——制表法、随机搜索法、模拟退火法和遗传算法进行参数空间搜索的结果。
{"title":"Combining Parameter Space Search and Meta-learning for Data-Dependent Computational Agent Recommendation","authors":"O. Kazík, K. Pesková, M. Pilát, Roman Neruda","doi":"10.1109/ICMLA.2012.137","DOIUrl":"https://doi.org/10.1109/ICMLA.2012.137","url":null,"abstract":"The goal of our data-mining multi-agent system is to facilitate data-mining experiments without the necessary knowledge of the most suitable machine learning method and its parameters to the data. In order to replace the experts knowledge, the meta-learning subsystems are proposed including the parameter-space search and method recommendation based on previous experiments. In this paper we show the results of the parameter-space search with several search algorithms - tabulation, random search, simmulated annealing, and genetic algorithm.","PeriodicalId":157399,"journal":{"name":"2012 11th International Conference on Machine Learning and Applications","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116145254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2012 11th International Conference on Machine Learning and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1