首页 > 最新文献

2011 10th International Conference on Machine Learning and Applications and Workshops最新文献

英文 中文
Max-Coupled Learning: Application to Breast Cancer 最大耦合学习:在乳腺癌中的应用
Jaime S. Cardoso, Inês Domingues
In the predictive modeling tasks, a clear distinction is often made between learning problems that are supervised or unsupervised, the first involving only labeled data (training patterns with known category labels) while the latter involving only unlabeled data. There is a growing interest in a hybrid setting, called semi-supervised learning, in semi-supervised classification, the labels of only a small portion of the training data set are available. The unlabeled data, instead of being discarded, are also used in the learning process. Motivated by a breast cancer application, in this work we address a new learning task, in-between classification and semi-supervised classification. Each example is described using two different feature sets, not necessarily both observed for a given example. If a single view is observed, then the class is only due to that feature set, if both views are present the observed class label is the maximum of the two values corresponding to the individual views. We propose new learning methodologies adapted to this learning paradigm and experimentally compare them with baseline methods from the conventional supervised and unsupervised settings. The experimental results verify the usefulness of the proposed approaches.
在预测建模任务中,通常会明确区分有监督和无监督的学习问题,前者仅涉及标记数据(具有已知类别标签的训练模式),而后者仅涉及未标记数据。人们对一种叫做半监督学习的混合设置越来越感兴趣,在半监督分类中,只有一小部分训练数据集的标签是可用的。未标记的数据,而不是被丢弃,也在学习过程中使用。在乳腺癌应用的激励下,在这项工作中,我们提出了一个新的学习任务,介于分类和半监督分类之间。每个示例都使用两个不同的特性集来描述,而不一定对给定示例都观察到两个特性集。如果观察到一个视图,那么这个类只属于那个特征集,如果两个视图都存在,那么观察到的类标签是对应于单个视图的两个值的最大值。我们提出了适应这种学习范式的新学习方法,并通过实验将它们与传统监督和无监督设置的基线方法进行比较。实验结果验证了所提方法的有效性。
{"title":"Max-Coupled Learning: Application to Breast Cancer","authors":"Jaime S. Cardoso, Inês Domingues","doi":"10.1109/ICMLA.2011.93","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.93","url":null,"abstract":"In the predictive modeling tasks, a clear distinction is often made between learning problems that are supervised or unsupervised, the first involving only labeled data (training patterns with known category labels) while the latter involving only unlabeled data. There is a growing interest in a hybrid setting, called semi-supervised learning, in semi-supervised classification, the labels of only a small portion of the training data set are available. The unlabeled data, instead of being discarded, are also used in the learning process. Motivated by a breast cancer application, in this work we address a new learning task, in-between classification and semi-supervised classification. Each example is described using two different feature sets, not necessarily both observed for a given example. If a single view is observed, then the class is only due to that feature set, if both views are present the observed class label is the maximum of the two values corresponding to the individual views. We propose new learning methodologies adapted to this learning paradigm and experimentally compare them with baseline methods from the conventional supervised and unsupervised settings. The experimental results verify the usefulness of the proposed approaches.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114776619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
MPI-based Parallelization for ILP-based Multi-relational Concept Discovery 基于mpi的多关系概念发现的并行化
Alev Mutlu, P. Senkul, Y. Kavurucu
Multi-relational concept discovery is a predictive learning task that aims to discover descriptions of a target concept in the light of past experiences. Parallelization has emerged as a solution to deal with efficiency and scalability issues relating to large search spaces in concept discovery systems. In this work, we describe a parallelization method for the ILP-based concept discovery system called CRIS. CRIS is modified in such a way that steps involving high query processing are reorganized in a data parallel way. To evaluate the performance of the resulting system, called P-CRIS, a set of experiments is conducted.
多关系概念发现是一种预测性学习任务,旨在根据过去的经验发现目标概念的描述。并行化已经成为解决概念发现系统中与大型搜索空间相关的效率和可伸缩性问题的一种解决方案。在这项工作中,我们描述了一种基于ilp的概念发现系统的并行化方法,称为CRIS。CRIS的修改方式是,以数据并行的方式重新组织涉及高查询处理的步骤。为了评估最终系统(称为P-CRIS)的性能,进行了一系列实验。
{"title":"MPI-based Parallelization for ILP-based Multi-relational Concept Discovery","authors":"Alev Mutlu, P. Senkul, Y. Kavurucu","doi":"10.1109/ICMLA.2011.98","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.98","url":null,"abstract":"Multi-relational concept discovery is a predictive learning task that aims to discover descriptions of a target concept in the light of past experiences. Parallelization has emerged as a solution to deal with efficiency and scalability issues relating to large search spaces in concept discovery systems. In this work, we describe a parallelization method for the ILP-based concept discovery system called CRIS. CRIS is modified in such a way that steps involving high query processing are reorganized in a data parallel way. To evaluate the performance of the resulting system, called P-CRIS, a set of experiments is conducted.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121989366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empirical Normalization for Quadratic Discriminant Analysis and Classifying Cancer Subtypes 二次判别分析与癌症亚型分类的经验归一化
M. Kon, Nikolay Nikolaev
We introduce a new discriminant analysis method (Empirical Discriminant Analysis or EDA) for binary classification in machine learning. Given a dataset of feature vectors, this method defines an empirical feature map transforming the training and test data into new data with components having Gaussian empirical distributions. This map is an empirical version of the Gaussian copula used in probability and mathematical finance. The purpose is to form a feature mapped dataset as close as possible to Gaussian, after which standard quadratic discriminants can be used for classification. We discuss this method in general, and apply it to some datasets in computational biology.
本文提出了一种新的判别分析方法(Empirical discriminant analysis, EDA),用于机器学习中的二分类。给定特征向量的数据集,该方法定义一个经验特征映射,将训练和测试数据转换成具有高斯经验分布的新数据。这张图是概率论和数学金融学中使用的高斯联结公式的经验版本。目的是形成一个尽可能接近高斯的特征映射数据集,之后可以使用标准二次判别器进行分类。我们一般讨论了这种方法,并将其应用于计算生物学中的一些数据集。
{"title":"Empirical Normalization for Quadratic Discriminant Analysis and Classifying Cancer Subtypes","authors":"M. Kon, Nikolay Nikolaev","doi":"10.1109/ICMLA.2011.160","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.160","url":null,"abstract":"We introduce a new discriminant analysis method (Empirical Discriminant Analysis or EDA) for binary classification in machine learning. Given a dataset of feature vectors, this method defines an empirical feature map transforming the training and test data into new data with components having Gaussian empirical distributions. This map is an empirical version of the Gaussian copula used in probability and mathematical finance. The purpose is to form a feature mapped dataset as close as possible to Gaussian, after which standard quadratic discriminants can be used for classification. We discuss this method in general, and apply it to some datasets in computational biology.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125967549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Frequent Substring-Based Sequence Classification with an Ensemble of Support Vector Machines Trained Using Reduced Amino Acid Alphabets 基于频繁子字符串的序列分类,支持向量机集合使用约简氨基酸字母表训练
Charith D. Chitraranjan, Loai Al Nimer, O. Azzam, Saeed Salem, A. Denton, M. Iqbal, S. Kianian
We propose a frequent pattern-based algorithm for predicting functions and localizations of proteins from their primary structure (amino acid sequence). We use reduced alphabets that capture the higher rate of substitution between amino acids that are physiochemically similar. Frequent sub strings are mined from the training sequences, transformed into different alphabets, and used as features to train an ensemble of SVMs. We evaluate the performance of our algorithm using protein sub-cellular localization and protein function datasets. Pair-wise sequence-alignment-based nearest neighbor and basic SVM k-gram classifiers are included as comparison algorithms. Results show that the frequent sub string-based SVM classifier demonstrates better performance compared with other classifiers on the sub-cellular localization datasets and it performs competitively with the nearest neighbor classifier on the protein function datasets. Our results also show that the use of reduced alphabets provides statistically significant performance improvements for half of the classes studied.
我们提出了一种基于频繁模式的算法,用于预测蛋白质的初级结构(氨基酸序列)的功能和定位。我们使用简化的字母表来捕获物理化学上相似的氨基酸之间更高的取代率。从训练序列中挖掘频繁子字符串,转换成不同的字母,并将其用作特征来训练支持向量机集合。我们使用蛋白质亚细胞定位和蛋白质功能数据集来评估算法的性能。比较算法包括基于成对序列对齐的最近邻和基本SVM k-gram分类器。结果表明,基于频繁子字符串的SVM分类器在亚细胞定位数据集上的性能优于其他分类器,在蛋白质功能数据集上的性能优于最近邻分类器。我们的结果还表明,使用简化的字母为所研究的一半的类提供了统计上显着的性能改进。
{"title":"Frequent Substring-Based Sequence Classification with an Ensemble of Support Vector Machines Trained Using Reduced Amino Acid Alphabets","authors":"Charith D. Chitraranjan, Loai Al Nimer, O. Azzam, Saeed Salem, A. Denton, M. Iqbal, S. Kianian","doi":"10.1109/ICMLA.2011.71","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.71","url":null,"abstract":"We propose a frequent pattern-based algorithm for predicting functions and localizations of proteins from their primary structure (amino acid sequence). We use reduced alphabets that capture the higher rate of substitution between amino acids that are physiochemically similar. Frequent sub strings are mined from the training sequences, transformed into different alphabets, and used as features to train an ensemble of SVMs. We evaluate the performance of our algorithm using protein sub-cellular localization and protein function datasets. Pair-wise sequence-alignment-based nearest neighbor and basic SVM k-gram classifiers are included as comparison algorithms. Results show that the frequent sub string-based SVM classifier demonstrates better performance compared with other classifiers on the sub-cellular localization datasets and it performs competitively with the nearest neighbor classifier on the protein function datasets. Our results also show that the use of reduced alphabets provides statistically significant performance improvements for half of the classes studied.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126045871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Looking Beyond Genres: Identifying Meaningful Semantic Layers from Tags in Online Music Collections 超越体裁:从在线音乐收藏的标签中识别有意义的语义层
R. Ferrer, T. Eerola
A scheme for identifying the semantic layers of music-related tags is presented. Arguments are provided why the applications of the tags cannot be effectively pursued without a reasonable understanding of their semantic qualities. The identification scheme consists of a set of filters. The first is related with social consensus, user-count ratio, and n-gram properties of tags. The next relies on look-up functions across multiple databases to determine the probable semantic layer of each tag. Examples of the semantic layers with prevalence rates are given based on application of the scheme to a subset of the Million Song Dataset. Finally, a validation of the results was carried out with an independent, smaller hand-annotated dataset, in which high agreement between the identification provided by the scheme and annotations was found.
提出了一种识别音乐相关标签语义层的方案。在没有对标签语义质量的合理理解的情况下,为什么标签的应用不能有效地进行论证。该识别方案由一组过滤器组成。第一个与社会共识、用户计数比率和标签的n-gram属性有关。下一个依赖于跨多个数据库的查找函数来确定每个标记的可能语义层。基于该方案在百万首歌曲数据集子集上的应用,给出了具有流行率的语义层示例。最后,用一个独立的、较小的手工标注数据集对结果进行验证,发现该方案提供的识别与标注之间的一致性很高。
{"title":"Looking Beyond Genres: Identifying Meaningful Semantic Layers from Tags in Online Music Collections","authors":"R. Ferrer, T. Eerola","doi":"10.1109/ICMLA.2011.89","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.89","url":null,"abstract":"A scheme for identifying the semantic layers of music-related tags is presented. Arguments are provided why the applications of the tags cannot be effectively pursued without a reasonable understanding of their semantic qualities. The identification scheme consists of a set of filters. The first is related with social consensus, user-count ratio, and n-gram properties of tags. The next relies on look-up functions across multiple databases to determine the probable semantic layer of each tag. Examples of the semantic layers with prevalence rates are given based on application of the scheme to a subset of the Million Song Dataset. Finally, a validation of the results was carried out with an independent, smaller hand-annotated dataset, in which high agreement between the identification provided by the scheme and annotations was found.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131556374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Incremental Learning Based on Growing Gaussian Mixture Models 基于增长高斯混合模型的增量学习
A. Bouchachia, C. Vanaret
Incremental learning aims at equipping data-driven systems with self-monitoring and self-adaptation mechanisms to accommodate new data in an online setting. The resulting model underlying the system can be adjusted whenever data become available. The present paper proposes a new incremental learning algorithm, called 2G2M, to learn Growing Gaussian Mixture Models. The algorithm is furnished with abilities (1) to accommodate data online, (2) to maintain low complexity of the model, and (3) to reconcile labeled and unlabeled data. To discuss the efficiency of the proposed incremental learning algorithm, an empirical evaluation is provided.
增量学习旨在为数据驱动的系统配备自我监测和自适应机制,以适应在线环境中的新数据。当数据可用时,可以调整系统底层的生成模型。本文提出了一种新的增量学习算法,称为2G2M,用于学习高斯混合增长模型。该算法具有以下能力:(1)适应在线数据;(2)保持模型的低复杂度;(3)调和标记和未标记数据。为了讨论所提出的增量学习算法的效率,提供了一个经验评估。
{"title":"Incremental Learning Based on Growing Gaussian Mixture Models","authors":"A. Bouchachia, C. Vanaret","doi":"10.1109/ICMLA.2011.79","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.79","url":null,"abstract":"Incremental learning aims at equipping data-driven systems with self-monitoring and self-adaptation mechanisms to accommodate new data in an online setting. The resulting model underlying the system can be adjusted whenever data become available. The present paper proposes a new incremental learning algorithm, called 2G2M, to learn Growing Gaussian Mixture Models. The algorithm is furnished with abilities (1) to accommodate data online, (2) to maintain low complexity of the model, and (3) to reconcile labeled and unlabeled data. To discuss the efficiency of the proposed incremental learning algorithm, an empirical evaluation is provided.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131323968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
Bayesian Embedding of Co-occurrence Data for Query-Based Visualization 基于查询的可视化共现数据贝叶斯嵌入
Mohammad Khoshneshin, W. Street, P. Srinivasan
We propose a generative probabilistic model for visualizing co-occurrence data. In co-occurrence data, there are a number of entities and the data includes the frequency of two entities co-occurring. We propose a Bayesian approach to infer the latent variables. Given the intractability of inference for the posterior distribution, we use approximate inference via variational approaches. The proposed Bayesian approach enables accurate embedding in high-dimensional space which is not useful for visualization. Therefore, we propose a method to embed a filtered number of entities for a query -- query-based visualization. Our experiments show that our proposed models outperform co-occurrence data embedding, the state-of-the-art model for visualizing co-occurrence data.
我们提出了一个生成概率模型来可视化共现数据。在共现数据中,存在多个实体,数据中包含两个实体共现的频率。我们提出了一种贝叶斯方法来推断潜在变量。考虑到后验分布推理的难处,我们通过变分方法使用近似推理。所提出的贝叶斯方法可以在高维空间中精确嵌入,而这不利于可视化。因此,我们提出了一种为查询嵌入经过筛选的实体数量的方法——基于查询的可视化。我们的实验表明,我们提出的模型优于共现数据嵌入,这是最先进的共现数据可视化模型。
{"title":"Bayesian Embedding of Co-occurrence Data for Query-Based Visualization","authors":"Mohammad Khoshneshin, W. Street, P. Srinivasan","doi":"10.1109/ICMLA.2011.42","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.42","url":null,"abstract":"We propose a generative probabilistic model for visualizing co-occurrence data. In co-occurrence data, there are a number of entities and the data includes the frequency of two entities co-occurring. We propose a Bayesian approach to infer the latent variables. Given the intractability of inference for the posterior distribution, we use approximate inference via variational approaches. The proposed Bayesian approach enables accurate embedding in high-dimensional space which is not useful for visualization. Therefore, we propose a method to embed a filtered number of entities for a query -- query-based visualization. Our experiments show that our proposed models outperform co-occurrence data embedding, the state-of-the-art model for visualizing co-occurrence data.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125521247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Book Recommendation Signage System Using Silhouette-Based Gait Classification 基于轮廓的步态分类图书推荐标识系统
M. Mikawa, S. Izumi, Kazuyo Tanaka
A library creates new services for attracting library users continuously. This paper presents a new book recommendation digital signage system. The system classifies characteristics such as gender or age of a walking library user, and displays a recommended book on an LCD for him/her. A set of silhouette image sequence of a walker extracted from real-time video is used for classification with Support Vector Machine (SVM). Since a calculation amount of a silhouette-based classification method is less than a three-dimensional model-based classification, it is suitable for real-time classification. We design a classifier that has better performance by evaluating some parameters and image features for classification. Some experimental results reveal the validity and effectiveness of our proposed signage system.
图书馆不断创造新的服务,吸引读者。本文提出了一种新的图书推荐数字标牌系统。该系统根据行走的图书馆使用者的性别、年龄等特征进行分类,并在液晶显示器上显示推荐的图书。从实时视频中提取一组行走者的轮廓图像序列,利用支持向量机(SVM)进行分类。由于基于轮廓的分类方法计算量小于基于三维模型的分类方法,因此适合于实时分类。我们通过评估一些参数和图像特征来设计一个性能更好的分类器。实验结果表明了该系统的有效性。
{"title":"Book Recommendation Signage System Using Silhouette-Based Gait Classification","authors":"M. Mikawa, S. Izumi, Kazuyo Tanaka","doi":"10.1109/ICMLA.2011.43","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.43","url":null,"abstract":"A library creates new services for attracting library users continuously. This paper presents a new book recommendation digital signage system. The system classifies characteristics such as gender or age of a walking library user, and displays a recommended book on an LCD for him/her. A set of silhouette image sequence of a walker extracted from real-time video is used for classification with Support Vector Machine (SVM). Since a calculation amount of a silhouette-based classification method is less than a three-dimensional model-based classification, it is suitable for real-time classification. We design a classifier that has better performance by evaluating some parameters and image features for classification. Some experimental results reveal the validity and effectiveness of our proposed signage system.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131925141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Hybrid Evolution of Convolutional Networks 卷积网络的混合进化
Brian Cheung, Carl Sable
With the increasing trend of neural network models towards larger structures with more layers, we expect a corresponding exponential increase in the number of possible architectures. In this paper, we apply a hybrid evolutionary search procedure to define the initialization and architectural parameters of convolutional networks, one of the first successful deep network models. We make use of stochastic diagonal Levenberg-Marquardt to accelerate the convergence of training, lowering the time cost of fitness evaluation. Using parameters found from the evolutionary search together with absolute value and local contrast normalization preprocessing between layers, we achieve the best known performance on several of the MNIST Variations, rectangles-image and convex image datasets.
随着神经网络模型越来越趋向于更大、更多层的结构,我们预计可能的结构数量会相应呈指数增长。在本文中,我们应用混合进化搜索程序来定义卷积网络的初始化和结构参数,卷积网络是最早成功的深度网络模型之一。我们利用随机对角线Levenberg-Marquardt来加速训练的收敛,降低适应度评估的时间成本。使用从进化搜索中找到的参数以及层间的绝对值和局部对比度归一化预处理,我们在几种MNIST变量、矩形图像和凸图像数据集上实现了最佳性能。
{"title":"Hybrid Evolution of Convolutional Networks","authors":"Brian Cheung, Carl Sable","doi":"10.1109/ICMLA.2011.73","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.73","url":null,"abstract":"With the increasing trend of neural network models towards larger structures with more layers, we expect a corresponding exponential increase in the number of possible architectures. In this paper, we apply a hybrid evolutionary search procedure to define the initialization and architectural parameters of convolutional networks, one of the first successful deep network models. We make use of stochastic diagonal Levenberg-Marquardt to accelerate the convergence of training, lowering the time cost of fitness evaluation. Using parameters found from the evolutionary search together with absolute value and local contrast normalization preprocessing between layers, we achieve the best known performance on several of the MNIST Variations, rectangles-image and convex image datasets.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128875105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Fault Detection through Sequential Filtering of Novelty Patterns 基于新模式序列滤波的故障检测
John Cuzzola, D. Gašević, E. Bagheri
Multi-threaded applications are commonplace in today's software landscape. Pushing the boundaries of concurrency and parallelism, programmers are maximizing performance demanded by stakeholders. However, multi-threaded programs are challenging to test and debug. Prone to their own set of unique faults, such as race conditions, testers need to turn to automated validation tools for assistance. This paper's main contribution is a new algorithm called multi-stage novelty filtering (MSNF) that can aid in the discovery of software faults. MSNF stresses minimal configuration, no domain specific data preprocessing or software metrics. The MSNF approach is based on a multi-layered support vector machine scheme. After experimentation with the MSNF algorithm, we observed promising results in terms of precision. However, MSNF relies on multiple iterations (i.e., stages). Here, we propose four different strategies for estimating the number of the requested stages.
多线程应用程序在当今的软件环境中很常见。程序员不断突破并发性和并行性的界限,最大化了涉众所要求的性能。然而,多线程程序在测试和调试方面具有挑战性。测试人员容易出现他们自己的一组独特的错误,比如竞态条件,他们需要求助于自动验证工具。本文的主要贡献是一种称为多阶段新颖性滤波(MSNF)的新算法,它可以帮助发现软件故障。MSNF强调最小的配置,没有特定领域的数据预处理或软件指标。MSNF方法基于多层支持向量机方案。通过对MSNF算法的实验,我们观察到在精度方面有希望的结果。然而,MSNF依赖于多个迭代(即阶段)。在这里,我们提出了四种不同的策略来估计所请求阶段的数量。
{"title":"Fault Detection through Sequential Filtering of Novelty Patterns","authors":"John Cuzzola, D. Gašević, E. Bagheri","doi":"10.1109/ICMLA.2011.69","DOIUrl":"https://doi.org/10.1109/ICMLA.2011.69","url":null,"abstract":"Multi-threaded applications are commonplace in today's software landscape. Pushing the boundaries of concurrency and parallelism, programmers are maximizing performance demanded by stakeholders. However, multi-threaded programs are challenging to test and debug. Prone to their own set of unique faults, such as race conditions, testers need to turn to automated validation tools for assistance. This paper's main contribution is a new algorithm called multi-stage novelty filtering (MSNF) that can aid in the discovery of software faults. MSNF stresses minimal configuration, no domain specific data preprocessing or software metrics. The MSNF approach is based on a multi-layered support vector machine scheme. After experimentation with the MSNF algorithm, we observed promising results in terms of precision. However, MSNF relies on multiple iterations (i.e., stages). Here, we propose four different strategies for estimating the number of the requested stages.","PeriodicalId":439926,"journal":{"name":"2011 10th International Conference on Machine Learning and Applications and Workshops","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125716944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2011 10th International Conference on Machine Learning and Applications and Workshops
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1